My Photo

Favorite Links

Related blogs

Machine Learning (Theory)

Blog powered by TypePad

Other links

  • Listed on BlogShares
My Squidoo Lens

« Can a computer think? | Main | Bayesian brain »

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8342dcba453ef00d834c3726069e2

Listed below are links to weblogs that reference When does sparsity occur?:

Comments

lorenzo

hi! i read with pleasure your weblog. i am not sure i understood your intuition on sparsity, but a paper i recently read which seems to go in this direction is an old paper by federico girosi:

An equivalence between sparse approximation and support vector machines

i think that the interesting point is that one have some insight on the role played by loss function and penalty term with respect to sparsity.

pierre

Hello,
If I followed, you ask "Why sparse model have good properties?" and on the other hand your intuition is "unregularity of optimised criteria implies sparsity".

So even if your intuition is good, you don't answer to the question, do you ? The question is justed shifted to "why unregularity of criteria implies good popreties". (in paticular statitical robusteness) ???

From another point of view, would it be correct to link sparcity with short description lenth, simplicty, Occam razor and then good generalisation ???


Yaroslav Bulatov

Encouraging sparsity increases bias, and hence reduces the variance. This is a good thing to do if you think that in your parametrization setting most parameter to 0 won't hurt you too much.

Andrew Ng showed in his "rotational invariance" paper that if a fixed number of features is needed to predict well, the sample complexity of L1 regularized logistic regression grows logarithmically in the number of irrelevant features, but L2 or any other rotationally invariant regularization scheme grows at least linearly. http://ai.stanford.edu/~ang/papers/icml04-l1l2.pdf

There's an issue of parametrization dependence: encouraging sparsity in one parametrization may be a good thing while in a reparametrized version of the problem be terrible.

So this raises the question -- how can we come up with parametrizations where sparse solutions are not very far from best solution?

We could let the "best solution" be chosen adversarially. Then for discrete distributions, the uniform distribution minimizes worst loss, so it makes sense that we pick a parametrization where parameter vector=0 corresponds to uniform distribution. But what will the rest of parametrization look like?

Truncated series expansions are a kind of "sparse approximations" for functions, so perhaps there are results in approximation theory that answer this?

zwald

Hi Olivier, I am not sure to understand your intuition but I have a question. I was wondering if the Kernel Projection Machine (NIPS'04) is a sparse algorithm or not.

Olivier Cappé

Bonjour Olivier,

It does seem to me that your intuition is more than vaguely related to some (non-ML) works by Mila Nikolova (curr. in ENS Cachan), in particular:

Local strong homogeneity of a regularized estimator, SIAM Journal on Applied Mathematics, vol. 61, no. 2, 2000, pp. 633-658
http://www.cmla.ens-cachan.fr/Utilisateurs/nikolova/Hom.pdf

where she shows that in image reconstruction problems, the fact that a nonsmooth at zero penalty function (regularization term) is used is instrumental in obtaining reconstructed signals that display, in her terminology, strongly homogeneous zones (of which neighbouring constant values is a typical example).

Benjamin Guo

Seems fairly a long time discussion. I like your intuition. Well, my question is, since computing exponential is not that expensive (I mean computing kernel functions, e.g. in the case of Gaussian), how can sparsity be useful other than balancing the bias and variance?

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment