Statistics – Machine Learning
Scientific paper
2012-04-20
Statistics
Machine Learning
38 pages
Scientific paper
We consider supervised learning problems where the features are embedded in a graph, such as gene expressions in a gene network. In this context, it is of much interest to take into account the problem structure, and automatically select a subgraph with a small number of connected components. By exploiting prior knowledge, one can indeed improve the prediction performance and/or obtain better interpretable results. Regularization or penalty functions for selecting features in graphs have recently been proposed but they raise new algorithmic challenges. For example, they typically require solving a combinatorially hard selection problem among all connected subgraphs. In this paper, we propose computationally feasible strategies to select a sparse and "well connected" subset of features sitting on a directed acyclic graph (DAG). We introduce structured sparsity penalties over paths on a DAG called "path coding" penalties. Unlike existing regularization functions, path coding penalties can both model long range interactions between features in the graph and be tractable. The penalties and their proximal operators involve path selection problems, which we efficiently solve by leveraging network flow optimization. We experimentally show on synthetic, image, and genomic data that our approach is scalable and lead to more connected subgraphs than other regularization functions for graphs.
Mairal Julien
Yu Bin
No associations
LandOfFree
Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-5278