A Convex Optimization Framework for Active Learning
Ehsan Elhamifar, Guillermo Sapiro, Allen Yang, Shankar S. Sastry

Citation
Ehsan Elhamifar, Guillermo Sapiro, Allen Yang, Shankar S. Sastry. "A Convex Optimization Framework for Active Learning". International Conference on Computer Vision (ICCV), 2013.

Abstract
In many image/video/web classification problems, we have access to a large number of unlabeled samples. However, it is typically expensive and time consuming to obtain labels for the samples. Active learning is the problem of progressively selecting and annotating the most informative unlabeled samples, in order to obtain a high classification performance. Most existing active learning algorithms select only one sample at a time prior to retraining the classifier. Hence, they are computationally expensive and cannot take advantage of parallel labeling systems such as Mechanical Turk. On the other hand, algorithms that allow the selection of multiple samples prior to retraining the classifier, may select samples that have significant information overlap or they involve solving a non-convex optimization. More importantly, the majority of active learning algorithms are developed for a certain classifier type such as SVM. In this paper, we develop an efficient active learning framework based on convex programming, which can select multiple samples at a time for annotation. Unlike the state of the art, our algorithm can be used in conjunction with any type of classifiers, including those of the family of the recently proposed Sparse Representation-based Classification (SRC). We use the two principles of classifier uncertainty and sample diversity in order to guide the optimization program towards selecting the most informative unlabeled samples, which have the least information overlap. Our method can incorporate the data distribution in the selection process by using the appropriate dissimilarity between pairs of samples. We show the effectiveness of our framework in person detection, scene categorization and face recognition on real-world datasets.

Electronic downloads

Citation formats  
  • HTML
    Ehsan Elhamifar, Guillermo Sapiro, Allen Yang, Shankar S.
    Sastry. <a
    href="http://robotics.eecs.berkeley.edu/pubs/2.html"
    >A Convex Optimization Framework for Active
    Learning</a>, International Conference on Computer
    Vision (ICCV), 2013.
  • Plain text
    Ehsan Elhamifar, Guillermo Sapiro, Allen Yang, Shankar S.
    Sastry. "A Convex Optimization Framework for Active
    Learning". International Conference on Computer Vision
    (ICCV), 2013.
  • BibTeX
    @inproceedings{ElhamifarSapiroYangSastry13_ConvexOptimizationFrameworkForActiveLearning,
        author = {Ehsan Elhamifar and Guillermo Sapiro and Allen
                  Yang and Shankar S. Sastry},
        title = {A Convex Optimization Framework for Active Learning},
        booktitle = {International Conference on Computer Vision (ICCV)},
        year = {2013},
        abstract = {In many image/video/web classification problems,
                  we have access to a large number of unlabeled
                  samples. However, it is typically expensive and
                  time consuming to obtain labels for the samples.
                  Active learning is the problem of progressively
                  selecting and annotating the most informative
                  unlabeled samples, in order to obtain a high
                  classification performance. Most existing active
                  learning algorithms select only one sample at a
                  time prior to retraining the classifier. Hence,
                  they are computationally expensive and cannot take
                  advantage of parallel labeling systems such as
                  Mechanical Turk. On the other hand, algorithms
                  that allow the selection of multiple samples prior
                  to retraining the classifier, may select samples
                  that have significant information overlap or they
                  involve solving a non-convex optimization. More
                  importantly, the majority of active learning
                  algorithms are developed for a certain classifier
                  type such as SVM. In this paper, we develop an
                  efficient active learning framework based on
                  convex programming, which can select multiple
                  samples at a time for annotation. Unlike the state
                  of the art, our algorithm can be used in
                  conjunction with any type of classifiers,
                  including those of the family of the recently
                  proposed Sparse Representation-based
                  Classification (SRC). We use the two principles of
                  classifier uncertainty and sample diversity in
                  order to guide the optimization program towards
                  selecting the most informative unlabeled samples,
                  which have the least information overlap. Our
                  method can incorporate the data distribution in
                  the selection process by using the appropriate
                  dissimilarity between pairs of samples. We show
                  the effectiveness of our framework in person
                  detection, scene categorization and face
                  recognition on real-world datasets.},
        URL = {http://robotics.eecs.berkeley.edu/pubs/2.html}
    }
    

Posted by Ehsan Elhamifar on 7 May 2014.
Groups: ehumans
For additional information, see the Publications FAQ or contact webmaster at robotics eecs berkeley edu.

Notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright.