Skip to main content
Cornell University
We gratefully acknowledge support from
the Simons Foundation and member institutions.
arxiv logo > cs > arXiv:1412.7839

Help | Advanced Search

Computer Science > Machine Learning

(cs)
[Submitted on 25 Dec 2014 (v1), last revised 17 Aug 2015 (this version, v2)]

Title:Cloud K-SVD: A Collaborative Dictionary Learning Algorithm for Big, Distributed Data

Authors:Haroon Raja, Waheed U. Bajwa
Download PDF
Abstract: This paper studies the problem of data-adaptive representations for big, distributed data. It is assumed that a number of geographically-distributed, interconnected sites have massive local data and they are interested in collaboratively learning a low-dimensional geometric structure underlying these data. In contrast to previous works on subspace-based data representations, this paper focuses on the geometric structure of a union of subspaces (UoS). In this regard, it proposes a distributed algorithm---termed cloud K-SVD---for collaborative learning of a UoS structure underlying distributed data of interest. The goal of cloud K-SVD is to learn a common overcomplete dictionary at each individual site such that every sample in the distributed data can be represented through a small number of atoms of the learned dictionary. Cloud K-SVD accomplishes this goal without requiring exchange of individual samples between sites. This makes it suitable for applications where sharing of raw data is discouraged due to either privacy concerns or large volumes of data. This paper also provides an analysis of cloud K-SVD that gives insights into its properties as well as deviations of the dictionaries learned at individual sites from a centralized solution in terms of different measures of local/global data and topology of interconnections. Finally, the paper numerically illustrates the efficacy of cloud K-SVD on real and synthetic distributed data.
Comments: Accepted for Publication in IEEE Trans. Signal Processing (2015); 16 pages, 3 figures
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as: arXiv:1412.7839 [cs.LG]
  (or arXiv:1412.7839v2 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.1412.7839
arXiv-issued DOI via DataCite
Journal reference: IEEE Trans. Signal Processing, vol. 64, no. 1, pp. 173-188, Jan. 2016
Related DOI: https://doi.org/10.1109/TSP.2015.2472372
DOI(s) linking to related resources

Submission history

From: Waheed Bajwa [view email]
[v1] Thu, 25 Dec 2014 17:01:52 UTC (269 KB)
[v2] Mon, 17 Aug 2015 21:27:03 UTC (107 KB)
Full-text links:

Download:

  • PDF
  • Other formats
(license)
Current browse context:
cs.LG
< prev   |   next >
new | recent | 1412
Change to browse by:
cs
cs.IT
math
math.IT
stat
stat.ML

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar

DBLP - CS Bibliography

listing | bibtex
Haroon Raja
Waheed U. Bajwa
a export bibtex citation Loading...

Bookmark

BibSonomy logo Mendeley logo Reddit logo ScienceWISE logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack