[Submitted on 27 Feb 2017]

Title: Communication-efficient Algorithms for Distributed Stochastic Principal Component Analysis

Authors: Dan Garber, Ohad Shamir, Nathan Srebro
Abstract: We study the fundamental problem of Principal Component Analysis in a statistical distributed setting in which each of m machines stores a sample of n points drawn i.i.d. from a single unknown distribution. We study algorithms for estimating the leading principal component of the population covariance matrix that are both communication-efficient and achieve estimation error on the order of that of the centralized ERM solution, which uses all mn samples. On the negative side, we show that, in contrast to results obtained for distributed estimation under convexity assumptions, for the PCA objective simply averaging the local ERM solutions cannot guarantee error that is consistent with the centralized ERM. We show that this unfortunate phenomenon can be remedied by performing a simple correction step which correlates the individual solutions, and which provides an estimator that is consistent with the centralized ERM for sufficiently large n. We also introduce an iterative distributed algorithm, based on distributed matrix-vector products, that is applicable in any regime of n. The algorithm gives significant acceleration in terms of communication rounds over previous distributed algorithms, in a wide regime of parameters.
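
The two ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and it rests on assumptions the abstract does not state: the "correction step" is taken to mean aligning the sign of each local eigenvector with that of a reference machine before averaging, and the iterative method is replaced by plain power iteration built from distributed matrix-vector products, used here only as a simple stand-in. All function names, the helper make_shards, and the synthetic data are hypothetical.

```python
import numpy as np

def make_shards(m, n, d, seed=0):
    """Hypothetical synthetic data: m machines, n points each, in d dimensions.
    The first coordinate has variance 4, all others variance 1, so the
    leading principal component of the population covariance is +/- e_1."""
    rng = np.random.default_rng(seed)
    scales = np.ones(d)
    scales[0] = 2.0
    return [rng.standard_normal((n, d)) * scales for _ in range(m)]

def local_top_eigvec(X):
    """Local ERM solution: leading eigenvector of the local covariance."""
    cov = X.T @ X / X.shape[0]
    _, vecs = np.linalg.eigh(cov)  # eigenvalues are returned in ascending order
    return vecs[:, -1]

def naive_average(local_vecs):
    """Average the local solutions as-is, then renormalize.
    Eigenvectors are only defined up to sign, so terms may cancel."""
    avg = np.mean(local_vecs, axis=0)
    return avg / np.linalg.norm(avg)

def sign_corrected_average(local_vecs):
    """Assumed correction step: flip each local vector so that it has a
    non-negative inner product with the first machine's vector, then average."""
    ref = local_vecs[0]
    signs = np.sign(local_vecs @ ref)
    signs[signs == 0] = 1.0
    avg = np.mean(signs[:, None] * local_vecs, axis=0)
    return avg / np.linalg.norm(avg)

def distributed_power_iteration(shards, iters=50, seed=0):
    """Stand-in iterative method (plain power iteration, not the paper's
    algorithm). Each round, every machine computes X^T (X w) locally and
    the coordinator sums the resulting d-dimensional vectors."""
    rng = np.random.default_rng(seed)
    w = rng.standard_normal(shards[0].shape[1])
    w /= np.linalg.norm(w)
    total = sum(X.shape[0] for X in shards)
    for _ in range(iters):
        w = sum(X.T @ (X @ w) for X in shards) / total
        w /= np.linalg.norm(w)
    return w

if __name__ == "__main__":
    m, n, d = 20, 500, 10
    shards = make_shards(m, n, d)
    local_vecs = np.array([local_top_eigvec(X) for X in shards])
    # Simulate the sign ambiguity: each machine may report v or -v.
    flips = np.random.default_rng(1).choice([-1.0, 1.0], size=(m, 1))
    local_vecs = local_vecs * flips
    print("naive average, |<w, e_1>|:         ", abs(naive_average(local_vecs)[0]))
    print("sign-corrected average, |<w, e_1>|:", abs(sign_corrected_average(local_vecs)[0]))
    print("power iteration, |<w, e_1>|:       ", abs(distributed_power_iteration(shards)[0]))
```

In this sketch the one-shot estimators need a single round in which each machine sends one d-dimensional vector, while the iterative stand-in sends one such vector per machine per round. How many rounds the paper's own algorithm needs is not stated in the abstract and is not modeled here.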
Subjects: Machine Learning (cs.LG)
Cite as: arXiv:1702.08169 [cs.LG]
  (or arXiv:1702.08169v1 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.1702.08169
arXiv-issued DOI via DataCite

Submission history

From: Dan Garber
[v1] Mon, 27 Feb 2017 07:45:58 UTC (36 KB)