UCL  IRIS
Institutional Research Information Service
UCL Logo
Please report any queries concerning the funding data grouped in the sections named "Externally Awarded" or "Internally Disbursed" (shown on the profile page) to your Research Finance Administrator. Your can find your Research Finance Administrator at https://www.ucl.ac.uk/finance/research/rs-contacts.php by entering your department
Please report any queries concerning the student data shown on the profile page to:

Email: portico-services@ucl.ac.uk

Help Desk: http://www.ucl.ac.uk/ras/portico/helpdesk
Publication Detail
Regression on Probability Measures: A Simple and Consistent Algorithm
  • Publication Type:
    Conference presentation
  • Publication Sub Type:
    Presentation
  • Authors:
    Szabo Z, Sriperumbudur B, Poczos B, Gretton A
  • Date:
    29/05/2015
  • Name of Conference:
    CRiSM Seminars, Department of Statistics, University of Warwick
  • Conference place:
    Coventry, UK
  • Conference start date:
    29/05/2015
  • Conference finish date:
    29/05/2015
  • Keywords:
    consistency, convergence rate, distribution regression, mean embedding, set kernel, two-stage sampling
  • Notes:
    abstract: "http://www.gatsby.ucl.ac.uk/~szabo/talks/invited_talk/Zoltan_Szabo_invited_talk_University_of_Oxford_01_05_2015_abstract.pdf", slides: "http://www.gatsby.ucl.ac.uk/~szabo/talks/invited_talk/Zoltan_Szabo_invited_talk_University_of_Warwick_29_05_2015.pdf"
Abstract
We address the distribution regression problem: we regress from probability measures to Hilbert-space valued outputs, where only samples are available from the input distributions. Many important statistical and machine learning problems can be phrased within this framework including point estimation tasks without analytical solution, or multi-instance learning. However, due to the two-stage sampled nature of the problem, the theoretical analysis becomes quite challenging: to the best of our knowledge the only existing method with performance guarantees requires density estimation (which of ten performs poorly in practise) and the distributions to be defined on a compact Euclidean domain. We present a simple, analytically tractable alternative to solve the distribution regression problem: we embed the distributions to a reproducing kernel Hilbert space and perform ridge regression from the embedded distributions to the outputs. We prove that this scheme is consistent under mild conditions (for distributions on separable topological domains endowed with kernels), and construct explicit finite sample bounds on the excess risk as a function of the sample numbers and the problem difficulty, which hold with high probability. Specifically, we establish the consistency of set kernels in regression, which was a 15-year-old-openquestion, and also present new kernels on embedded distributions. The practical efficiency of the studied technique is illustrated in supervised entropy learning and aerosol prediction using multispectral satellite images.
Publication data is maintained in RPS. Visit https://rps.ucl.ac.uk
 More search options
UCL Researchers
Author
Gatsby Computational Neurosci Unit
University College London - Gower Street - London - WC1E 6BT Tel:+44 (0)20 7679 2000

© UCL 1999–2011

Search by