Theoretical foundations

Esther Rolf; Jonathan Proctor; Tamma Carleton; Ian Bolliger; Vaishaal Shankar; Miyabi Ishihara; Benjamin Recht; Solomon Hsiang

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Theoretical foundations

ER Esther Rolf

JP Jonathan Proctor

TC Tamma Carleton

IB Ian Bolliger

VS Vaishaal Shankar

MI Miyabi Ishihara

BR Benjamin Recht

SH Solomon Hsiang

This method is extracted from research article: Nat Commun, Jul 2021

A generalizable and accessible approach to machine learning with global satellite imagery

DOI: 10.1038/s41467-021-24638-z

Request a Protocol

Ask a question

Favorite

MOSAIKS is motivated by the goal of enabling generalizable and skillful SIML predictions. It achieves this by embedding images in a basis that is both descriptive (i.e., models trained using this single basis achieve high skill across diverse labels) and efficient (i.e., such skill is achieved using a relatively low-dimensional basis). The approach for this embedding relies on the theory of random kitchen sinks^¹⁶, a method for feature generation that enables the linear approximation of arbitrary well-behaved functions. This is akin to the use of polynomial features or discrete Fourier transforms for function approximation generally, such as functions of one dimension. When users apply these features in linear regression, they identify linear weightings of these basis vectors important for predicting a specific set of labels. With inputs of high dimension, such as the satellite images we consider, it has been shown experimentally^{¹⁷–¹⁹} and theoretically^⁴³ that a randomly selected subspace of the basis often performs as well as the entire basis for prediction problems.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol