Research Projects
BOSS Net: A Self-consistent Data-driven Model for Determining Stellar Parameters from Optical and Near-infrared Spectra
Data-driven models, which apply machine learning to infer physical properties from large quantities of data, have become increasingly important for extracting stellar properties from spectra. In general, these methods have been applied to data in one wavelength regime or another. For example, APOGEE Net has been applied to near-IR spectra from the Sloan Digital Sky Survey (SDSS)–V APOGEE survey to predict stellar parameters ($T_{\rm eff}$, $\log g$, and $[\rm Fe/H]$) for all stars with $T_{\rm eff}$ from 3000 to 50,000\,K, including pre-main-sequence stars, OB stars, main-sequence dwarfs, and red giants. The increasing number of large surveys across multiple wavelength regimes provides the opportunity to improve data-driven models through learning from multiple data sets at once. In SDSS-V, a number of spectra of stars will be observed not just with APOGEE in the near-IR, but also with BOSS in the optical regime. Here, we aim to develop a complementary model, BOSS Net, that will replicate the performance of APOGEE Net in these optical data through label transfer. We further improve the model by extending it to brown dwarfs, as well as white dwarfs, resulting in a comprehensive coverage between $1700 < T_{\rm eff} < 100\,000\,{\rm K}$ and $0 < \log g < 10$, to ensure BOSS Net can reliably measure parameters of most of the commonly observed objects within this parameter space. We also update APOGEE Net to achieve a comparable performance in the near-IR regime. The resulting models provide a robust tool for measuring stellar evolutionary states, and, in turn, enable characterization of the star-forming history of the Galaxy.

STARS: Sensor-Agnostic Transformer Architecture For Remote Sensing
We present a sensor-agnostic spectral transformer as the basis for spectral foundation models. To that end, we introduce a Universal Spectral Representation (USR) that leverages sensor meta-data, such as sensing kernel specifications and sensing wavelengths, to encode spectra obtained from any spectral instrument into a common representation, such that a single model can ingest data from any sensor. Furthermore, we develop a methodology for pre-training such models in a self-supervised manner using a novel random sensor-augmentation and reconstruction pipeline to learn spectral features independent of the sensing paradigm. We demonstrate that our architecture can learn sensor independent spectral features that generalize effectively to sensors not seen during training. This work sets the stage for training foundation models that can both leverage and be effective for the growing diversity of spectral data.

TRONN BEM: Tractable, Reliable, and Operational Neural Networks for Buildings Energy Management.
Control problems naturally arise across a wide range of domains, from robotics to building energy management. In the latter case, model-free learning-based control algorithms have shown significant promise, particularly differentiable predictive control (DPC) and deep reinforcement learning (DRL). However, the relative advantages and disadvantages of DPC and DRL for building control are not yet fully explored in terms of control performance, sample efficiency, computational cost and ease of use. This work contributes a benchmarking study of DPC and DRL across two control scenarios across six systems. We consider both the whitebox control scenario, where the true equations governing the system are known, and the blackbox scenario, where a neural state space model (NSSM) is trained from offline interaction with the true system and used as a surrogate for the true system to train the controller. The former allows us to isolate the controller training process under idealized conditions, while the latter investigates the more realistic scenario where only historical observations are available, and the true governing equations are not known. We benchmark the control performance of these algorithms across six systems: two classic control systems, two classic reinforcement learning systems, and two building envelope models. In general, we find DPC to be competitive with the best of the seven DRL algorithms for the task, with a slight advantage on the classic control and building systems, but worse performance than DRL on the classic reinforcement learning systems. By benchmarking DRL and DPC across system types and control scenarios, we hope to accelerate the deployment of these methods in real-world settings.