Department of Mathematical Sciences
Amos Eaton 216 @ 4:00pm-5:00pm EST, Mondays
This seminar is a venue for math students to communicate applied research, to learn from engineers and scientists, and to dine together. All RPI faculty and students are welcome.
Chair | Chanaka D. Mapa Mudiyanselage Corey Currun |
Advisor | Prof. Fabian M. Faulstich |
Contact |
Kangbo Li (sponsorship)
lik17@rpi.edu |
Date | Title (click for abstract) | Speaker |
---|---|---|
09/08 |
Realizing Quantum Dynamical Phase Transitions on IBM Quantum ComputersQuantum Dynamical Phase Transitions (DPTs) are a framework to understand how quantum systems evolve over time. Due to the high circuit depth needed for accurate time evolution of large-scale quantum systems on digital quantum computers, DPTs have been mostly investigated on analog quantum simulators (ex: trapped-ion) on large 1D systems and small 2D systems. Utilizing the lower circuit depth provided by fractional-gate equipped IBM quantum computers, we realize DPTs by applying quantum quenches of large 2D Transverse Field Ising Models with hardware-native geometry. Experimental results on real quantum hardware demonstrate that fractional gates outperform standard basis gates and DPTs can be realized for large 2D systems (156 qubits). Note: All necessary background knowledge will be built up, so anyone that is interested in quantum computing is invited to attend.
Red Curry |
RPI Math/Jack Mandell |
09/15 |
A Multi-Frequency Helmholtz Solver Based on the WaveHoltz AlgorithmWe develop and analyze a new approach for simultaneously computing multiple solutions to the Helmholtz equation for different frequencies and different forcing functions. The new Multi-Frequency WaveHoltz (MFWH) algorithm is an extension of the original WaveHoltz method and both are based on time-filtering solutions to an associated wave-equation. With MFWH, the different Helmholtz solutions are computed simultaneously by solving a single wave equation combined with multiple time filters. The MFWH algorithm defines a fixed-point iteration which can be accelerated with Krylov methods such as GMRES. The solution of the wave equation can be efficiently solved with implicit time-stepping using as few as five time-steps per period. Discretization errors in time can be completely removed by various adjustments. Numerical results are given to confirm the convergence theory when solving energy conserving problems. |
RPI Math/Francis Appiah |
09/22 |
Extending Graph Condensation to Multi-Label DatasetsTraining Graph Neural Networks (GNNs) on large graphs is often hindered by redundancy and high computational demands. Existing graph condensation methods typically target single-label scenarios, but many real-world graphs are multi-label—nodes can belong to multiple classes simultaneously. In this talk, I’ll introduce GCond: a graph condensation framework adapted to the multi-label setting, utilizing K-Center initialization and binary cross-entropy loss. Through experiments on eight real-world multi-label graph datasets, GCond consistently achieves state-of-the-art performance, enhancing both scalability and efficiency for multi-label graph learning. |
RPI CS/Liangliang(Lia) Zhang |
09/30 (Tuesday) |
TBATBA |
Texas A&M/Dr. Sottile |
10/06 |
Machine Learned Interatomic Potentials for 2D MaterialsDensity Functional Theory (DFT) is the standard algorithm for many electronic structure calculations. Like many methods, DFT sacrifices some scalability for accuracy. This makes DFT levels of accuracy difficult to achieve for large systems like those in multilayer 2D heterostructures. We have obtained DFT accuracy by training atomic cluster expansions (ACE) on multilayer graphene. |
U. Minnesota/Drake Clark |
10/13 | Columbus Day: No Seminar | |
10/20 |
CTSuggest: An LLM-powered Open Source Application to support Clinical Trial Design by Suggesting Baseline FeaturesThe specification of potentially confounding baseline features or covariates is a crucial step in the design of prospective and retrospective clinical trials. Baseline features are critical for ensuring the integrity of the study design, the validity of the results, and the generalizability of the findings. We introduce CTSuggest, an application leveraging large language models (LLMs) to generate baseline features as part of the clinical trial design process. Users first specify basic trial metadata, and then CTSuggest suggests appropriate features with an explanation for each feature. Users can create an entirely new trial or start with metadata from an existing trial from ClinicalTrials.gov. We perform experiments validating the quality of CTSuggest’s baseline features using the benchmark CT-Pub dataset taken from clinical trial publications and evaluating using the “LLM-as-a-Judge” (LaaJ) framework provided in the CTBench benchmark. The results show that the feature suggestions generated by the state-of-the-art GPT-4o model meet or exceed the previously published CTBench results. We also show the promise of using a smaller open-source Llama model. Additionally, we examine the reliability of LaaJ evaluation within this setting. Coherence checking revealed hallucinations in the LaaJ’s evaluation, necessitating a postprocessing correction step that yielded lower but more accurate performance metrics. Three different types of hallucination were observed. The hallucination rate provides a quantifiable coherence metric that can be systematically used to improve LaaJ reliability. Our findings underscore the challenges in developing reliable LLM evaluation methods in healthcare applications and demonstrate a potential framework for improving LaaJ systems. |
RPI Math/Correy Currun (Tentative) |
10/27 |
TBATBA |
RPI Math/ Zihan Nie |
11/03 |
TBATBA |
RPI ECSE/Heshan Fernando |
11/10 |
Canonical Polyadic Decomposition for efficient Electron Correlation MethodsTBA |
RPI Math/Talha Aziz |
11/17 |
TBATBA |
RPI Math/Zachery Wolski (Tentative) |
Location: Amos Eaton 215 | Time: Thursdays 4:00PM - 5:00PM EST
Date | Title (click for abstract) | Speaker |
---|---|---|
02/06 |
Damped Proximal Augmented Lagrangian Method for weakly-Convex Problems with Convex ConstraintsWe give a damped proximal augmented Lagrangian method (DPALM) for solving problems with a weakly-convex objective and convex linear/non-linear constraints. Instead of taking a full stepsize, DPALM adopts a damped dual stepsize to ensure the boundedness of dual iterates. We show that DPALM can produce a (near) ε-KKT point within O(ε−2) outer iterations if each DPALM subproblem is solved to a proper accuracy. In addition, we establish overall iteration complexity of DPALM when the objective is either a regularized smooth function or in a regularized compositional form. For the former case, DPALM achieves the complexity of eO ε−2.5 to produce an ε-KKT point by applying an accelerated proximal gradient (APG) method to each DPALM subproblem. For the latter case, the complexity of DPALM is eO ε−3 to produce a near ε-KKT point by using an APG to solve a Moreau-envelope smoothed version of each subproblem. Our outer iteration complexity and the overall complexity either generalize existing best ones from unconstrained or linear-constrained problems to convex-constrained ones, or improve over the best-known results on solving the same-structured problems. Furthermore, numerical experiments on linearly/quadratically constrained non-convex quadratic programs and linear-constrained robust nonlinear least squares are conducted to demonstrate the empirical efficiency of the proposed DPALM over several state-of-the art methods. |
Hari Dahal |
02/13 |
Superconducting Qubit Control with Single Flux Quantum Pulse TrainsQubit control is one of the main challenges of building a scalable Quantum computer with superconducting qubits. Current technologies are based on room temperature microwave generators. Scaling microwave control to millions of qubits is more than an engineering challenge due to the excessive heat delivered to the cryostat and the hardware cost. This talk will introduce the basics of microwave qubit control and some of its issues. Then we will discuss the theory and experimental realization of a control technology using only superconducting (cold) digital (cheap) circuits, which opens a path towards scalable qubit control. |
Kangbo Li |
02/21 (Friday) |
Computing Arrangements of HypersurfacesIn this talk, I will present a Julia package, HypersurfaceRegions.jl, for computing all connected components in the complement of an arrangement of real algebraic hypersurfaces in $R^n$. The package is based on a modified implementation of the algorithm from the paper "Smooth Connectivity in Real Algebraic Varieties" by Cummings et al. I will outline the theory behind the algorithm and our implementation. I will demonstrate the use of the package through various examples. |
Ada Wang (Harvard) |
02/27 |
An algorithm for numerically solving the Maxwell-Bloch equations.As described by quantum mechanics, energy is absorbed and emitted from atoms in the form of photons with discrete energy values. When an atom absorbs or emits a photon, electrons transition up or down energy levels. The energy associated with these transition determines the frequency, i.e. color, of the absorbed or emitted light. Using Schrodinger’s equation to describe the atomic structure, and Maxwell’s equations to describe the light, a system of equations that fully describes the behavior of the light matter interaction can be derived. This system is known as the Maxwell-Bloch equations. In this talk, an algorithm for approximating solutions to the Maxwell-Bloch equations is discussed. Algorithms for an atom with 2 and 3 energy levels are discussed, as well as an algorithm for the more general M level problem. We also use these algorithms to probe the behavior of soliton solutions for the 2 and 3 energy level systems. |
Miles Corn |
03/06 | Spring Break: No Seminar | |
03/13 |
EigenWave: Computing Eigenvalues and Eigenvectors by Time-Filtering the Wave EquationA novel EigenWave algorithm is described to compute eigenvalues and eigenfunctions of elliptic boundary value problems. Based on the recently developed WaveHoltz scheme, the algorithm solves a related time-dependent wave equation as part of an iteration. At each iteration, the solution is filtered in time. After filtering, the solution mainly contains eigenmodes whose eigenvalues are near the target frequency of the filter. The iteration is embedded within a matrix-free Arnoldi algorithm, allowing the efficient computation of multiple eigenpairs near the target frequency. The approach allows the computation of eigenvalues anywhere in the spectrum without the need to invert an indefinite matrix, as is common with other approaches. The approach is demonstrated by finding eigenpairs of the Laplacian in complex geometry using overset grids. Results in two and three-space dimensions are presented. For large enough problems, it is demonstrated that EigenWave can outperform modern Arnoldi-type eigenvalue algorithms. |
Ngan Le |
03/20 |
Guaranteeing Performance in Autonomous Helicopter Aerial RefuelingHelicopter aerial refueling refers to the process of refueling a helicopter in mid-flight with the aid of a tanker aircraft. This maneuver is particularly challenging due to 1) complex aerodynamic interactions between the helicopter, the tanker, and the refueling hose-drogue system, 2) high pilot workload, 3) strict safety constraints, and 4) the contact-critical nature of the operation. To address these challenges, we propose a novel autonomous control methodology that combines model-based control with data-driven approaches such as reinforcement learning (RL). Given that aerial refueling is a safety-critical operation, it is essential to guarantee stability and performance for this hybrid control system. This talk will explain how Lyapunov stability and ultimate boundedness can be leveraged to establish rigorous stability and performance guarantees while accounting for controller parameters and uncertainties in the drogue dynamics. |
Damsara Jayarathna (RPI MANE) |
03/27 |
Accurate and efficient linear-scaling framework for hybrid DFT in finite-gap systemsBy admixing a fraction of exact exchange (EXX), hybrid DFT provides a more accurate and reliable description of electronic structure than traditional semi-local DFT (density functional theory). However, the conventional reciprocal-space EXX evaluation is cubic scaling and computationally demanding (typically 10x–100x more expensive than semi-local DFT), which limits the applicability of hybrid DFT. To overcome this bottleneck, we have developed an accurate linear-scaling approach that exploits the sparsity of the EXX interaction using a localized representation of the occupied space. For crystalline systems with general k-point sampling, further optimization can be made to exploit both locality and periodicity. |
Ju-an Zhang (Cornell) |
03/31 (Monday) |
Algebraic Varieties Arising in Second QuantizationWe develop algebraic geometry for coupled cluster theory using second quantization. The high-dimensional eigenvalue problems that encode the electronic Schrödinger equation are approximated by a hierarchy of polynomial systems at various levels of truncation. The truncated eigenstates parametrize well known varieties such as the Grassmannian, flag varieties and spinor varieties. We will offer a detailed study into the truncation varieties. Additionally in second quantization we work within the exterior algebra. There we can define interior and exterior operators called the creation and annihilation operators. They span a special Clifford algebra called the Fermi-Dirac algebra, which we will study as well. |
Svala Sverrisdóttir (Berkeley) |
04/03 |
Inferring the number of active molecular motors on a cargo from cargo trajectoriesTo function, cells must move material internally. This intracellular transport is achieved by molecular motors, which transport vesicle-bound cargo along protein filaments. In vitro experiments have uncovered the mechanochemistry of how single, isolated motors turn chemical energy into mechanical work as they "walk" along a protein filament. In cells, however, multiple motors transport cargo. Some of these motors bind to the protein filament and contribute to cargo transport; others diffuse over the surface of the cargo, and the motors transition between the two roles. It is a challenge to experimentally determine the number of motors on a given cargo, let alone the time-varying number of these motors bound to the protein filament. To address this, we developed a method to estimate the number of motors and the number of filament-bound motors from position-time measurements of the cargo. The method uses our knowledge of the mechanics of the system and the mechanochemistry of the motors to infer the most likely number of all motors on the cargo and of filament-bound motors. |
Yonatan Ashenafi (WPI) |
04/10 |
Neighbor-Sampling Based Adam-Type Stochastic Methods for Training Graph Neural NetworksGraph convolutional networks (GCNs) are a powerful tool for graph representation learning. Due to the recursive neighborhood aggregations employed by GCNs, efficient training methods suffer from a lack of theoretical guarantees or are missing important practical elements from modern deep learning algorithms, such as adaptivity and momentum. We present several neighbor-sampling (NS) based Adam-type stochastic methods for solving a nonconvex GCN training problem. We utilize the control variate technique proposed by [Chen et al., 2017] to reduce the stochastic error caused by neighbor sampling. Under standard assumptions for Adam-type methods, we show that our methods enjoy the optimal convergence rate. In addition, we conduct extensive numerical experiments on node classification tasks with several benchmark datasets. The results demonstrate superior performance of our methods over classic NS-based SGD that also uses the control-variate technique, especially for large-scale graph datasets. |
Molly Noel |
04/17 |
Algorithmic Designs to Investigate Trustworthiness in Machine Learning ModelsWith the surge of AI models in everyday use, examining the trustworthiness of these models remains a crucial concern. Trustworthiness is defined by several key factors: vulnerabilities inherent in the model architecture that can be exploited by adversaries, leading to faulty model use; vulnerabilities in the training data that result in unfair demographic biases, along with mechanisms to mitigate these biases; and latent representations of models that can be used to recover sensitive training information. This presentation will explore these aspects in depth, discussing how adversarial attacks can compromise model integrity, the impact of biased training data on model fairness, and the risks associated with extracting sensitive information from model representations. Additionally, we will examine algorithmic strategies designed to enhance the robustness and fairness of machine learning models, ensuring their reliability and ethical deployment in real-world applications. |
Huzaifa Arif (RPI ECSE) |