Leverage Large Language Models for Complex Robot Manipulation
PI Researcher: Chenliang Xu
In robot manipulation, adapting to dynamic environments with flexible task specifications is challenging. Language-based vision manipulation systems offer a solution by linking language instructions to visual data and generating actions. However, current approaches often develop vision models and action policies separately, leading to poor integration. To address this, we propose ACTLLM, a method that unifies visual interpretation and policy learning using large language models (LLMs). By generating structured scene descriptions and incorporating an action consistency loss, ACTLLM is expected to enhance the fusion of visual and policy elements, facilitating the efficient execution of complex tasks within a multi-turn visual dialogue framework.

PI Researcher: 
PI Researcher: 
PI Researcher: 
PI Researcher: Mark Bocko
PI Researcher: 
PI Researchers:
PI Researcher:
Toward Speaker-Specific Voice Spoofing Countermeasures

PI Researcher: 

PI Researcher:
IDEX Health and Science – Machine Learning for the Enhanced Design of Multilayer Optical Filters
PI Researcher: 
PI Researcher: 

PI Researcher:
Parverio – Second Generation Computer Vision Tools for the Study of Luekocyte Trafficking Across Vascular Barriers In Vitro
PI Researcher:
Advanced Atomization Technologies – Establishing a Multi-modal Data Fusion Framework for Acquiring Tacit Knowledge from Machinists
PI Researcher: 
Causal modeling represents an active area of research for determining causal relationships when a randomized controlled experiment is not available. The RPI team has developed extensive experience building causal models using micro-level data. 
Trendly – Few-Shot learning for Fine-Grained Object Recognition
L3Harris – Scalable Architectures and Increased Capacity for Bi-stable Resistively-coupled Ising Machine – BRIM
IBM – Learning to Localize Sources of Network Diffusion
Flaum Eye Institute – 3D eye imaging and machine learning strategies to improve cataract surgery
Cataract surgery is the most often performed surgery in any hospital of the world (28 million/year). However, the process by which the intraocular lens to replace crystalline lens is selected relies on limited anatomical information and rudimentary formulas. 


Immersitech – Development of a Framework for the Evaluation of Spatial Audio System Performance
ACV Auctions – Auto Auction Data as a Leading Indicator of Economic Activity and Vehicle Valuation
IBM – Learning to Localize Sources of Network Diffusion