M. Usman Rafique
About Usman
I am a Machine Learning Researcher with experience of applying cutting-edge research to solve real-world problems. In my current role at Zoox, I am focused on Data Optimization and Machine Learning for the Behavior Autonomy of our cool robo-taxi.
Previously, I was a Senior Machine Learning Engineer at Bastian Solutions (a Toyota company) from 2023 to 2024, where I developed and deployed state-of-the-art computer vision and machine learning solutions for autonomous pick-and-place robots. Before that, I served as a Senior Research and Development Engineer at Kitware Inc. (2021-2023), tackling diverse computer vision challenges such as change detection from overhead imagery, person identification, novel view synthesis, and atmospheric turbulence correction.
I earned my Ph.D. in Electrical Engineering from the University of Kentucky, with research focused on weakly supervised deep learning methods for image synthesis, semantic segmentation, and change detection.
I’m passionate about staying at the forefront of AI advancements. You can find examples of my work with Large Language Models (LLMs) on my Github, including LLM-Forge, a playground for building practical LLMs with limited compute resources.
My areas of expertise include:
Data-Centric AI for Autonomous Behavior: Architecting data optimization strategies for learned behavior models that control robo-taxi trajectory. My work involves curating large-scale datasets to ensure full coverage of common and edge-case driving scenarios, maximizing model performance while managing data size for efficient, practical training.
Computer Vision for Autonomous Systems: Developing and deploying robust computer vision and ML systems for autonomous robots. This enables complex tasks in dynamic environments, such as object picking, depalletizing, and real-time scene understanding.
Continual Learning: Implementing AI systems that continuously learn and adapt to new data without forgetting previous knowledge.
Production ML: Designing, implementing, and deploying machine learning models for real-time applications in production environments.
Multi-modal Understanding: Combining data from different sources, such as aerial and ground-level imagery, to gain a more comprehensive understanding of a scene.
Vision-Language Models: Integrating computer vision and natural language processing to create AI systems that can understand both images and text.
Academic Background
I completed my PhD at the University of Kentucky, where I was a member of the Multimodal Vision Research Lab. My research focused on combining information from multiple images for scene understanding and image synthesis. My PhD advisors were Dr. Nathan Jacobs and Dr. Samson Cheung
Professional Experience
- Senior Machine Learning Engineer, Zoox: Feb 2024 - present
- Developing machine learning solutions for autonomous robo-taxi
- Senior Machine Learning Engineer, Bastian Solutions (Toyota): Aug 2023 - Feb 2024
- Developing computer vision and machine learning solutions for autonomous robotic systems.
- Senior Research and Development Engineer, Kitware Inc.: Aug 2021 - May 2023
- Conducted research on change detection, person identification, and novel view synthesis.
Old Teaching Website
My old website, from my teaching days is available here.
Research Projects
Near-Remote Sensing
Diverse View Synthesis
Novel View Synthesis
Multi-Image Fusion
Weakly Supervised Segmentation
Recent News
- Oct 9, 2025: published a post ICCV 2025 Area Chair Experience
- Aug 1, 2025: invited to serve as Program Committee for AAAI 2026
- Feb 10, 2025: joined Zoox Inc. as a Senior Software Engineer in the Machine Learning team. LinkedIn Post
- Feb 1, 2025: invited to be an Area Chair (AC) at ICCV 2025
- Jan 15, 2025: recognized as an outstanding reviewer for WACV 2025
- Sep 17, 2024: released LLM-Forge Library on Github, a playground for building and training practical LLMs with limited computational resources.
- May 23, 2024: recognized as an outstanding reviewer for CVPR 2024. Pleased to be among top 2% of 9872 reviewers.
- Aug 21, 2023: glad to join RnD Lab of Bastian Solutions (a Toyota Advanced Logistics Company) as a Senior Machine Learning Engineer. I will be doing research on computer vision models for robotics and automation.
- June 5, 2023: implemented GPT-Nano, a light-weight large language model (LLM), implemented from scratch in PyTorch.
- May 6, 2023: recognized as an outstanding reviewer for CVPR 2023. I am very pleased to be one of 232 outstanding reviewers out of a total of 7000 reviewers.
- Jan 20, 2023: two papers accepted at IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 2023.
- Jan 20, 2023: wrote a blog post: “Reflections on Reviewing Computer Vision Papers”.
- Aug 17, 2022: paper “Handling Image and Label Resolution Mismatch in Remote Sensing” (PDF) accepted to WACV 2023.
- March 3, 2022: paper “Revisiting Near/Remote Sensing With Geospatial Attention” (PDF) accepted to CVPR 2022.
- Feb 15, 2022: paper on sinkhole segmentation published to AGU Earth and Space Science Journal
- Nov 24 2021: recognized as an outstanding reviewer for BMVC 2021.
- Aug 2, 2021: joined Kitware Inc. as a Senior Research and Development Engineer
- June 8, 2021: I have successfully defended my PhD dissertation :confetti_ball: Bonus: the announcement tweet by my advisor
- May 20, 2021: recognized as an outstanding reviewer for CVPR 2021
- April 11, 2021: paper acceptd to NTIRE: New Trends in Image Restoration and Enhancement workshop and challenges at CVPR 2021
- March 16, 2021: paper accepted to IEEE International Geosciences and Remote Sensing Symposium (IGARSS) 2021
- December 12, 2020: gave a talk on “Automatic Identification of Sinkholes Using Deep Learning from Remote Sensing Data” at Kentucky Geological Survey
- July 31, 2020: paper accepted to BioImage Computing (BIC) workshop held at ECCV 2020
- July 29, 2020: paper accpeted to The British Machine Vision Conference (BMVC) 2020
- March 29, 2020: paper accepted to IEEE International Geosciences and Remote Sensing Symposium (IGARSS) 2020
- December 10, 2019: successfully defended my dissertation proposal
- June 17, 2019: presented my paper at EarthVision 2019 (CVPR 2019), Long Beach, CA
- April 5, 2019: paper accepted to IEEE International Geosciences and Remote Sensing Symposium (IGARSS) 2019
- April 4, 2019: paper accepted at EarthVision Workshop 2019 held in conjunction with CVPR 2019
