Wisdom O. Ikezogwo

Ph.D. Candidate in Computer Science & Engineering | AI Research Scientist
Seattle, US.

About

Highly accomplished Ph.D. Candidate in Computer Science & Engineering with a strong foundation in Artificial Intelligence, Machine Learning, and Electrical Engineering. Specializes in Multimodal AI, Generative Models, and Data Curation, with a proven track record of leading impactful research in medical imaging, egocentric data, and financial analytics. Leverages deep expertise in neural networks, large language models, and advanced data processing to develop innovative solutions and drive significant advancements in AI research and applications. Published extensively in top-tier conferences and journals, demonstrating a commitment to advancing the field.

Work

Mayo Clinic
|

Ph.D. Quantitative Health Sciences Intern

Summary

Contributed to the development of foundational AI models for histopathology.

Highlights

Led research efforts to develop a foundational model for histopathology, training on millions of gigapixel-sized histology images.

Scaled-up computational resources on the Argonne National Lab computing cluster to process and analyze massive datasets.

Clinically evaluated developed models, ensuring their efficacy and relevance in real-world medical applications.

Okra, Inc.
|

ML Engineer

Summary

Developed and deployed machine learning models for financial data analysis.

Highlights

Developed robust models for extracting critical customer financial information from unstructured banking data.

Identified and extracted key customer earning and spending data to feed into downstream lending systems.

Implemented predictive models for income, spending patterns, and financial reconciliations, enhancing lending decision accuracy.

Demz Analytics Limited
|

Data Scientist / ML Engineer

Summary

Designed and implemented production-ready recommendation systems.

Highlights

Developed production recommendation systems leveraging advanced techniques such as attention mechanisms and epsilon-greedy bandit strategies.

Improved user engagement and system efficiency through data-driven insights and model optimization.

Obafemi Awolowo University
|

UG. Research Assistant – Biosignal Processing, Inst. & Control Lab

Summary

Conducted research on biosignal processing and neural network applications.

Highlights

Integrated disparate multivariate time series data, characterized spectral components for dynamic dimensionality reduction.

Trained neural networks for robust classification of brain EEG signals, enhancing diagnostic capabilities.

Performed detailed characterization of EEG spectral components to understand brain activity patterns.

University of Washington
|

Research Assistant – Graphics and Imaging Laboratory (GRAIL)

Summary

Driving cutting-edge research in multimodal AI, focusing on medical imaging and advanced generative models.

Highlights

Led research in Multimodal Large Language Models (LLMs) for medical imaging, developing foundational medical multimodal datasets (Quilt-1M, MedNarratives) with over 1 million image-text pairs.

Engineered state-of-the-art multimodal AI models (QuiltNet, Quilt-LLaVA) for comprehensive medical image analysis, achieving superior performance.

Pioneered a multi-agent AI framework, PathFinder, for clinical diagnosis, outperforming human experts in diagnostic accuracy.

Developed robust benchmarks, MedBlink, to evaluate the performance of multimodal medical AI models.

Leading initiatives to enhance the 'physics' in image and video generative models, focusing on Newtonian physics principles to generate large-scale video scene graph datasets and pipelines.

Apple
|

Ph.D. Machine Learning Research Intern

Summary

Conducted advanced research on efficient multimodal representations for egocentric data.

Highlights

Led research to develop efficient multimodal representations for egocentric data, integrating video, text, audio, IMU, and hand data.

Developed 'Perceive-Predict' framework, leveraging predictive coding between co-occurring modalities to reconstruct missing data.

Applied research to significantly reduce the capture cost of expensive modalities like video, demonstrating practical impact on data collection efficiency.

Education

University of Washington

Ph.D.

Computer Science and Engineering

Grade: 3.97/4.00

Courses

Advanced Machine Learning

Deep Learning

Computer Vision

Natural Language Processing

Data Programming

Introduction to Artificial Intelligence

Obafemi Awolowo University

B.Sc.

Electronic & Electrical Engineering

Grade: 4.73/5.00 (Class Rank 2/120)

Courses

Digital Signal Processing

Control Systems

Electromagnetics

Circuit Theory

Microprocessor Systems

Awards

Population Health Initiative AI Pilot Research Grant Award

Awarded By

University of Washington

Awarded a grant of $100,000 for innovative AI research in population health.

Microsoft's Accelerate Foundation Models Research Grant

Awarded By

Microsoft

Received a $20,000 grant to advance research in foundational AI models.

IBRO-Simons Computational Neuroscience Summer School Travel Grant

Awarded By

International Brain Research Organization (IBRO) & Simons Foundation

Awarded a travel grant to attend the Computational Neuroscience Summer School in Cape Town.

Prof. Kehinde Prize for the Best Graduating Student in the Control Option

Awarded By

Obafemi Awolowo University

Recognized as the top-performing student in the Control Systems specialization.

Oyebolu Prize for Best Male Graduating Student

Awarded By

Obafemi Awolowo University

Awarded for outstanding academic achievement as the best male graduating student.

Federal Government Scholarship Award, Nigeria

Awarded By

Federal Government of Nigeria

Recipient of a national scholarship with a cumulative value of $1500 for academic excellence.

Total/NNPC National Merit Scholarship

Awarded By

Total/NNPC

Recipient of a national merit scholarship with a cumulative value of $1500.

Etisalat Nigeria Merit Scholarship

Awarded By

Etisalat Nigeria

Awarded a merit scholarship valued at $250.

Publications

Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos

Published by

IEEE/CVF Conference on Computer Vision and Pattern Recognition

Summary

Introduced a novel approach for visual instruction tuning using localized narratives from histopathology videos, enhancing multimodal AI understanding in medical contexts.

Quilt-1M: One Million Image-Text Pairs for Histopathology

Published by

NeurIPS

Summary

Presented a large-scale dataset of one million image-text pairs specifically curated for histopathology, significantly advancing research in medical multimodal learning. (ORAL presentation)

Multi-modal Masked Autoencoders Learn Compositional Histopathological Representations

Published by

Machine Learning for Health (ML4H)

Summary

Explored the effectiveness of multi-modal masked autoencoders in learning compositional representations for histopathological images, contributing to robust medical image analysis.

Risk Stratification of Solitary Fibrous Tumor Using Whole Slide Image Analysis

Published by

LABORATORY INVESTIGATION, ELSEVIER SCIENCE INC

Summary

Developed a method for risk stratification of solitary fibrous tumors using whole slide image analysis, providing a valuable tool for pathological assessment.

Recent Advances, Applications and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables

Published by

ML4H Symposium

Summary

Contributed to a comprehensive reflection on recent advancements, applications, and challenges in machine learning for health, summarizing key discussions from research roundtables.

Supervised domain generalization for integration of disparate scalp EEG datasets for automatic epileptic seizure detection

Published by

Computers in Biology and Medicine

Summary

Developed a supervised domain generalization approach to integrate diverse scalp EEG datasets, improving automated epileptic seizure detection across varied data sources.

Empirical Characterization of the Temporal Dynamics of EEG Spectral Components

Published by

International Journal of Online and Biomedical Engineering (IJOE)

Summary

Conducted an empirical study to characterize the temporal dynamics of EEG spectral components, providing insights into brain signal patterns.

Languages

English

Native

Skills

Machine Learning

Deep Learning, Generative Models, Multimodal AI, Computer Vision, Natural Language Processing, Neural Networks, Recommendation Systems, Predictive Modeling, Domain Generalization.

AI/ML Frameworks & Tools

PyTorch, TensorFlow, Scikit-learn, Hugging Face Transformers, OpenCV.

Data Science & Analytics

Data Curation, Data Analysis, Big Data, Statistical Modeling, Data Visualization, Feature Engineering.

Programming Languages

Python, MATLAB, C++.

Medical AI & Biosignals

Medical Imaging, Histopathology, EEG Signal Processing, Clinical Diagnosis, Biosignal Processing.

Research & Development

Scientific Writing, Experiment Design, Algorithm Development, Prototyping, Academic Publishing.

Cloud & Distributed Computing

Argonne National Lab Computing Cluster (ANL), Cloud Platforms (conceptual understanding).

Interests

Research

Generative Modeling, Multimodal Representation Learning, Data Curation, Medical AI, Computer Vision, Natural Language Processing.

Projects

PathFinder: A Multi-Modal Multi-Agent Framework for Diagnostic Decision-Making in Histopathology

Summary

Developing an innovative multi-modal, multi-agent AI framework to enhance diagnostic decision-making in histopathology by integrating various data types and AI agents.

MedicalNarratives: Connecting Medical Vision and Language with Procedural and Localized Narratives across all medical imaging domains

Summary

A project focused on bridging medical vision and language understanding through procedural and localized narratives to create a unified framework for medical imaging analysis.

MedBlink: Probing the Fundamental Medical Imaging Knowledge of Multimodal Language Models

Summary

Investigating and evaluating the core medical imaging knowledge embedded within multimodal language models to identify strengths and limitations.

Percieve-Predict: Modality and Time-Aware Egocentric Efficient Multi-Modal Representations

Summary

Developing efficient multi-modal representations for egocentric data (video, text, audio, IMU, hands) by leveraging predictive coding to reconstruct missing modalities.

VPhysics: Temporally consistent Physics in Video (multiframe) Generation via Alignment

Summary

Research on generating temporally consistent video frames that adhere to Newtonian physics principles, leveraging alignment techniques.

Synthetic Video Scene Graph Generation

Summary

A project focused on generating synthetic video scene graphs, crucial for training and evaluating advanced computer vision models.