Culture & Research Philosophy
Our interdisciplinary team works at the frontier of computer vision, natural language processing and human-computer interaction and its intersection, guided by curiosity and scientific rigor:
- Experimental: Conduct reproducible experiments that advance fundamental understanding.
- Computational: Leverage algorithms, models, and coding expertise to tackle challenging questions.
Lab Entry
We welcome students from all disciplines with strong curiosity and a passion for rigorous, original research. We commonly submit our work to conferences including ACL, NIPS, CVPR and ACM CHI.
- Liveness Detection: This is the critical gatekeeper for digital identity, but it's vulnerable to attacks. Key research problems include defending against sophisticated fraud (deepfakes, 3D masks, morphing attacks), reducing demographic bias, and designing lightweight models for on-device use.
- Facial Recognition: This technology carries significant technical and societal risks. Key problems for a thesis are ensuring fairness, eliminating demographic bias, and establishing ethical best practices.
- OCR (Optical Character Recognition): This is the core technology for digitizing the physical world, but it struggles with complex, real-world text. Key research problems are accurately reading complex layouts (tables, forms), deciphering handwriting, and handling low-quality images.
- LLM Reasoning and Hallucinations: Large Language Models serve as the engine for complex applications like the AI tutor, mental health chatbot, AI knowledge portal, AI interviewer, and AI market research, but they frequently suffer from logical failures and hallucinations. Key research problems for a thesis include developing frameworks to improve reasoning (e.g., Chain-of-Thought) and methods to detect and reduce hallucinations.
- Small Language Models (SLMs): While LLMs are powerful, they are too large for many real-world applications. This research focuses on training compact models (e.g., <7B parameters) that retain high reasoning capabilities for targeted domains. Key thesis problems involve knowledge distillation, quantization, and efficient training frameworks.
- Synthetic Datasets: High-quality data is the primary bottleneck for modern AI. This topic focuses on using LLMs to generate training data for other models, solving issues of scarcity and privacy. Key research problems include ensuring diversity and preventing "model collapse" (where models degrade when trained on synthetic data).
- Synthetic Personas: This topic explores using generative AI to create realistic, AI-driven user profiles as a new method for market research. The core research challenge for a Master's or PhD project is validation: how do we prove that the insights from these synthetic personas accurately reflect real human behavior and avoid AI-generated "echo chambers"?
- Human-AI Interaction: This is foundational research for ensuring AI makes us *better*, not just replaces us. It focuses on designing collaborative systems that augment human cognition, creativity, and learning. Key research questions for a student project include: How do we design AI that is truly collaborative and understandable? How do we measure its real impact on human learning? And how do we ensure it supports human well-being?
- Medical VQA (Visual Question Answering): This technology can act as a "co-pilot" for doctors, helping them interpret medical scans by answering natural language questions. Key problems are the extreme need for accuracy in a high-stakes environment and the lack of large-scale annotated datasets. A core PhD challenge is creating these datasets, which requires expert manual annotation from doctors and exploring synthetic data to augment rare-disease examples.
- Speech2Text and Voice Cloning: This technology breaks down communication barriers and creates new possibilities for personalized media. Key problems are achieving accuracy in noisy environments with diverse accents and addressing the ethical risks of deepfake audio. A major project area is creating large-scale, diverse speech datasets, which involves balancing expensive manual transcription with crowdsourced and synthetically-generated audio to cover many languages and accents.
- EEG (Brain-Computer Interfaces): BCI offers a life-changing communication channel for people with severe motor disabilities. Key research problems for a Master's or PhD project involve improving the low speed and signal-to-noise ratio of non-invasive EEG, reducing user fatigue, and engineering robust hybrid BCI systems (like those combining SSVEP and P300 signals) that are fast and reliable enough for daily use.
- Raman Spectroscopy: This research could lead to a new class of non-invasive medical diagnostics, like a "no-prick" portable glucose monitor. Key problems for a student project involve applying advanced signal processing and machine learning to extract a clear, reliable signal (e.g., glucose) from the highly complex and "noisy" spectroscopic data of human tissue, and then engineering an accurate, portable, and affordable device.
