auralis-vfs
Pip-installable Python library for vocal fatigue scoring from raw speech. Built on ECAPA-TDNN-VHE — returns a normalized fatigue score from any audio input.
INFERENCE Lab conducts reproducible AI research, builds production systems, and trains engineers who can deploy — not just describe. Engineering discipline over hype, evidence over branding.
INFERENCE Lab is an applied AI research and engineering organization founded by Muhammad Khubaib Ahmad. It operates as original research, AI/software engineering services, and structured engineering education — under one identity, each track reinforcing the others' credibility.
Our aim: close the gap between people who know AI concepts and engineers who can build, deploy and maintain AI systems.
Real, deployed output — not demos that break the moment they leave a notebook.
Reproducible pipelines, proper evaluation, and a permanent DOI on every research release.
You leave with systems on GitHub and models on HuggingFace, not a PDF.
Research informs the curriculum. The curriculum trains the engineers. The engineers ship the services. Each track makes the others credible.
Original research in low-resource NLP and speech intelligence — conducted independently and through international academic collaboration.
AI/ML engineering, LLM and RAG systems, computer vision, speech AI, and full-stack delivery for organizations.
A structured, deployment-focused curriculum that takes developers from basic Python syntax to deploying production AI systems.
Independent and international research — every release ships reproducible pipelines, evaluation documentation and a permanent DOI.
ECAPA-TDNN-VHE designed from scratch with supervised contrastive loss — 2.5× accuracy over baseline (78% vs 36%), F1 scores 0.85 / 0.78 / 0.70 across three fatigue classes.
Development and occupational validation of an automated vocal load assessment tool for professional voice users — clinical-grade speech analysis in production.
First large-scale Roman Urdu emotion corpus — 134K labeled samples with Fleiss κ = 0.658 (substantial agreement), multi-institute annotation, fully open-source on HuggingFace and Harvard Dataverse.
Large-scale Roman Urdu sentiment corpus built via privacy-preserving embedding pipelines. Benchmarks state-of-the-art Transformer models — addressing a critical gap in low-resource South Asian NLP.
Largest high-quality Roman Urdu sentiment dataset via privacy-preserving embedding pipelines — SOTA 0.84 accuracy, 0.83 Macro-F1.
Time-series forecasting and smart-agriculture DSS — demonstrated 50–60% yield improvement through data-driven intervention.
Multi-institutional international study applying Cognitive Systems Engineering to healthcare ergonomics — systematic analysis of workload, safety, and intervention efficacy.
Libraries, APIs and platforms spanning speech AI, LLM observability, RAG, agents and MLOps infrastructure.
Pip-installable Python library for vocal fatigue scoring from raw speech. Built on ECAPA-TDNN-VHE — returns a normalized fatigue score from any audio input.
Real-time vocal health monitoring tool — streams microphone input, runs auralis-vfs, and surfaces fatigue level and vocal load over a session.
Open-source speaker verification framework using embedding cosine similarity on ECAPA-TDNN representations. Lightweight, no external APIs required.
Pakistani synthetic data library for realistic test fixtures — Urdu and Roman Urdu names, CNIC numbers, phone formats, addresses, and locale-aware records.
End-to-end LLM platform with real-time hallucination detection, structured logging, and a developer dashboard — hallucination rate, latency, P50/P95.
Multi-agent system for automated EDA, cleaning, visualization, and report generation — autonomous data analysis pipelines built on CrewAI.
Image encryption scheme scoring 100/100 against 14 benchmark algorithms across standard cryptographic security tests.
A structured, deployment-focused curriculum across 6 phases and 12.5 months. Every 1.5-hour session is live coding — Saturday introduces the concept, Sunday goes deeper into edge cases, and assignments ship to GitHub by Friday.
From a working terminal to production database architecture. Writing Python that does not break.
8 weeks
NumPy through publication-quality visualization, grounded in real statistical thinking.
8 weeks
Classical ML the engineering way: pipelines, leakage-free evaluation, and experiment tracking.
7 weeks
Neural networks from scratch to fine-tuned Transformers and speech AI pipelines.
8 weeks
Production AI APIs, disciplined LLM engineering, RAG systems, and agents that actually work.
4 weeks
Containerize, automate, ship and monitor. The difference between a notebook and a product.
3 weeks
AI Research Engineer
An AI research engineer at the intersection of rigorous research and production engineering — designing architectures from scratch, publishing reproducible research with open-source artefacts, and deploying end-to-end ML systems independently. Mentors and trains engineers in applied AI.
Beyond the cohort, the Fellowship is hands-on, project-based contribution to the lab's open-source surface. The first Fellowship runs across three active projects — you write code that real users depend on.
Extend the Pakistani synthetic data library with new locales, providers and record types.
Design and ship the public-facing engineering surface for the lab and its open-source work.
A tooling project to automatically profile, score and flag quality issues in research datasets.
We're building AI systems, conducting research, and training engineers. Every position involves real, shipped work.
3-Month · Remote · Flexible
Work on real AI systems alongside the lab — agentic pipelines, LLM applications, speech AI, and production MLOps. Designed for engineers who want serious project experience, not busy-work.
Rolling Applications
Contribute to open-source tooling, dataset annotation, or benchmarking tasks on a flexible schedule. No minimum commitment — just genuine interest in doing useful work.
Project-Based · Open
Partner with INFERENCE Lab on applied AI R&D. We work with organizations that need rigorous, reproducible engineering — not a vendor relationship, a research partnership.
Currently running our first online cohort and preparing the first Engineering Fellowship. Reach out about training, research collaboration, or AI engineering services.