MSc Researcher · Ontario Tech University

Dvip Patel

AI engineer & graduate researcher working on trustworthy NLP for Requirements Engineering — LLM governance, knowledge discovery, and empirical software engineering.

MSc researcher working at the intersection of NLP and Requirements Engineering (RE). My thesis presents TRAC-RE — a Traceable, Reliable, Auditable, and Contextual framework for automated requirements engineering, structured as four interconnected phases that take raw stakeholder prose to structured, audit-grade knowledge artifacts: context-aware requirement identification, audit-grade keyword extraction, governed LLM parsing, and implicit keyword discovery. The framework is grounded in an industrial FinTech/SaaS corpus of 110 requirements and 1,997 High-Level JSON (HLJ) artifacts. Across three supporting papers (CASCON 2025 published, RE'26 Main under review, RE'26 RE@Next! in revision), I argue that governance and distributional alignment — not raw model scale — are the primary levers for trustworthy RE automation.

dvippatel.math@gmail.com·GitHub LinkedIn· Oshawa, Ontario, Canada

Current Focus

Research

Thesis in progress

Toward Automated Requirements Engineering: Empirical and Architectural Foundations for Structured Parsing and Knowledge Discovery

MSc, Software Engineering · Ontario Tech University
Supervisor: Prof. Sanaa Alwidian

TRAC-RE is a framework for automated requirements engineering (RE) built from four interconnected phases. Together they take raw stakeholder prose to implicit domain-knowledge discovery — spanning context-aware requirement identification, audit-grade keyword extraction, governed LLM parsing into structured JSON, and multi-signal implicit keyword discovery, all grounded in an industrial FinTech and SaaS corpus.

TRAC-RE— Traceable, Reliable, Auditable, and Contextual

Traceable: Every keyword, tag, and classification carries per-decision provenance back to its source artifact.
Reliable: Versioned governance and confidence scoring make pipeline outputs reproducible and defensible.
Auditable: A bounded LLM-as-judge design and SME-audited samples keep human verification in the loop.
Contextual: Local-context modeling, not raw model scale, drives requirement identification accuracy.

Pages

179

Phases

Papers

HLJ artifacts

1,997

Requirements

110

The four phases

P1 · Context-aware requirement identificationRE'26 Main (under review)

Context-aware requirement identification

Sentence-level classification with structured local-context features. F1 improved from 0.664 to 0.894 on 110 real-world FinTech and SaaS documents; 4K domain-aligned samples (F1=0.894) beat 15K mixed (F1=0.883) with 73% fewer samples.

P2 · Audit-grade keyword extractionRE'26 RE@Next! (in revision)

Audit-grade keyword extraction

Perfect-precision extraction that also surfaced a structural coverage ceiling of 1.5 canonical keywords per artifact, motivating the implicit-discovery phase.

P3 · Governed LLM parsing (HLJ)CASCON 2025 (published)

Governed LLM parsing (HLJ)

Multi-model parsing into confidence-scored High-Level JSON with a versioned tag-governance stack. Tag precision improved from 0.657 to 0.897 (v2); +0.31 F1 over GPT-4.1.

P4 · Implicit keyword discoveryRE'26 RE@Next! (in revision)

Implicit keyword discovery

A multi-signal engine — neighbor transfer, graph walks over an enriched 275,164-edge co-occurrence graph, and UMAP+HDBSCAN cluster-gap detection — grounded in a 13,725-entry domain dictionary, with full per-keyword provenance.

The throughline across all four phases: governance and distributional alignment, not raw model scale, are the primary levers for trustworthy RE automation. LLMs are powerful but don't belong in every role — in TRAC-RE they parse and they tiebreak, but they never generate ground-truth knowledge without a traceable, auditable trail.

The Research, Interactive

Don't just read the thesis — operate it

Most research lives in a PDF. Mine runs. Below is a short, guided walkthrough of the actual pipeline behind the three papers — in 2 steps. You'll see how the pieces connect, tune the knobs that drive the results, and watch the core finding play out on real requirement text. No prior context needed — each step explains itself.

1. The research map2. Tune the pipeline

Step 1 of 2 · What am I looking at?

The research map

A live map of how my work fits together — each circle is a paper, a research question, a dataset, or a finding, and the lines show how they connect. Click any node to read what it is. Drag to rearrange, scroll to zoom.

Step 2 of 2 · Now make it yours

Tune the pipeline

These are the real knobs from my experiments — thresholds, models, training modes. Move a slider and the metrics (F1, precision, recall) update instantly from pre-computed results of actual runs. This is the ablation study, made playable.

Parameters

Adjust parameters to explore pre-computed results

Clustering

FAISS Threshold0.80

0.60.9

Deduplication

Deduplication Threshold0.83

0.750.92

NLU Validation

NLU Similarity Threshold0.68

0.550.78

Scoring

Top-K Tags per HLJ9

312

Model

Token Filter

SBERT Rescue Threshold (Strong)0.40

0.30.6

Results

84.7%

Precision

94.9%

Recall

79.9%

Confusion Matrix

Want the full control room?

Open the Research Lab

Everything above — the map, the knobs, the pipeline stages, and a simulated peer-reviewer that critiques the methodology — in one full-screen workspace. This is the closest thing to sitting next to me while I run experiments.

Selected Work

Publications

[3]Published★ Open Research Object
Improving Reliability of LLMs in RE with Structured Confidence & Tag Governance
Dvip Patel, Sanaa Alwidian · CASCON 2025, 2025
Read →
A modular multi-model LLM pipeline that converts raw stakeholder requirements into High-Level JSON (HLJ) artifacts with confidence-scored fields, paired with a versioned tag-governance system that catches prompt-leak exploitation, hallucinated tags, and low-agreement outputs before they reach downstream stages.
LLM governanceRequirements EngineeringTag validationHLJ artifactsMulti-model benchmarking
[2]In Revision
From Explicit to Implicit: Towards Traceable Keyword Discovery in Requirements Engineering
Dvip Patel, Amarachi Nwosu, Sanaa Alwidian · RE'26, RE@Next!, 2026
Explicit keyword extraction — even at audit-grade precision — hits a structural ceiling of roughly 1.5 canonical keywords per HLJ artifact, with Jaccard agreement below 0.11 across KeyBERT, RAKE, and YAKE. This paper presents a 5-phase implicit keyword discovery engine combining a 13,725-entry domain dictionary, a 275,164-edge enriched co-occurrence graph, UMAP+HDBSCAN clustering, and a bounded LLM-as-judge that tiebreaks only the borderline scoring band. Every implicit keyword traces back to its discovery signal(s) with full per-signal evidence.
Implicit knowledge discoveryGraph-based NLPUMAP / HDBSCANLLM-as-judgeAudit trails
[1]Under Review
Towards Improving Sentence-Level Requirements Identification via Explicit Local Context Modeling
Dvip Patel, Sanaa Alwidian · RE'26, Main Track, 2026
An empirical study of sentence-level requirement classification on 110 real-world FinTech and SaaS documents (~5,700 candidate sentences, balanced to 15K mixed / 4K domain-only). We compare all-mpnet-base-v2 (110M) and DeBERTa-v3-base (184M) across frozen / LoRA / full fine-tuning and context window sizes k=0–3, and show that structured local-context features — not raw concatenation — are the critical signal.
Requirements classificationLocal context modelingLoRA fine-tuningDeBERTa-v3Empirical software engineering

Where I've worked

Experience

Graduate Researcher (MSc, Software Engineering) · Ontario Tech University
Sep 2024 – Present
Oshawa, ON · Supervisor: Prof. Sanaa Alwidian
- Designed TRAC-RE — a Traceable, Reliable, Auditable, and Contextual framework for automated Requirements Engineering, built from four interconnected phases that take raw stakeholder prose to implicit domain-knowledge discovery, grounded in an industrial FinTech/SaaS corpus of 110 requirements and 1,997 High-Level JSON (HLJ) artifacts.
- Phase 1 — Context-aware requirement identification: empirical study of sentence-level classification (all-mpnet-base-v2, DeBERTa-v3-base) across frozen / LoRA / full fine-tuning and context windows k=0–3, showing structured local context adds +16 F1 points (0.664 → 0.894) and that 4K domain-aligned samples (F1=0.894) beat 15K mixed (F1=0.883). Supporting paper submitted to RE'26 Main.
- Phase 3 — Governed LLM parsing: built and benchmarked a multi-model parsing pipeline (GPT-4.1, Claude Opus 4, Meta-70B) producing confidence-scored HLJ artifacts with a versioned tag-governance stack (Harvest → Filter → Cluster → Validate → Whitelist → Audit → Drift); tag precision improved 0.657 → 0.897 at the strictest v2 stage. Supporting paper published at CASCON 2025.
- Phases 2 & 4 — Audit-grade extraction and implicit discovery: surfaced a structural coverage ceiling of 1.5 canonical keywords per artifact, then architected a multi-signal discovery engine combining neighbor transfer, graph walks over a 275,164-edge enriched co-occurrence graph, and UMAP+HDBSCAN cluster-gap detection, with a bounded LLM-as-judge tiebreaker and full per-keyword provenance. In revision at RE'26 RE@Next! with Amarachi Nwosu.
- Co-designed a domain dictionary of 13,725 entries / 35,799 lookup keys / 108 detected abbreviations used as synonym normalizer, confidence booster, stoplist, and novelty flagger across the framework.
- Translated the framework into 3 supporting papers across CASCON and RE'26; conducted literature reviews, experimental design, ablation planning (hop depth, signal composition, dictionary impact), and supervisory reporting.
AI Engineer · Palomino Systems
Nov 2025 – Present
Remote · Part-time
- Designed and shipped Laravel Upgrader, a fully autonomous CLI-driven AI agent that migrates legacy Laravel codebases end-to-end — a 10-step pipeline (analysis → planning → transformation → validation → self-healing) processing ~10k LOC/run at under $80/run, with 85–90% automated issue resolution via detect → fix → retry loops.
- Measured two full migrations by hand first, then automated those exact steps — cutting manual migration effort by ~80% (a baseline-grounded number, not an estimate).
- Owned the codebase end-to-end and was the primary technical contact with Palomino stakeholders, presenting the agent architecture and translating tradeoffs into prioritized recommendations.
Founding Engineer · Mediabridge
Apr 2025 – Present
Remote · Part-time
- Co-built and launched a multi-tenant SaaS platform from scratch, owning architecture decisions across frontend, backend, and AWS deployment.
- Personally built the core systems — a custom role-based access control (RBAC) layer, a dynamic form builder, and an event-driven notification system.
- Owned the full lifecycle from requirements through production launch and served as primary technical contact for the client.
Laravel Developer · Finserve Infotech
Jan 2024 – Dec 2024
India
- Built and delivered 4 ERP / POS systems in Laravel, covering data models, business logic, and client-facing workflows.

Built & Shipped

Projects

Inbox Intelligence Platform

Gmail-native communication intelligence layer — one ingestion pipeline, six analysis modules.

Built at a 48-hour hackathon. Turns unstructured email into structured, actionable intelligence: a single ingestion + parsing core (Claude classifier, entity extractor, temporal indexer) feeds six independent modules — financial waste detection, relationship decay signals, contract/obligation tracking, follow-up commitments, a RAG-based institutional memory, and health-admin tracking. One parse, many consumers — no redundant API calls, and an entity graph that compounds over time. Currently being taken from hackathon build to a live product.

Claude SonnetGmail MCPGoogle Calendar MCPRAG / pgvectorPostgreSQLReactTailwindNode.js

Laravel Upgrader

Autonomous 10-step AI migration agent for legacy Laravel codebases.

CLI-driven migration agent that ingests legacy Laravel code and runs analysis → planning → transformation → validation → self-healing loops. Processes ~10k LOC per run at under $80 cost, with 85–90% automated issue resolution.

PythonClaude / LLM OrchestrationLaravelCLISelf-healing retry loops

TRAC-RE — Automated Requirements Engineering Framework

Research

A Traceable, Reliable, Auditable, and Contextual framework for automated Requirements Engineering — structured parsing, context-aware classification, and implicit keyword discovery.

The research framework behind my MSc thesis at Ontario Tech University. **TRAC-RE** — Traceable, Reliable, Auditable, and Contextual — converts raw stakeholder prose into audit-grade knowledge artifacts across four interconnected phases: 1. **Phase 1 — Context-aware requirement identification** — Sentence-level classification with structured local-context features. F1 improved 0.664 → 0.894 on 110 FinTech/SaaS documents; 4K domain-aligned samples (F1=0.894) beat 15K mixed (F1=0.883). Supporting paper under review at RE'26 Main. 2. **Phase 2 — Audit-grade keyword extraction** — Perfect-precision extraction that surfaced a structural coverage ceiling of 1.5 canonical keywords per artifact, motivating the implicit-discovery phase. Supporting paper in revision at RE'26 RE@Next!. 3. **Phase 3 — Governed LLM parsing (HLJ)** — Multi-model parsing (GPT-4.1, Claude Opus 4, Meta-70B) into confidence-scored High-Level JSON with versioned tag governance (v0 → v2); tag precision 0.657 → 0.897. Supporting paper published at CASCON 2025. 4. **Phase 4 — Implicit keyword discovery** — A 5-stage engine over a 275,164-edge enriched co-occurrence graph, with UMAP+HDBSCAN cluster-gap detection and a bounded LLM-as-judge tiebreaker. Supporting paper in revision at RE'26 RE@Next!. **Cross-cutting infrastructure** — 1,997 HLJ artifacts, a 13,725-entry domain dictionary, a FAISS index over 768-dim SBERT vectors, full per-decision audit logging, and drift monitoring across pipeline runs. The framework's central argument: governance and distributional alignment, not raw model scale, are the primary levers for trustworthy RE automation.

PythonPyTorchHuggingFaceSBERT / MPNetDeBERTa-v3FAISSUMAPHDBSCANLoRA

For the full engineering breakdown — story, stack, and impact — see the corporate view →

The Toolkit

Skills & Education

Technical competencies

Research Areas

Requirements Engineering, NLP for Software Engineering, LLM Governance, Knowledge Discovery, Empirical SE

Languages

Python, TypeScript / JavaScript, PHP (Laravel), SQL, Bash

AI / ML

PyTorch, HuggingFace Transformers, sentence-transformers, SBERT / MPNet, MiniLM, DeBERTa-v3, KeyBERT, RAKE, FAISS, UMAP + HDBSCAN, LoRA Fine-tuning, RAG Pipelines, Anthropic / Claude API, MCP, Aider

Frontend

React, Next.js, Tailwind CSS, Zustand

Backend / Cloud

Laravel, Flask, REST APIs, Microservices, RBAC, Docker, AWS, CI/CD, GitHub Actions

Testing

Vitest, fast-check (property-based), Testing Library

Education

MASc, Software Engineering (Thesis)
Ontario Tech University
2024 – Present · Oshawa, ON, Canada
Thesis: “Toward Automated Requirements Engineering: Empirical and Architectural Foundations for Structured Parsing and Knowledge Discovery”

Certifications

AWS Certified Cloud Practitioner — Amazon Web Services (2024)

Teaching & Authored

OVIN EV Micro-Credential Course — Ontario Vehicle Innovation Network (2024)

Let's talk

Get in touch

I'm open to research collaboration, AI engineering work, and good conversations about NLP for software engineering. The fastest way to reach me is email.

✉ dvippatel.math@gmail.com GitHub LinkedIn

Research

Toward Automated Requirements Engineering: Empirical and Architectural Foundations for Structured Parsing and Knowledge Discovery

Context-aware requirement identification

Audit-grade keyword extraction

Governed LLM parsing (HLJ)

Implicit keyword discovery

Don't just read the thesis — operate it

The research map

Tune the pipeline

Parameters

Open the Research Lab

Publications

Improving Reliability of LLMs in RE with Structured Confidence & Tag Governance

From Explicit to Implicit: Towards Traceable Keyword Discovery in Requirements Engineering

Towards Improving Sentence-Level Requirements Identification via Explicit Local Context Modeling

Experience

Graduate Researcher (MSc, Software Engineering) · Ontario Tech University

AI Engineer · Palomino Systems

Founding Engineer · Mediabridge

Laravel Developer · Finserve Infotech

Projects

Inbox Intelligence Platform

Laravel Upgrader

TRAC-RE — Automated Requirements Engineering Framework

Skills & Education

Get in touch