SEMLA 2026 Program

Industry Day – June 1st, 2026

Talk venue

Amphithéâtre Bernard Lamarre (C-631)

6th floor of Pavillon principal, Polytechnique Montréal

Reception venue

Atrium Lassonde (M-3500)

3rd floor of Pavillons Lassonde, Polytechnique Montréal

Industry Day brings together engineers, researchers, and technical leaders from across the software and AI industry to share hard-won lessons from building and deploying AI systems at scale. From foundation models in financial services to enterprise coding agents, the day’s talks reflect the real challenges of making AI work in production, reliably, responsibly, and cost-effectively. The day closes with a reception and poster session in the Atrium Lassonde, a prime networking opportunity to connect with fellow professionals, meet the speakers, and carry the day’s conversations forward.

8:30 – 9:00

Coffee and assorted pastries

9:00 – 9:15

SEMLA 2026 Opening Session

9:15 – 10:00

KeynoteMoving Faster with Less Risk: DRS for Code Freeze and Code Review at Scale
Peter Rigby · Concordia University and Meta

View abstract

As software engineers increasingly direct, review, and integrate AI-assisted code, organizations need better ways to decide which changes are safe to ship and which deserve scarce expert attention. This talk presents a line of work on Diff/Defect Risk Score (DRS), a just-in-time defect prediction system that estimates whether a code change is likely to cause a production fault at Meta.

The talk begins with the deployment setting at Meta, where DRS has been remarkably successful. By using LLM-based risk prediction to distinguish safer changes from riskier ones, DRS enabled release processes to move away from broad code freezes toward more targeted, risk-aware deployment decisions. This reduced unnecessary blocking of low-risk work while still helping teams control production risk.

The same risk signal can also improve code review. Change safety depends not only on whether a diff is risky, but also on whether the right people review it. The Contribution-aware Changeset Safety Ratio captures the combined expertise of the author and the reviewer team, and motivates using risk signals to focus expert reviewers on the riskiest diffs. Building on this, the talk covers reviewer recommendation systems and live experiments that add or recommend expert reviewers for high-risk changes, increasing review depth and engagement with minimal impact on review latency.

10:00 – 10:30

Coffee break

10:30 – 11:00

AI Deployment, Governance, and Lakewatch
Vincent Fortier · Databricks

View abstract

AI is being deployed faster than the controls around it, and security teams are the ones paying the bill. In this session, Vincent shows how Databricks gives you a single governed foundation for data, AI, and agents, so innovation doesn’t outrun oversight. The talk closes with a look at Lakewatch, Databricks’ open agentic SIEM built for machine-speed defense.

11:00 – 11:30

Foundation Models for Financial Services
Nikita Dvornik · RBC Borealis

View abstract

This talk explores the emerging paradigm of foundation models trained on transaction data: large-scale models that learn behavioural embeddings from billions of financial events, much like LLMs learn from text. We discuss how these models are built and used, on their own or in tandem with LLMs, and share how they unlock powerful downstream applications across credit decisioning, fraud detection, and personalized advisory, opening a new chapter in AI for financial services.

11:30 – 12:00

From Coding Assistants to Enterprise Knowledge Workers
Patrice Béchard · ServiceNow

12:00 – 13:30

Lunch break

13:30 – 14:30

Panel Discussion: The Future of Software Engineering
Moderator: Maxime Lamothe · Panelists: Peter Rigby, Suhaib Mujahid, Yu Huang

14:30 – 15:00

Lessons from Building an Enterprise Coding Agent: Open Questions and Challenges
Gustavo Pinto · Zup Innovation

15:00 – 15:30

Coffee break

15:30 – 16:00

From Idea to Production: Building an AI Code Reviewer for a Large Codebase
Suhaib Mujahid · Mozilla

View abstract

Shipping an AI reviewer into daily engineering use comes down to a series of decisions about scope, signal, filtering, and audience, and each decision has a real tradeoff. Using Mozilla’s Review Helper, an AI code reviewer running across the Firefox codebase, as a running example, this talk discusses how those decisions compound, where the non-obvious tradeoffs sit, and how the choices that look small at the start come to define what the agent can and cannot be.

We reflect on which decisions proved load-bearing in practice and on the open questions they raise for the design and evaluation of agentic systems in software engineering.

16:00 – 16:30

The Importance of Trustworthy AI in Heavily Regulated Domains
Lovedeep Gondara · Vanguard

View abstract

The deployment of AI systems in heavily regulated domains such as healthcare and finance presents unique challenges that extend beyond technical performance metrics to encompass fundamental questions of trust, accountability, and societal impact. Both sectors share critical characteristics: they involve high-stakes decisions with material consequences for individuals, operate under stringent regulatory oversight, and are fundamentally client-facing, requiring that end-users place significant trust in system outputs that may influence their health outcomes or financial wellbeing.

This talk examines the core pillars of trustworthy AI in these contexts, including transparency and explainability of model decisions, robustness to distribution shift and adversarial inputs, fairness across demographic groups, rigorous validation against domain-specific standards, and mechanisms for human oversight and intervention.

16:30 – 17:00

Robot Learning: Making Robots Learn How to Behave from Data
Julen Urain · Amazon Far

17:00 – 19:00

Reception & Poster Session
Atrium Lassonde (M-3500)

Research Day – June 2nd, 2026

Talk venue

Amphithéâtre Bernard Lamarre (C-631)

6th floor of Pavillon principal, Polytechnique Montréal

Research Day spans the full breadth of the field, from the social and ethical dimensions of AI in education to the technical challenges of building agents that are trustworthy, efficient, and safe to deploy. Talks and a panel draw on perspectives from academia and industry alike, covering multi-agent testing, reinforcement learning, LLM-based code generation, and AI systems in high-stakes domains.

8:30 – 9:00

Coffee and assorted pastries

9:00 – 10:00

KeynoteSocial Dimensions of AI: Challenges and Opportunities for Software Engineering Education
Tanja Tajmel · Concordia University

View abstract

Artificial intelligence has fundamentally transformed a wide range of industries over the past decade and is expected to reshape nearly every aspect of society in unprecedented ways. Software developers are at the forefront of this development, both as creators and as professionals whose working lives are shaped by the very technology they build. Rather than viewing AI as a neutral set of tools, it is becoming clear that it is embedded in complex social, economic, and ecological systems. From dependence on global networks of labor, data, and natural resources to the diverse applications of AI systems, these technologies raise critical questions about ethics, responsibility, and what we as humans consider a decent life. The talk explores how higher education in science and engineering fields, particularly in software engineering, can respond to these pressing questions of our times.

10:00 – 10:30

Coffee break

10:30 – 11:00

Sustainability in AI Systems: Challenges and Opportunities
Tushar Sharma · Dalhousie University

View abstract

Artificial Intelligence (AI) models are energy-hungry. As these models grow larger and become more complex, their energy consumption increases substantially. This leads to higher carbon emissions and rising operational costs, creating serious challenges for sustainable computing. This talk highlights the environmental impact of modern AI systems, introduces key metrics for measuring energy use and carbon footprint, and discusses available tools and frameworks for measuring and monitoring energy consumption.

The session summarizes current research and practical techniques for developing greener AI systems, including model pruning, quantization, efficient architecture design, and workload scheduling, and situates these within the wider movement toward responsible and sustainable AI development.

11:00 – 11:30

Testing Multi-Agent AI Systems
Nafiseh Kahani · Carleton University

View abstract

Multi-agent AI systems are increasingly used to coordinate agents, tools, permissions, and delegation across complex workflows. However, current evaluations often focus mainly on final task success, which may not show whether the system’s internal coordination structure has actually been evaluated. This talk introduces a new evaluation approach for multi-agent systems, focusing on how to evaluate agent reachability, tool access, permission boundaries, and delegation paths.

11:30 – 12:00

Towards Robust Perception for Autonomous Driving Systems
Zhijie Wang · Concordia University

View abstract

Autonomous driving systems increasingly rely on AI-enabled perception systems based on multi-sensor fusion, yet their robustness and reliability under real-world conditions remain insufficiently understood. As these systems are deployed in increasingly complex and safety-critical environments, ensuring their dependable operation has become a fundamental challenge for both academia and industry. This talk first discusses the key robustness challenges faced by modern autonomous driving perception systems, then presents recent research on automated testing techniques for improving effectiveness and efficiency through realistic multi-modal test generation and dynamic driving scenario synthesis.

12:00 – 13:30

Lunch break

13:30 – 14:00

Beyond Vibe Coding: Towards Agentic Software Engineering
Zhen Ming (Jack) Jiang · York University

View abstract

Foundation Models (FMs) are rapidly democratizing technology across many domains, and software engineering is no exception. Today, even individuals without a programming background can build functional software through natural language interactions with AI, a phenomenon called Vibe Coding. While this lowers the barrier to software creation, developing production-quality, reliable software demands something far more rigorous. This need motivates Agentic Software Engineering (Agentic SE): a discipline in which FM-powered agents plan, reason, act, and verify across the entire software engineering lifecycle, with humans maintaining meaningful oversight.

In this talk, I present our recent work along two fronts of Agentic SE. First, I introduce a dynamic benchmarking framework grounded in the Bloom’s Taxonomy that transforms static SE benchmarks into structured, multi-layered evaluations.

14:00 – 14:30

Decoding and Encoding Expertise: Toward Human-Centered AI for Software Engineering
Yu Huang · Vanderbilt University

View abstract

As AI becomes a fundamental partner in the software development lifecycle, a critical gap has emerged: while modern models excel at predicting code tokens, they remain cognitively blind to the high-level reasoning, intentionality, and memory constraints of human programmers. This cognitive misalignment results in AI models that often fail to match human judgment, leading to diminishing returns as task complexity increases.

By establishing a systematic methodology to model developers’ cognitive processes, Dr. Huang uncovers the complex relationship between programming and natural language processing while providing a framework to quantify human expertise for software engineering.

14:30 – 15:00

Challenges in Building Voice Agents for the Enterprise
Orlando E. Marquez · ServiceNow

15:00 – 15:30

Coffee break

15:30 – 16:00

Reliable Generation of Software Artifacts with LLMs: From Ad-Hoc Validation to Semantics-Aware Generation
Boqi (Percy) Chen · University of Ottawa

View abstract

Software engineering demands artifacts that satisfy rigorous syntactic and semantic constraints, yet large language models are inherently probabilistic and non-deterministic. As LLMs are increasingly used to automate software engineering tasks, ensuring the reliability of the artifacts they produce becomes a critical challenge. This talk presents recent efforts toward reliable AI-based generation of software artifacts, particularly domain-specific languages and other structured representations.

It first demonstrates how current LLMs frequently violate both syntactic and semantic constraints, then presents two complementary approaches: AbsCon, a post-processing framework that leverages self-consistency to ensure provable specification compliance, and Projectional Decoding, a semantics-guided decoding approach that proactively steers generation toward valid outputs.

16:00 – 16:30

Engineering Trustworthy AI Decision-Support Systems for Law Enforcement
Dalal Alrajeh · Imperial College London

View abstract

Artificial Intelligence (AI) and Machine Learning (ML) systems are increasingly explored to support law-enforcement investigations, including crime linkage analysis and online child sexual abuse investigations. However, many AI-enabled systems fail in practice because user requirements, operational constraints, and human responsibilities are not sufficiently understood during system design.

This talk presents a series of collaborative studies conducted with UK law-enforcement agencies on the engineering and evaluation of AI-assisted investigative systems. First, we report findings from qualitative studies examining how analysts make decisions in practice and identify requirements for scalability, explainability, trustworthiness, adaptability, and human oversight.

Second, we describe our experience applying a Goal-Oriented Requirements Engineering framework to the design of an ML-enabled suspect identification system, highlighting the strong interdependencies between goals, data, and ML performance. Finally, we present results from an industrial evaluation of an AI-enabled crime-linkage tool, showing that analysts selectively considered AI predictions while validating outputs using traditional behavioural evidence.

16:30 – 17:00

Engineering Trusted Reinforcement Learning Agents
Zhou Yang · University of Alberta

View abstract

Reinforcement learning has enabled the development of effective agents for critical tasks across domains such as robotic control, autonomous driving, AI coding assistance, and computer use. In this talk, I will discuss the trustworthiness challenges of RL-based agents. I will first uncover robustness issues in RL agents and show how curiosity-driven mechanisms can be leveraged to expose flaws in their decision-making. Next, I will discuss how to provide runtime verification and protection for these agents.

SEMTL and Tutorials – June 3rd, 2026

SEMTL room

M-1010

1st floor of Pavillons Lassonde, Polytechnique Montréal

Tutorial rooms

M-2204, L-2708

Pavillons Lassonde, Polytechnique Montréal

SEMTL and Tutorials Day brings SEMLA to a close with a morning dedicated to the SEMTL community, a recurring seminar series connecting software engineering researchers and practitioners across Montréal. After lunch, two in-depth tutorials offer a practitioner’s guide to the engineering challenges that will define the next generation of AI-driven software.

8:30 – 9:00

Coffee and assorted pastries

9:00 – 9:05

SEMTL Opening

9:05 – 10:00

KeynoteJust Fax It Over: Why Digitalizing Municipalities Is Hard
Maxime Lamothe · Polytechnique Montréal

View abstract

Cities promise smart infrastructure, real‑time data, and seamless digital services. Behind the scenes, however, many municipal systems still rely on fragmented processes, siloed governance, and aging legacy technology. This keynote cuts through the hype to show why applying software engineering to society is both essential and incredibly complex.

Drawing on the Ville_IA experience, we will explore decentralized data ownership, multi‑actor coordination across utilities and provincial agencies, and the messy reality that undermines assumptions about digital twins and urban AI. From incompatible datasets to projects managed by multiple independent entities, municipal digitalization is ultimately a socio‑technical maze rather than a simple technical challenge.

This talk delivers an honest look at why cities can struggle to fully leverage AI, offering concrete insights on what it will actually take to move from fax‑era workflows to sovereign, data‑driven urban systems.

10:00 – 10:30

Coffee break

10:30 – 11:00

Into the Unknown: Enhancing the Transparency of AI-based Software Systems
Qiaolin Qin (Isabelle) · Polytechnique Montréal

View abstract

We are shifting toward an era of AI-based software engineering, where large language models and AI agents are being continuously integrated into software systems. While their integration improves software performance, these non-deterministic models also turn software systems into unexplainable black boxes. This talk reveals transparency challenges in major AI-based software systems and proposes low-cost, high-performance strategies to promote software transparency and trustworthiness.

11:00 – 11:30

Can AI support autonomy for aging in place when self-awareness fades?
Thomas Tannou · IUGM, Université de Montréal, IVADO

View abstract

Artificial intelligence is increasingly presented as a promising tool to support aging populations, especially in contexts of cognitive decline and aging in place. However, real-world clinical practice reveals a more complex reality. Autonomy is not limited to functional independence. Cognitive disorders can alter insight, judgment, and risk perception themselves. As a result, expressed wishes may not always align with underlying needs, creating major challenges for decision-making algorithms and AI-supported care.

Drawing from clinical geriatrics and ongoing real-world technology projects involving older adults living with cognitive impairment, this talk will examine how AI systems confront human vulnerability, fluctuating cognition, and relational decision-making. Particular attention will be given to anosognosia, impaired self-awareness frequently observed in Alzheimer’s disease, as a critical but often overlooked challenge for trustworthy AI design.

11:30 – 12:00

Computational fluid dynamics and my losing fight against code complexity
Fabian Denner · Polytechnique Montréal

View abstract

Developing scientific software capable of addressing real-world fluid dynamics problems is inherently complex, particularly for researchers who were never formally trained as software engineers. In this talk, I will share my personal journey in computational fluid dynamics software development, beginning with a blank page as a PhD student in 2009, passing through ambitious attempts to build “one-size-fits-all” simulation frameworks, and eventually arriving at a more pragmatic philosophy centered on specialized tools for specific classes of flows and applications.

Beyond the technical challenges of numerical methods and classical software maintenance, onboarding new students into rapidly evolving software ecosystems is becoming an ever more challenging task.

12:00 – 13:30

Lunch break

13:30 – 15:00

Software Engineering for Foundation Models (SE4FM)
Gopi Krishnan Rajbahadur · Huawei Canada · Room M-2204 · View tutorial details

View abstract

Foundation models are among humanity’s most complex software systems. This 90-minute technical briefing presents SE4FM, a disciplined software engineering framing for foundation models organized around four pillars: DataOps, ExperimentOps, EvalOps, and FieldOps. The session gives software engineering researchers an insider view of how foundation models are built and operated at scale, while surfacing concrete software engineering challenges where new methods and tools are urgently needed.

15:30 – 17:00

Agentic Software Engineering: A Roadmap to Software Engineering 3.0
Hao Li · Queen’s University · Room M-2204 ·
Behavior-Driven Fuzzing for AI Coding Agents
Wuyang (David) Dai · York University · Room L-2708

View abstract — Agentic Software Engineering

Software Engineering is shifting from AI-assisted development to Agentic Software Engineering, where autonomous AI agents act as collaborators. This technical briefing establishes the foundations of a disciplined SE 3.0 practice, synthesizing a large-scale characterization of agent activity in the wild, an empirical study of 567 Claude Code pull requests across 157 open-source software projects, and the Structured Agentic Software Engineering framework that reframes actors, processes, artifacts, and tools.

View abstract — Behavior-Driven Fuzzing

AI coding agents powered by large language models are rapidly entering real-world software development, yet their reliability is still evaluated mostly through task-correctness benchmarks. This talk presents a behavior-driven fuzzing framework for systematically testing coding agents under realistic multi-step workflows. By mining real-world agent failure reports, we derive reusable interaction patterns and action types, compose them into repository-grounded fuzzing cases, and evaluate agent behavior through execution traces, file diffs, generated artifacts, and manual validation.