Luca Zavarella

6 contributions

📄 Articles

📄 article

lucazavarella.medium.com

Mar 14, 2026

Building a Spider2-Inspired Benchmark to Measure the Real Robustness of a Fabric Data Agent in Italian

This article moves from working demos to measurable reliability by introducing a Spider2-inspired benchmark for evaluating a Fabric Data Agent in Italian. It explains why manual spot checks are not enough, and shows how to design a more rigorous evaluation framework that separates already-taught patterns from true generalization. The result is a practical benchmark design for assessing multilingual Fabric Data Agents beyond isolated successful examples.

data-agent fabric multilingual

Author: Luca Zavarella

📄 article

medium.com

Mar 4, 2026

Fabric Data Agents Are English-First (For Now): A Hands-On Guide to Configuring One on Zava DIY for Non-English Users

This article provides a hands-on, incremental guide to configuring a Microsoft Fabric Data Agent on the Zava DIY dataset for non-English users, while keeping the agent grounded in an English-first setup. It shows how to improve reliability step by step through data source descriptions, agent instructions, domain constraints, formatting rules, and validated example queries, then extends the configuration with a practical "translate in, translate out" approach. The result is a reproducible quick-win pattern for making the agent more analytics-ready across languages without introducing external translation layers or custom front ends.

fabric data-agent multilingual

Author: Luca Zavarella

📄 article

medium.com

Apr 7, 2026

New article: Which Verdicts Changed, and Why: a Row-Level Audit of Fabric Data Agent Evaluation

The author performs a detailed row‑level audit of a 72‑question benchmark to understand why evaluation verdicts changed after fixing errors in the benchmark itself. Many initial “failures” turn out to be caused by faulty ground truth, ambiguous phrasing, or inconsistent casing rules rather than true Data Agent mistakes. After refining benchmark wording, tightening Agent instructions, and clarifying metric definitions, accuracy rises to 97.2%. The few remaining errors stem from extremely complex multi‑step prompts and ambiguous schema references, revealing limits of the underlying model rather than flaws in the benchmark.

data-agent fabric evaluation

Author: Luca Zavarella

📄 article

medium.com

Jan 2, 2026

Using Microsoft Fabric Data Agent in Non-English Languages: A Practical Exploration

This article examines what Microsoft Fabric Data Agent's current non-English limitation means in practice, using Italian as a concrete business scenario. Rather than stopping at the official "English-first" guidance, it presents three pragmatic patterns for enabling multilingual experiences today: English instructions with translate-in/translate-out behavior, Copilot Studio as a multilingual front-end, and a translation gateway built around the Data Agent API. The goal is to help teams choose the right architecture for multilingual adoption without overestimating native language support.

data-agent fabric multilingual

Author: Luca Zavarella

📄 article

lucazavarella.medium.com

Mar 17, 2026

We Built the Benchmark. Now Let’s Evaluate the Fabric Data Agent for Real

This article shows how to move from a benchmark design to a real evaluation workflow for a Microsoft Fabric Data Agent. Starting from a 72-question benchmark built in a previous article for an Italian multilingual scenario, it explains how to complete the ground-truth dataset, run evaluate_data_agent on Fabric, inspect summary and row-level results, and use notebooks to operationalize the full process. A key insight is that part of the observed weakness may come not only from the Data Agent, but also from the evaluation layer itself. By inspecting the SDK source code and testing a stricter custom critic prompt, the article shows how evaluation reliability can improve significantly without changing the agent or the benchmark. Overall, the piece is a practical guide to benchmarking and evaluating Fabric Data Agents more rigorously, especially in multilingual business scenarios.

data-agent fabric evaluation multilingual

Author: Luca Zavarella

📅 Events

📅 event

lodestar.eu

May 20, 2026

Fabric Data Agent in a Day

Fabric Data Agent in a Day is a hands-on half-day workshop on Microsoft Fabric Data Agent, scheduled for 20 May 2026 in Milan, designed to show how to move from raw data ingestion to conversational agents that can answer business questions in natural language. During the session, participants populate a Lakehouse and a SQL Database in Fabric, build a first SQL-based Data Agent, make it more effective for Italian-language queries, apply Row Level Security, and measure its performance with Microsoft’s evaluation tools. The workshop then moves to a second agent built on a semantic model with DAX, so attendees can compare the semantic-model approach with the SQL-based one. Overall, the workshop is meant for data and BI professionals who want a practical introduction to building secure, multilingual, end-to-end conversational AI experiences on top of Microsoft Fabric data, using patterns that are closer to real projects than to simple demos.

workshop fabric data-agent