From Algorithms to Apples:
ML Reports

Building a shared reporting experience for the machine learning lifecycle

Reports – Hero

MY ROLE

I was the design lead, responsible for the end-to-end design process

TEAM

Manuel Bähr, Engineering Manager
Nils Braun, Software Engineer
Mats Pörschke, Software Engineer
Jake Konstantinos, Software Engineer

VERSION

v1.0

Metrics

THE PROBLEM

The AIML organization faced growing complexity in knowledge sharing, collaboration, and decision-making. Scattered tools and processes made it difficult to analyze ML workflows cohesively, resulting in duplicated efforts, delayed insights, and siloed knowledge.

Quip2

Before, ML engineers and data scientists would use tools such as Quip for reporting

Apple-full

After, reporting is centralized and integrated into the machine learning workflow

GOALS

Centralized Solutions

The goal was to build a centralized, flexible, browser-based reporting solution that allows teams to make better decisions through improved data visibility and analysis capabilities

  • Create an intuitive interface where teams can seamlessly explore their data
  • Let teams focus their time where it matters
  • Make it easy for teams to document decisions and share knowledge

DISCOVERY

The Reality of Reporting

At the outset of the project, I did not have a clear mission, and with no pre-existing insights, I partnered with the DRI to explore how users use reporting in their workflows.

I talked to machine learning engineers from machine translation, GenAI music, video engineering, and foundation models teams to understand their current challenges. Additionally, I conducted competitive analysis, watched product demos and tutorials, and learned how other products implemented reporting and analytics.

Key insights:

  • Fragmented, manual tools discouraged documentation and created knowledge gaps
  • No simple way to share interactive, drill-down reports across teams
  • Reporting required technical expertise, excluding non-technical users
  • Siloed data and tools led to inefficient workflows and poor collaboration

“My preference is always to leverage what is inside. It's much simpler in terms of billing and approval and all that kind of stuff and it makes our training ecosystem better. Which is super important to me, because my efficiency is always gonna be limited by the infrastructure that we have in house. The better the infrastructure is in house, the easier the job is for me, and the more I can concentrate on where really my value is.”
~ ML Engineering Manager

How might we easily create, share, and access interactive data visualizations and metrics so teams can effectively and efficiently analyze their machine learning workflows?

SOLUTION

From Intertia to Innovation

ML Reports is a comprehensive web-based ML reporting tool designed to streamline the creation and sharing of ML insights across the AIML organization.

The platform's dual-interface approach accommodates users of varying technical needs. A web-based intuitive editor for direct in-browser creation and modification, and a Python SDK for advanced customization. All reports are seamlessly integrated with ML Hub projects, ensuring team-wide access to insights. To promote collaboration and reduce duplicate efforts, reports can be shared at multiple levels – from individual team members to the broader Apple ML community.

Reports – Python Edit Copy@2x

Create custom visualizations with Python

Reports – Empty Copy@2x

Connect to any data source to explore your data

Reports – Preview@2x

Start with a template or from scratch

Reports – Publish v2@2x

Version reports for more flexibility and control

EXPLORATION

Navigating Ambiguity Through Strategic Prioritization

One of the biggest challenges I faced throughout this project was balancing moving forward with designs while navigating ambiguity. I was simultaneously creating mockups, doing desk research, and working with the team on the overall vision and strategy for ML Reports.

Developing product principles and feature prioritization helped drive the process forward.

Principles

With information from discovery work, I developed the following product principles:

Collaborative Intelligence
Enhance collaboration through effortless sharing of insights, visualizations, and analyses across teams.

Efficiency Through Standardization
Streamline repetitive analysis tasks, allowing teams to focus on interpretation and discovery.

Seamless Exploration
Frictionless movement between datasets, experiments, and evaluations.

Evidence-Based Decision Making
Provide structured data and analysis tools for confident decisions about model readiness.

Clarity Through Visibility
Surface comprehensive metrics across the entire ML lifecycle through intuitive visualizations.

View New Report – Producer@2x

Report producer, primary flow

View New Report – Consumer@2x

Report consumer, primary flow

With features identified for the MVP, I jumped into the design process. Thinking about how a user gets started creating a report was surprisingly challenging. For the MVP, the team established that users would view and create reports from the Reports page in ML Hub.

Apple-mvp

With the primary flow for report creation starting from the Reports page, I started with a modal that launched into an empty report, but I thought this flow could be lighter weight

Reports – Edit State v2@2x

I further explored the toolbar placement and what the empty state would look like. Here, users are presented with their options for report content, and while it was ok and encouraged to create new patterns and components, since this was a new feature and experience for ML Hub, I thought it could perhaps be more aligned with other Apple experiences.

Reports – Create – Get Started v3 Copy@2x

Taking inspiration from apps like Quip and Google Docs, I added contextual getting-started content, but this, too, could be better aligned with Apple standards, and I thought the getting-started experience would be better as a linear experience.

Reports – Empty@2x

The final version is more aligned with other Apple getting started expereinces. Users can take a product tour to familiarize themselves with the features and view documentation to help them get started. The toolbar is fixed at the top to allow for maximum screen real estate and feels familiar, like other document apps. Instead of having an edit state, as in the previous versions, there is a context switcher. For a user creating a report, they are always in the context of the “report builder” and can easily toggle to the preview state before publishing or sharing.

DESIGN

Designing Data

Reports – Comments v2@2x

Collaborative Intelligence

  • Before: Insights trapped in individual tools and workflows
  • After: Centralized platform enabling cross-team knowledge sharing
  • Impact: Teams can build upon each other's analysis rather than working in silos

All reports are seamlessly integrated with ML Hub projects. To directly address the organizational challenges of collaboration and duplication, reports can be shared at multiple levels – from individual team members to the broader Apple ML community.

Efficiency Through Standardization

  • Before: Teams practice inconsistent methods and spend time on repetitive, manual analysis tasks
  • After: Standardized processes and templates are implemented for common tasks, creating consistency and reusable frameworks
  • Impact: Reporting standards and streamlined analysis let teams focus on innovation

Teams now have a standardized, yet flexible solution for exploring and analyzing their data.

Reports – Chart Resize Copy 2@2x
Reports@2x

Seamless Exploration

  • Before: Context switching hindered the process
  • After: Frictionless movement within the ML workflow
  • Impact: Velocity accelerates as cognitive load dissipates, enabling deeper focus on insights rather than wayfinding

Centralized solutions enable teams to focus on their work rather than the infrastructure. With reporting integrated into the ML platform, analysis is seamless rather than another step or workflow.

Clarity Through Visibility

  • Before: Decisions based on fragmented, inaccessible data
  • After: Data is easily accessible for better exploration and analysis 
  • Impact: Teams can iterate faster and explore more hypotheses, leading to faster time-to-insight

Establishing best practices and transparency in the process enables teams to make better decisions.

Reports – Chart Edit Copy 5@2x

Digging Deeper

Now that the private beta launch was behind us and we had some breathing room, I took the opportunity to validate decisions and get feedback regarding the private beta release of ML Reports. I conducted six semi-moderated interviews with early adopters.

Key insights:

  • Unclear reporting capabilities prohibit adoption
  • Reporting needs to be more integrated into the platform
  • Users want to have more control over their report layout
  • Reports should be accessible to both technical and non-technical users

Unclear reporting capabilities prohibit adoption

Pain-points:

  • The barrier to entry is too high
  • Users are unsure of how to get started
  • Documentation is lacking

Unclear reporting capabilities were causing underutilization of resources. Users' uncertainty about available options means teams may be using suboptimal approaches, failing to leverage the full potential of reports, or abandoning the feature altogether.

Reports – List View – Empty Copy@2x
Reports – List View @2x
Reports Copy 3@2x
Reports – View Templates@2x

Explorations of the getting-started experience

Reports – Empty@2x

Before

  • Users were unsure of how to get started. The product tour was not informative enough and documentation was lacking.
  • Users felt the lift was too heavy.
Reports – Start Empty Copy 3@2x

After

  • More robust documentation was added in addition to 
contextual help.
  • Most users are using reports for a handful of use cases. 
Enabling users to get started with a template that supports their use case lets them get started for efficiently and effectively.
Apple-getting-started

Reporting needs to be more integrated into the platform

Pain-points:

  • Reports felt disconnected from how users work
  • We should meet users where they are in their workflow

While it made sense to start with a main reports page where users could create a report for the MVP, we needed to better integrate reporting into the ML platform. Meeting users where they are is essential to creating a product that is accessible and usable.

Save Copy 8@2x

Explorations for Experiments/Evaluations and Reports integration

Screenshot 2025-12-05 at 7.20.23 PM 1
Reports@2x

Before

  • With the only option to create a report from the Reports page, this experience did not feel as integrated as it should be.
Run Comparison@2x

After

  • Users can create a report from Experiments and Offline Evaluation, creating better integration with their workflow and easier access.
Apple-rc

Experiment Tracking

Apple-eval-report

Offline Evaluation

Reports should be accessible to both technical and non-technical users

Pain-points:

  • Users should be able to use Reports without having to know SQL or Python
  • Users want to be able to change components and have it update dynamically

Cross-functional accessibility barriers limited report adoption. The MVP was not designed for non-technical stakeholders, which resulted in knowledge silos that prevent effective decision-making across departments. Technical teams may create reports that business leaders can't interpret, while non-technical users avoid engaging with data that could inform strategic decisions.

Reports – Chart Modal@2x

Edit chart explorations

Reports – Chart Comp 3
Reports – Chart Edit Copy 4@2x

Before

  • Previously, when users would select a component it would open a modal where they would have update the SQL query or Python, and apply changes.
Reports – Chart Edit Copy 5@2x

After

  • Users can now edit a component within the Reports UI without having to open a modal, without having to code anything, and the changes are made dynamically.
Apple-chart

Users want to have more control over their report layout

Pain-points:

  • Currently, when a new component is added, it is placed at the end of the report page
  • Users want to be able to place components in a specific place on the page

When users would add a new component, it would add it to the bottom of the report. If that was not the intended location for the component, they would have to move it, resulting in too much time spent on report organization.

Reports – D&D 2@2x

Layout explorations

Reports – C2P@2x
Reports – C2P 2@2x
Reports – Chart Resize Copy@2x

Before

  • Previously, when a user added a new component, it was placed at the bottom of the page, and users would have to then move it to the correct location.
Reports – Contextual Help Copy 3@2x

After

  • Users can now drag-and-drop components in their desired location.
Apple-drag-drop

“We've been using the MLHub Reports feature for a few weeks now to track our scheduled jobs and it has proven very useful. It was much easier to get up and running with this than the alternatives (custom web page, Tableau, etc.) Thanks!”
~ ML engineer

IMPACT

Results

We met or exceeded all deadlines and we were able to onboard an additional two teams from our original goal of three.

  • Beta launch: 02/2024
  • GA: 09/2025
  • User onboarding: 5 teams
  • Features shipped: 16
  • Increased automation: 35%
  • Increased adoption: 17%
Apple-full

Selected Works

KlevaHealthHealthcare Website

OTC MarketsMarket Data Website

InvestorVisionEnterprise Software

BrightAIEnterprise Software

AT&T Business VoiceEnterprise Software