This team has spent decades at the forefront of digital accessibility. From the early days of the internet, their mission has been to ensure equal access to web and mobile experiences for everyone, including people with disabilities. Today, they’re using machine learning to drive the next generation of accessibility testing, automating what used to be fully manual processes and scaling their impact like never before.
Annotating just one web page screen for accessibility compliance can take around ten minutes. Multiply that by thousands of screens across web and mobile platforms, and the workload quickly becomes overwhelming. As their dataset grew, the team realized they needed a better solution. They turned to Databrewery for both the tooling and support to help scale their efforts. Prior to this, they were relying on open-source annotation tools stitched together with Jupyter notebooks and spreadsheets, a setup that made collaboration difficult and consistency nearly impossible.
“Before we had access to diagnostics within Databrewery, everything was a manual lift,” said Javier Moretti, Machine Learning Engineer. “We were calculating metrics on our own and trying to visualize predictions through our own tooling. The moment we moved those workflows into Databrewery, everything started moving faster. Iteration became natural, not something we dreaded.”
By using Model Diagnostics, Javier’s team was able to evaluate how their models were performing and quickly identify weak spots. When they reviewed their existing dataset, they found noisy samples that were dragging performance down. Roughly one-third of those data points were filtered out, leading to a measurable 5% improvement in performance. After re-labeling high-impact examples, results improved even further. Many of these edge cases were difficult for both labelers and models making the ability to focus and refine on known problem areas a breakthrough moment.
By refining their dataset instead of expanding it randomly, the team significantly reduced the amount of labeling required without sacrificing model quality. “We’ve seen equal or better performance using half the data,” Moretti added. “That only became possible because we were finally able to pinpoint model weaknesses and match them to the right data. Otherwise, we’d still be labeling twice as much and making half the progress.”
With Databrewery powering their workflows and a sharper focus on high-value data, the team is pushing accessibility forward through smarter, faster AI proving that inclusive technology isn’t just possible, it’s scalable.