PROGRESS UPDATES

EIC selected Sarus for funding through its prestigious Accelerator Program

February 2025

Sarus Technologies has made significant strides in developing a privacy-preserving backbone for dataset manipulation, representing a core infrastructure designed to securely run remote data analysis jobs. The fundamental principle driving this innovation is the "computation-to-data" paradigm, which ensures that sensitive information remains within its controlled environment while analyses are brought to the data. This approach aims to resolve the "innovation-privacy dilemma," enabling robust analytics and AI model training without compromising individual privacy, a critical shift from traditional data-sharing models that proved insufficient against re-identification risks.

A foundational achievement of this backbone is its sophisticated mechanism for tracing individual "Privacy Units" (PUs) across complex data transformations, including those spanning multiple rows and tables in relational databases. This capability is crucial for providing meaningful user-level Differential Privacy (DP) guarantees, which protect all data associated with a single individual as a unified entity. Furthermore, the system supports recursive DP compilation and incorporates a meticulous privacy accountant that tracks cumulative privacy loss (ϵ,δ) across iterative analytical workflows, ensuring that the predefined total privacy budget is never exceeded, even through repeated queries.
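The budget-tracking idea in the paragraph above can be illustrated with a minimal sketch. It assumes basic sequential composition, where the ε and δ of successive mechanisms simply add up; the source does not say which composition rule Sarus's accountant uses, and a production accountant would likely use a tighter one. The names `PrivacyAccountant`, `BudgetExceeded`, and `charge` are hypothetical and are not taken from the Sarus codebase.

```python
from dataclasses import dataclass
from typing import Tuple


class BudgetExceeded(Exception):
    """Raised when a query would push cumulative loss past the budget."""


@dataclass
class PrivacyAccountant:
    # Total budget fixed up front by the data owner (hypothetical field names).
    epsilon_budget: float
    delta_budget: float
    epsilon_spent: float = 0.0
    delta_spent: float = 0.0

    def charge(self, epsilon: float, delta: float = 0.0) -> None:
        """Record one mechanism's cost, or refuse it if it would overspend.

        Assumes basic composition: costs of successive mechanisms add up.
        """
        if epsilon < 0 or delta < 0:
            raise ValueError("privacy costs must be non-negative")
        new_epsilon = self.epsilon_spent + epsilon
        new_delta = self.delta_spent + delta
        # Small tolerance so float rounding does not reject an exact fit.
        tol = 1e-12
        if (new_epsilon > self.epsilon_budget + tol
                or new_delta > self.delta_budget + tol):
            raise BudgetExceeded("query refused: privacy budget would be exceeded")
        # Only commit the spend once the check has passed.
        self.epsilon_spent = new_epsilon
        self.delta_spent = new_delta

    def remaining(self) -> Tuple[float, float]:
        """Return the (epsilon, delta) still available."""
        return (max(0.0, self.epsilon_budget - self.epsilon_spent),
                max(0.0, self.delta_budget - self.delta_spent))
```

In this sketch, every query is charged before it runs. With a budget of ε = 1.0, two queries costing 0.4 each are accepted and a third is refused, which leaves the recorded spend unchanged.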

In the realm of Differentially Private SQL (DP-SQL), Sarus introduced Qrlew, an open-source library that functions as a SQL-to-SQL rewriter. It intercepts a standard SQL query, rewrites it into a differentially private counterpart, and compiles the result back into standard SQL for execution on any existing SQL datastore. Qrlew uses its own Intermediate Representation (IR), called "Relation", and relies on range propagation techniques such as k-Intervals and Piecewise-Monotonic Functions to bound query sensitivity tightly, which reduces the noise that must be added and preserves utility. It also provides a flexible, declarative language for data owners to specify how individuals are identified across complex relational schemas, enabling true user-level privacy.
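To make the link between range propagation and noise concrete, here is a minimal standalone sketch. It does not use Qrlew's API or its "Relation" IR. All function names are hypothetical, a single interval stands in for k-Intervals, and the Laplace mechanism is an assumed choice of noise mechanism. It also assumes that the cap on rows per privacy unit is enforced upstream.

```python
import random
from typing import Callable, Sequence, Tuple

Interval = Tuple[float, float]


def propagate_monotonic(f: Callable[[float], float], interval: Interval,
                        increasing: bool = True) -> Interval:
    """Output range of a monotonic f: only the two endpoints matter."""
    lo, hi = interval
    a, b = f(lo), f(hi)
    return (a, b) if increasing else (b, a)


def propagate_piecewise(f: Callable[[float], float], interval: Interval,
                        breakpoints: Sequence[float]) -> Interval:
    """Output range of an f that is monotonic between the given breakpoints.

    On each monotonic piece the extrema sit at the piece's endpoints, so
    evaluating f at the interval ends and interior breakpoints suffices.
    """
    lo, hi = interval
    points = [lo, hi] + [b for b in breakpoints if lo < b < hi]
    values = [f(p) for p in points]
    return (min(values), max(values))


def sum_sensitivity(value_range: Interval, max_rows_per_unit: int) -> float:
    """Largest change in SUM when one privacy unit is added or removed."""
    lo, hi = value_range
    return max(abs(lo), abs(hi)) * max_rows_per_unit


def laplace_noise(scale: float, rng: random.Random) -> float:
    # The difference of two i.i.d. Exp(1) draws follows Laplace(0, 1).
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def dp_sum(values: Sequence[float], value_range: Interval,
           max_rows_per_unit: int, epsilon: float,
           rng: random.Random) -> float:
    """Clipped sum plus Laplace noise scaled to sensitivity / epsilon."""
    lo, hi = value_range
    clipped = [min(max(v, lo), hi) for v in values]
    scale = sum_sensitivity(value_range, max_rows_per_unit) / epsilon
    return sum(clipped) + laplace_noise(scale, rng)
```

For a column known to lie in [-3, 2], `propagate_piecewise(lambda x: x * x, (-3.0, 2.0), [0.0])` returns `(0.0, 9.0)`, whereas treating x² as unbounded would leave the sensitivity of `SUM(x * x)` undefined. A tighter range gives a smaller sensitivity and therefore less noise for the same ε. The floating-point sampler here is for illustration only and is not hardened for production use.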

For Differentially Private Artificial Intelligence (DP-AI), Sarus has addressed the formidable challenges of applying DP-SGD (Differentially Private Stochastic Gradient Descent) to large-scale models, particularly Large Language Models (LLMs). The core innovation lies in a technology stack that synergistically combines DP-SGD with Parameter-Efficient Fine-Tuning (PEFT) methods, such as Low-Rank Adaptation (LoRA) and QLoRA, alongside optimizations like quantization and gradient checkpointing. This makes DP fine-tuning practical and efficient for enterprise applications, with empirical validation demonstrating a successful utility-privacy trade-off, including experimental support for models like Mistral 7B and Llama2 7B. Additionally, Sarus developed DP-RAG, a novel framework for Differentially Private Retrieval-Augmented Generation, applying privacy mechanisms to both document retrieval and response generation to mitigate data leakage risks.
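The core DP-SGD update mentioned above can be sketched in a few lines of plain Python. This is an illustrative version of the standard recipe (clip each example's gradient, sum, add Gaussian noise, average), not Sarus's implementation. The names `clip_l2` and `dp_sgd_step` are hypothetical, and the privacy accounting that turns the noise multiplier into an (ε, δ) guarantee is left out.

```python
import math
import random
from typing import List, Sequence


def clip_l2(grad: Sequence[float], max_norm: float) -> List[float]:
    """Scale a gradient down so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    factor = min(1.0, max_norm / norm) if norm > 0 else 1.0
    return [g * factor for g in grad]


def dp_sgd_step(params: Sequence[float],
                per_example_grads: Sequence[Sequence[float]],
                lr: float, max_norm: float, noise_multiplier: float,
                rng: random.Random) -> List[float]:
    """One DP-SGD update on a flat parameter vector.

    Each example's gradient is clipped, the clipped gradients are summed,
    Gaussian noise with standard deviation noise_multiplier * max_norm is
    added per coordinate, and the noisy sum is averaged over the batch.
    """
    dim = len(params)
    summed = [0.0] * dim
    for grad in per_example_grads:
        clipped = clip_l2(grad, max_norm)
        for i in range(dim):
            summed[i] += clipped[i]
    sigma = noise_multiplier * max_norm
    batch_size = len(per_example_grads)
    noisy_mean = [(s + rng.gauss(0.0, sigma)) / batch_size for s in summed]
    # Plain gradient descent step using the privatized gradient.
    return [p - lr * g for p, g in zip(params, noisy_mean)]
```

In a PEFT setting, `params` would stand for the small set of trainable adapter weights (for example, LoRA matrices) rather than the full model. This keeps the per-example gradients that DP-SGD has to hold in memory small, and the noise is added over far fewer coordinates. For user-level guarantees, the clipping unit would be all the examples belonging to one individual rather than a single example.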

The enterprise readiness of Sarus's privacy backbone is powerfully demonstrated through its synergistic integration with Azure Confidential Clean Rooms. This creates a sophisticated, two-layer defense-in-depth architecture: Azure Confidential Clean Rooms provide hardware-level protection using confidential computing, while the Sarus backbone acts as an automated, dynamic application-level privacy enforcement layer within this enclave. This collaboration, publicly announced with Microsoft and EY, replaces slow, manual pre-approval processes with real-time, automated privacy enforcement, enabling agile multi-party collaboration for highly sensitive data, such as financial crime detection with Canadian banks.

Overall, these achievements deliver a robust, practical system that bridges the gap between theoretical Differential Privacy and the complex demands of enterprise data science. Sarus has also released open-source Python libraries, such as arena-ai and structured-logprobs, to foster trust and broader adoption of privacy-enhancing technologies. Future work includes expanding DP mechanisms in Qrlew, improving the efficiency and utility of private LLM training and inference, and exploring new applications for this composite architecture with confidential computing.

January 2024

We are thrilled to announce that Sarus Technologies has been selected to receive funding from the European Innovation Council (EIC) Accelerator program!

The EIC Accelerator is a highly competitive funding program that supports innovative startups and small and medium-sized enterprises with the potential to disrupt existing markets and create new ones. The program provides funding, mentoring, and business acceleration services to help startups like Sarus Technologies scale up and bring their groundbreaking innovations to the market.

Sarus Technologies is a deep-tech startup that offers enterprises a way to unlock the full potential of sensitive data and to carry out research, analytics, and AI while keeping that data safe. Sarus brings a privacy-by-design approach to the modern data stack by letting data practitioners work on sensitive data without ever accessing it. Data protection is enforced automatically thanks to differential privacy, and data practitioners' experience is preserved thanks to privacy-safe synthetic data.

Our developments covered by this funding will be focusing on:

  • Backbone infrastructure and data portal: Sarus will be able to handle new varieties of data assets and data processing jobs with two objectives: offering synthetic data and keeping track of user-level information in all original and derived datasets.
  • Privacy-preserving analytics & AI: Sarus will extend support for analytics and AI tools and libraries. We will expand our API to support any data processing job (SQL, Python...) and make those jobs compatible with the privacy-preserving backbone.

Being selected for the EIC Accelerator program is a testament to the transformative potential of our technology and to the hard work and dedication of our team. We are honored to be among the few startups chosen for this prestigious program, and we look forward to leveraging the resources and expertise of the EIC to accelerate our growth and make an even greater impact in the HPC and cloud computing industries.

We would like to extend our sincere thanks to the EIC for recognizing the potential of our technology and supporting us on our journey. We are excited to be part of this vibrant and dynamic community of innovators, and we look forward to collaborating with other startups, investors, and industry leaders to drive innovation and create new opportunities for growth and prosperity.

Stay tuned for more updates on our progress and the exciting developments we have in store!
