Resources
That's Fresh! Newsletter
Read a selection of our past issues.
- Clearbox AI Goes Open Source – Meet Synthetic Kit! 🚀Our Open Source journey has begun!January 27, 2025
- Unwrap Our Latest News Before Christmas 🎁Catch our latest AI series and learn about groundbreaking tech developmentsDecember 18, 2024
- SURE, an open-source library for synthetic data evaluation🌎 Discover our open-source project!November 27, 2024
- 🙌 NumPy 2.0 is almost out!And: Our new data preprocessor with Polars | Interview with S2E at Italy Insurance ForumJune 5, 2024
- 😮 What a month for new LLMs!And: Datacamp webinar with ShaliniMay 22, 2024
- ✨ GenAI true value lies beyond operational enhancementsAnd: The Future of Data Protection | New updates about AI ActApril 24, 2024
- 👁 What are 1-bit Large Language Models?And: Linkedin Live about AI Act | Mastercard's Country Manager interviewed our CEOMarch 6, 2024
- LLaMAntino - Effective Text Generation in ItalianAnd: Creating train and test datasets | Use case: Detecting money muling with the help of synthetic dataFebruary 21, 2024
- 🗞️ The NY Times sues OpenAI and MicrosoftAnd: Can AI work with little data? | La Stampa: AI means developmentJanuary 10, 2024
- Synthetic Data 101 🚨And: Why synthetic data? | New project with Poste ItalianeNovember 8, 2023
- How easy is it for LLM to infer sensitive information?And: Why is data sharing important? | Our new partnership with S2EOctober 25, 2023
- Have you heard of Pythia?And: Data augmentation tutorial | Did you say AI apocalypse?August 30, 2023
- Google's answer to ChatGPTAnd: Generating synthetic data within relational databases. Let's meet at WAICF!February 8, 2023
- Understanding ChatGPT betterAnd: How to deal with imbalanced data. More about our productDecember 14, 2022
- A curated list of failed ML projectsAnd: How to build a data strategy. Clearbox AI and Bearing Point partnership.November 16, 2022
- Our open source library is now on GitHubAnd: Clearbox AI on Cybernews.June 22, 2022
- Discovering DagsterAnd: Quantifying privacy risks. Use case: a synthetic data sandbox to freely share data.June 8, 2022
- Can interaction data be fully anonymized?And: Synthetic Data for privacy preservation: understanding privacy risks. Discover our Enterprise solution.April 6, 2022
- What are GFlow nets?And: Improve models with Synthetic Data. Use case: augment financial time series.March 16, 2022
- The European Commission selected us for Women TechEU pilot project!And: What is Synthetic Data. The new Synthetic Data platform.March 09, 2022
- The EDPS on Synthetic DataAnd: From raw to good quality data. Changelogs: now you can upload unlabeled datasets.February 23, 2022
- 2022 Gartner’s Technology TrendsAnd: How to harness the power of AI in companies. Changelogs: new metrics available for your synthetic dataset.February 09, 2022
FROM AI WORLD
The holiday season is just around the corner, and like many of you, we’re preparing to take a short break. It’s the perfect time to recharge our batteries, reflect on an incredible year, and get ready for all the exciting projects we have in store for 2025.
But before we sign off for the holidays, we’d like to highlight an exciting development from Microsoft. They recently introduced Phi-4, a lightweight large language model (LLM) with 14 billion parameters, which has demonstrated remarkable performance, even surpassing GPT-4 in specific tasks. What’s truly groundbreaking is Phi-4’s innovative training approach: the majority of its training data was synthetically generated. Even more fascinating, Phi-4 outperformed the very model used to generate its synthetic training data. This breakthrough underscores the transformative potential of advanced data-generation and post-training techniques, pushing the boundaries of what’s possible in AI.
Speaking of synthetic data, we’re thrilled to share something new with you! Our latest format, "It's Time to Spill the AI", has officially launched. In this very first episode, Simona and Giulia dive into the fascinating world of synthetic data. It’s a must-watch for anyone curious about the future of AI.
We’re already hard at work on the upcoming episodes of "It's Time to Spill the AI." As one of our valued supporters, we’d love to hear your thoughts on this new series. Your insights and ideas are vital to shaping the content, ensuring it’s as engaging and informative as possible.

