Resources
That's Fresh! Newsletter
Read a selection of our past issues.
- 🙌 NumPy 2.0 is almost out!And: Our new data preprocessor with Polars | Interview with S2E at Italy Insurance ForumJune 5, 2024
- 😮 What a month for new LLMs!And: Datacamp webinar with ShaliniMay 22, 2024
- ✨ GenAI true value lies beyond operational enhancementsAnd: The Future of Data Protection | New updates about AI ActApril 24, 2024
- 👁 What are 1-bit Large Language Models?And: Linkedin Live about AI Act | Mastercard's Country Manager interviewed our CEOMarch 6, 2024
- LLaMAntino - Effective Text Generation in ItalianAnd: Creating train and test datasets | Use case: Detecting money muling with the help of synthetic dataFebruary 21, 2024
- 🗞️ The NY Times sues OpenAI and MicrosoftAnd: Can AI work with little data? | La Stampa: AI means developmentJanuary 10, 2024
- Synthetic Data 101 🚨And: Why synthetic data? | New project with Poste ItalianeNovember 8, 2023
- How easy is it for LLM to infer sensitive information?And: Why is data sharing important? | Our new partnership with S2EOctober 25, 2023
- Have you heard of Pythia?And: Data augmentation tutorial | Did you say AI apocalypse?August 30, 2023
- Google's answer to ChatGPTAnd: Generating synthetic data within relational databases. Let's meet at WAICF!February 8, 2023
- Understanding ChatGPT betterAnd: How to deal with imbalanced data. More about our productDecember 14, 2022
- A curated list of failed ML projectsAnd: How to build a data strategy. Clearbox AI and Bearing Point partnership.November 16, 2022
- Our open source library is now on GitHubAnd: Clearbox AI on Cybernews.June 22, 2022
- Discovering DagsterAnd: Quantifying privacy risks. Use case: a synthetic data sandbox to freely share data.June 8, 2022
- Can interaction data be fully anonymized?And: Synthetic Data for privacy preservation: understanding privacy risks. Discover our Enterprise solution.April 6, 2022
- What are GFlow nets?And: Improve models with Synthetic Data. Use case: augment financial time series.March 16, 2022
- The European Commission selected us for Women TechEU pilot project!And: What is Synthetic Data. The new Synthetic Data platform.March 09, 2022
- The EDPS on Synthetic DataAnd: From raw to good quality data. Changelogs: now you can upload unlabeled datasets.February 23, 2022
- 2022 Gartner’s Technology TrendsAnd: How to harness the power of AI in companies. Changelogs: new metrics available for your synthetic dataset.February 09, 2022
FROM THE AI WORLD
The adage "innovate or die" is not an exaggeration since innovation is key for businesses to survive and thrive in the rapidly changing tech and market landscape. Although, innovation leaders and digital transformation experts in companies might find it hard to prioritise investments in the various burgeoning technologies. A great framework to consider to make these decisions and connect them to business needs is Gartner's technology trends 2022. This week we will touch upon two trends that are close to what we do: Data Fabric and Generative AI.
The concept of Data Fabric was quite new to me and I found this article useful to understand more about it. It is a design concept that can be adopted to enhance data management processes across organisations. It is built upon active metadata and machine learning models to automate data integration processes. In general I think many of the concepts included in this design space overlap with the DataOps domain and tools such as DVC and GreatExpectations.io will be integral parts of data fabric solutions.
Generative AI on the other hand is a topic at the core of our daily activities. According to Gartner ‘By 2025, generative AI will account for 10% of all data produced, up from less than 1% today’. In this scenario I feel that a lot of the focus around generative AI will move from the generation process itself towards tools to make sure that synthetic data is properly representing the data we are generating from while leaving room to synthetic creativity.
Where you need to invest in 2022
A quick guide on the technology trends that Gartner expects to have an impact on digital business and innovation in the next years. Let's explore them!
CLEARBOX AI
New Metrics for your Synthetic Dataset
Understand the value of your AI generated data in the blink of an eye thanks to the new Privacy Score and Quality Score we added in the Synthetic Dataset Report.
INTERVIEW
Harness the power of AI in companies
What does it mean for companies to adopt AI in their businesses and how can they benefit from it? Let's find out with Marina Geymonat, Head of Innovation at Sisal Lab.