Speech Dataset
Creation for AI

Power your AI/ML models with the highest-quality speech datasets in Croatian, Slovenian, Bosnian, Serbian, Montenegrin and other languages.

What we do?

We meticulously record and design every sound to meet the unique needs of each project. From building AI datasets with precise labeling to crafting immersive soundscapes and custom audio, our process combines technical expertise with creative innovation.

What languages we provide?

Our primary focus is on creating speech datasets for Croatian, Slovenian, Bosnian, Serbian, and Montenegrin. However, if you require datasets in other languages, don’t hesitate to reach out - we may be able to assist or connect you with the right resources to meet your needs.

Can we create custom datasets?

We understand that some AI projects require specialized solutions to meet specific goals and challenges. That’s why we offer custom dataset creation, adapting to the precise needs of your application. Whether it’s text-to-speech (TTS), automatic speech recognition (ASR), or natural language understanding (NLU), we design datasets that align perfectly with your project’s scope.

How we do it?

Our recordings are typically done at 48kHz in mono with 24 bits per sample, ensuring exceptional audio clarity and precision. However, we can accommodate any sample rate and bit depth you require to meet your specific project needs.

To preserve audio integrity, we incorporate clipping prevention during recording. In post-production, we enhance the recordings with noise removal, normalization and mastering, resulting in polished and professional outputs.

Our team also includes linguists and language experts who review and validate our scripts. This ensures that the content is not only accurate but also culturally and linguistically appropriate for its intended use.

We don't bite!

Get in touch!

Have a project in mind or just curious about our services? We’re here to help!

Developed by
Velos.hr