{"id":35672,"date":"2024-06-20T09:30:57","date_gmt":"2024-06-20T07:30:57","guid":{"rendered":"https:\/\/www.oneword.de\/datenbereinigung-onecleanup\/"},"modified":"2025-10-06T13:36:30","modified_gmt":"2025-10-06T11:36:30","slug":"cleaning-up-data-onecleanup","status":"publish","type":"post","link":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/","title":{"rendered":"oneCleanup: cleaning up data made easy"},"content":{"rendered":"<p><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-padding-top:0px;--awb-padding-right:0px;--awb-padding-bottom:0px;--awb-padding-left:0px;--awb-background-color:#82a0a7;--awb-flex-wrap:wrap;\" id=\"opener\" ><div class=\"fusion-builder-row fusion-row\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_2_3 2_3 fusion-two-third fusion-column-first\" style=\"--awb-padding-top:105px;--awb-padding-bottom:106px;--awb-bg-size:cover;width:66.666666666667%;width:calc(66.666666666667% - ( ( 4% ) * 0.66666666666667 ) );margin-right: 4%;\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-text fusion-text-1\"><p><small> 07\/06\/2024<\/small><\/p>\n<\/div><div class=\"fusion-title title fusion-title-1 fusion-sep-none fusion-title-text fusion-title-size-one\" style=\"--awb-sep-color:#82a0a7;\"><h1 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">oneCleanup: cleaning up data made easy<\/h1><\/div><div class=\"fusion-text fusion-text-2\"><p>Data is the new gold: it&#8217;s valuable and required for a range of processes, applications and developments. Generative AI in particular is showing once again what large amounts of data can create. 
But data is also the new rubbish: it appears in a wide variety of places and in large quantities, accumulates quickly, rarely shrinks and sometimes grows uncontrollably. And the bigger the mountain of data, the more difficult it becomes to use it meaningfully. Our oneCleanup service takes on this challenge and helps to uncover the shimmering gold beneath the layer of dirt. We present the background information and details, and we demonstrate why it&#8217;s high time that databases are seen not as a tangled mess but as treasure troves.<\/p>\n<\/div><div class=\"fusion-button-wrapper\"><a class=\"fusion-button button-flat fusion-button-default-size button-custom fusion-button-default button-1 fusion-button-default-span fusion-button-default-type button-white\" style=\"--button_accent_color:#ffffff;--button_accent_hover_color:#676362;--button_border_hover_color:#676362;--button_gradient_top_color:rgba(249,157,28,0);--button_gradient_bottom_color:rgba(249,157,28,0);--button_gradient_top_color_hover:rgba(182,106,0,0);--button_gradient_bottom_color_hover:rgba(182,106,0,0);\" target=\"_self\" href=\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#information\"><span class=\"fusion-button-text\">Read more here<\/span><i class=\"fa-chevron-right fas button-icon-right\" aria-hidden=\"true\"><\/i><\/a><\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-2 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-padding-top:90px;--awb-flex-wrap:wrap;\" id=\"information\" ><div class=\"fusion-builder-row fusion-row\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-1 fusion_builder_column_2_3 2_3 fusion-two-third fusion-column-first\" 
style=\"--awb-padding-right:12%;--awb-bg-size:cover;width:66.666666666667%;width:calc(66.666666666667% - ( ( 4% ) * 0.66666666666667 ) );margin-right: 4%;\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-title title fusion-title-2 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Terminology and translation memory \u2013 what&#8217;s the difference?<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-3\"><p>The kind of data that is relevant differs for each area of life and each area of business. With <a href=\"https:\/\/www.oneword.de\/en\/language-data-clean-up\/\">oneCleanup<\/a>, we concentrate on language data and focus on the two most important types of data in the translation sector: translation memories and terminology databases.<\/p>\n<p>In a translation memory (TM), the source text and the corresponding translation are stored segment by segment. The TM is therefore the translator&#8217;s digital memory. Each new text to be translated is compared with all previous projects stored in the memory, and identical or similar segments are recognised. These segments then do not have to be translated again \u2013 which might result in a different translation altogether. Instead, the existing translation can simply be reused, or adapted as necessary. This clearly saves time and money, as existing segments do not cost the full amount.<\/p>\n<p>However, a terminology database is different, as it does not contain complete sentences. Instead, it contains entries for terms with the matching terms in the target language, illustrations, definitions and additional information. 
A terminology database is given priority over the TM in the translation process. When a translation is being produced, it identifies the terms that occur within a segment and displays their target-language equivalents. During or after translation, a terminology check verifies that the specified terms have been used correctly.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-3 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Historical growth<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-4\"><p>The two databases therefore contain different language-related data. What both have in common is that the data accumulates quickly: the number of segments stored in the TM grows with each translation project, and the terminology database grows with every new entry or term. Large projects or imports of prepared terminology lists can lead to substantial and sometimes uncontrolled growth. In day-to-day translation work, however, only a few companies have established processes for regular checks and data maintenance, let alone for targeted data clean-ups. After all, it is tempting to think: the more data, the better. Every segment available in the TM could be required again elsewhere; every term recorded could save research time and increase consistency. 
So do large amounts of data save time and money?<\/p>\n<p>In practice, the opposite is often the case: large amounts of data quickly become confusing and therefore more difficult to handle. Databases then continue to grow in an uncontrolled manner and the data becomes unclean. And unclean data is much more difficult to use in a meaningful way. If, for example, a terminology database contains duplicates with different information or if a TM contains two different translations for an almost identical source segment, this disrupts the translation process and increases the effort spent researching and selecting the correct data. If the corresponding segments in the TM or entries in the terminology database are not corrected, this effort is repeated each time they appear during translation projects. At the same time, the data from the TM and the terminology database is becoming increasingly relevant outside of the translation process. This is because language data can be used for processes and applications in very different scenarios.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-4 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Language data for a wide range of applications<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-5\"><p>For a number of reasons, language data is finally being valued as it should have been long ago, whether in knowledge management, the targeted use of large language models (LLMs) or machine translation. 
Here are two examples of scenarios that use language data.<\/p>\n<p>In scenario 1, a company wants to train a chatbot for German and English to respond to support requests in both languages. The AI-supported assistant will be based on existing manuals, which are used to generate responses. Translations from the last ten years are used to provide sufficient input for the training. However, the TM data has never been adapted to changes in terminology and the interface texts have also changed in the meantime. The chatbot could therefore display outdated information or refer to buttons that no longer exist. The TM used for training also contains numerous duplicates and fragments, as segmentation was not always optimal during the translation process. This means that the AI system has a lot of data input but can learn nothing at all, or nothing meaningful, from it. However, for many models users are charged based on tokens, i.e. the smallest units used to process text are counted. This means that both the quantity and the quality of the input are decisive.<\/p>\n<p>In scenario 2, the content of the terminology database is going to be used as a glossary for machine translation. In an ideal world, all entries would be transferred to the MT system and implemented correctly and consistently by the MT engine. In reality, however, terminology databases often contain thousands of entries that are supposed to serve as input. These entries may contain contradictions, be ambiguous or contain translated terms from different areas. Converting an extensive database into a glossary can also mean that every second word that is being translated is suddenly specified by the glossary. The MT system\u2019s good and fluent translation then quickly becomes a series of specified terms, which can significantly change and negatively impact the output. For this scenario, too, the quantity and quality of the data are crucial for the data to be used in a meaningful way. 
It&#8217;s clear that in both these scenarios, the data must be cleaned up. This is where our oneCleanup service comes into play.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-5 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Potential for cleaning up data and practical implementation<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-6\"><p>The aim of cleaning up TM and terminology data is therefore to obtain a reduced and clean database. To analyse where there is potential for cleaning up data, i.e. what can be cleaned up, we use automation and scripting so that we can evaluate the large volumes of data quickly and effectively.<\/p>\n<p>For both types of data, we consider five key points:<\/p>\n<ul>\n<li>Incorrect forms<\/li>\n<li>Incorrect source-target pairings<\/li>\n<li>Duplicates and similar data<\/li>\n<li>Missing information<\/li>\n<li>Outdated data<\/li>\n<\/ul>\n<p>What each check then specifically targets varies greatly depending on the type of data. Incorrect form in the terminology database includes, for instance, terms that have been capitalised even though they should be in lower case. However, when analysing TM data, this criterion in the check returns segments that end with different punctuation marks, for example.<\/p>\n<p>With oneCleanup, we can analyse databases of any size. The steps in the check can be extended on a case-by-case basis to fulfil all company requirements. 
This is because no two TMs or terminology databases are structured or populated in exactly the same way.<\/p>\n<p>The potential for cleaning up the data is presented as clear analysis results. Our oneCleanup service is highly automated, making it possible to rapidly assess the actual effort required to clean up the data. As always where quality and informed decisions are required, people are then involved to evaluate the results and determine and implement the necessary measures. Changes and corrections can be made immediately or data can be marked for deletion. The analysis results also support an iterative approach in which the clean-up steps are implemented gradually.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-6 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Conclusion: Clean data through expertise<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-7\"><p>Data is only the new gold if it is regularly checked and cleaned up. In every area in which language data can be used, quality is more important than quantity. With oneCleanup, we use our decades of language and process expertise to analyse data from TMs and terminology databases in an efficient, resource-saving way and to unlock the potential of data clean-up.<\/p>\n<\/div><div class=\"fusion-text fusion-text-8\"><p><em>Would you like to find out more about oneCleanup? 
If so, our <a href=\"https:\/\/www.oneword.de\/en\/contact-us\/\">experts<\/a> will be happy to arrange a consultation.<\/em><\/p>\n<\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-2 fusion_builder_column_1_3 1_3 fusion-one-third fusion-column-last\" style=\"--awb-bg-size:cover;width:33.333333333333%;width:calc(33.333333333333% - ( ( 4% ) * 0.33333333333333 ) );\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-widget-area awb-widget-area-element fusion-widget-area-1 fusion-content-widget-area\" style=\"--awb-title-color:#676362;--awb-padding:0px 0px 0px 0px;\">\n\t\t<section id=\"recent-posts-2\" class=\"widget widget_recent_entries\">\n\t\t<div class=\"heading\"><h4 class=\"widget-title\">Recent Posts<\/h4><\/div>\n\t\t<ul>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/5-reasons-why-users-love-onesuite\/\">5 reasons why users love oneSuite<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/christmas-donation-nph-kinderhilfe-lateinamerika-2025\/\">A portion of hope: Christmas donation for children&#8217;s charity nph<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/glossary-creation-at-the-touch-of-a-button\/\">Glossary creation at the touch of a button?<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/language-data-for-llm-use\/\">Language data for LLM use: making AI an expert on your company<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/tcworld-conference-2025\/\">oneword at the 2025 tcworld conference<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t<\/ul>\n\n\t\t<\/section><div class=\"fusion-additional-widget-content\"><\/div><\/div><div 
class=\"fusion-clearfix\"><\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-3 hundred-percent-fullwidth non-hundred-percent-height-scrolling fusion-equal-height-columns\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-padding-top:0px;--awb-padding-right:0px;--awb-padding-bottom:0px;--awb-padding-left:0px;--awb-flex-wrap:wrap;\" id=\"closer\" ><div class=\"fusion-builder-row fusion-row\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-3 fusion_builder_column_2_5 2_5 fusion-two-fifth fusion-column-first competence\" style=\"--awb-padding-top:8%;--awb-padding-right:10%;--awb-padding-bottom:5%;--awb-padding-left:38%;--awb-bg-color:#82a0a7;--awb-bg-color-hover:#82a0a7;--awb-bg-image:url(&#039;https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/10\/8-gruende-footer-icon.png&#039;);--awb-bg-position:left center;--awb-bg-repeat:repeat-y;width:40%;width:calc(40% - ( ( 4% ) * 0.4 ) );margin-right: 4%;\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy fusion-column-has-bg-image\" data-bg-url=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/10\/8-gruende-footer-icon.png\"><div class=\"fusion-title title fusion-title-7 fusion-sep-none fusion-title-text fusion-title-size-five\" style=\"--awb-margin-bottom:32px;\"><h5 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">8 good reasons to choose oneword.<\/h5><\/div><div class=\"fusion-text fusion-text-9\"><p>Learn more about what we do and what sets us apart from traditional translation agencies.<\/p>\n<p>We explain 8 good reasons and more to choose oneword for a successful partnership.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:30px;width:100%;\"><\/div><div 
class=\"fusion-sep-clear\"><\/div><div class=\"fusion-button-wrapper\"><a class=\"fusion-button button-flat fusion-button-default-size button-custom fusion-button-default button-2 fusion-button-default-span fusion-button-default-type button-white\" style=\"--button_accent_color:#ffffff;--button_accent_hover_color:#676362;--button_border_hover_color:#676362;--button_gradient_top_color:rgba(249,157,28,0);--button_gradient_bottom_color:rgba(249,157,28,0);--button_gradient_top_color_hover:rgba(182,106,0,0);--button_gradient_bottom_color_hover:rgba(182,106,0,0);\" target=\"_self\" href=\"https:\/\/www.oneword.de\/en\/translation-company-stuttgart\/\"><span class=\"fusion-button-text\">Explore reasons<\/span><i class=\"fa-chevron-right fas button-icon-right\" aria-hidden=\"true\"><\/i><\/a><\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div>\n<div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-4 fusion_builder_column_3_5 3_5 fusion-three-fifth fusion-column-last contact\" style=\"--awb-padding-top:8%;--awb-padding-right:30%;--awb-padding-bottom:5%;--awb-padding-left:10%;--awb-bg-color:#f99d1c;--awb-bg-color-hover:#f99d1c;--awb-bg-size:cover;width:60%;width:calc(60% - ( ( 4% ) * 0.6 ) );\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-title title fusion-title-8 fusion-sep-none fusion-title-text fusion-title-size-five\" style=\"--awb-margin-bottom:39px;\"><h5 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Request a quotation<\/h5><\/div>\n<div class=\"wpcf7 no-js\" id=\"wpcf7-f6607-o1\" lang=\"de-DE\" dir=\"ltr\" data-wpcf7-id=\"6607\">\n<div class=\"screen-reader-response\"><p role=\"status\" aria-live=\"polite\" aria-atomic=\"true\"><\/p> <ul><\/ul><\/div>\n<form action=\"\/en\/wp-json\/wp\/v2\/posts\/35672#wpcf7-f6607-o1\" method=\"post\" class=\"wpcf7-form init\" aria-label=\"Kontaktformular\" novalidate=\"novalidate\" data-status=\"init\">\n<fieldset 
class=\"hidden-fields-container\"><input type=\"hidden\" name=\"_wpcf7\" value=\"6607\" \/><input type=\"hidden\" name=\"_wpcf7_version\" value=\"6.1.4\" \/><input type=\"hidden\" name=\"_wpcf7_locale\" value=\"de_DE\" \/><input type=\"hidden\" name=\"_wpcf7_unit_tag\" value=\"wpcf7-f6607-o1\" \/><input type=\"hidden\" name=\"_wpcf7_container_post\" value=\"0\" \/><input type=\"hidden\" name=\"_wpcf7_posted_data_hash\" value=\"\" \/>\n<\/fieldset>\n<div class=\"contact-form\">\n\t<div class=\"fusion-builder-row fusion-row\">\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_2  fusion-one-half fusion-column-first 1_2\">\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-anliegen\"><select class=\"wpcf7-form-control wpcf7-select wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" name=\"anfrage-anliegen\"><option value=\"Your enquiry *\">Your enquiry *<\/option><option value=\"Translation\/Localisation\">Translation\/Localisation<\/option><option value=\"Terminology\">Terminology<\/option><option value=\"Machine Translation\">Machine Translation<\/option><option value=\"Post-editing (MTPE)\">Post-editing (MTPE)<\/option><option value=\"International SEO\">International SEO<\/option><option value=\"Transcreation\">Transcreation<\/option><option value=\"Technologies\/Processes\">Technologies\/Processes<\/option><option value=\"Consulting\">Consulting<\/option><option value=\"Price list\">Price list<\/option><option value=\"Other\">Other<\/option><\/select><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-name\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Full name *\" value=\"\" type=\"text\" name=\"anfrage-name\" 
\/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-firma\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Company *\" value=\"\" type=\"text\" name=\"anfrage-firma\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-mail\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-email wpcf7-validates-as-required wpcf7-text wpcf7-validates-as-email\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"E-mail address *\" value=\"\" type=\"email\" name=\"anfrage-mail\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-telefon\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Tel. no. 
*\" value=\"\" type=\"text\" name=\"anfrage-telefon\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_2  fusion-one-half 1_2\">\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-text\"><textarea cols=\"40\" rows=\"10\" maxlength=\"2000\" class=\"wpcf7-form-control wpcf7-textarea wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Your message (please state required language)\" name=\"anfrage-text\"><\/textarea><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span id=\"wpcf7-69de27cdaea83-wrapper\" class=\"wpcf7-form-control-wrap vogelnest-wrap\" style=\"display:none !important; visibility:hidden !important;\"><label for=\"wpcf7-69de27cdaea83-field\" class=\"hp-message\">Bitte lasse dieses Feld leer.<\/label><input id=\"wpcf7-69de27cdaea83-field\"  class=\"wpcf7-form-control wpcf7-text\" type=\"text\" name=\"vogelnest\" value=\"\" size=\"40\" tabindex=\"-1\" autocomplete=\"new-password\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n\t<div class=\"fusion-builder-row fusion-row privacy\">\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_1  fusion-one-full 1_1\">\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-datenschutz\"><span class=\"wpcf7-form-control wpcf7-checkbox wpcf7-validates-as-required\"><span class=\"wpcf7-list-item first last\"><input type=\"checkbox\" name=\"anfrage-datenschutz[]\" value=\"I agree that oneword GmbH may contact me and store the data that I provide.\" \/><span class=\"wpcf7-list-item-label\">I agree that oneword GmbH may contact me and store the data that I provide.<\/span><\/span><\/span><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n\t<div 
class=\"fusion-builder-row fusion-row\">\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_5  fusion-one-fifth 1_5\">\n\t\t\t<div class=\"fusion-column-wrapper send-form\">\n\t\t\t\t<p><input class=\"wpcf7-form-control wpcf7-submit has-spinner\" type=\"submit\" value=\"Submit request\" \/>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/div><div class=\"fusion-alert alert custom alert-custom fusion-alert-center wpcf7-response-output fusion-alert-capitalize awb-alert-native-link-color alert-dismissable awb-alert-close-boxed\" style=\"--awb-border-size:1px;--awb-border-top-left-radius:0px;--awb-border-top-right-radius:0px;--awb-border-bottom-left-radius:0px;--awb-border-bottom-right-radius:0px;\" role=\"alert\"><div class=\"fusion-alert-content-wrapper\"><span class=\"fusion-alert-content\"><\/span><\/div><button type=\"button\" class=\"close toggle-alert\" data-dismiss=\"alert\" aria-label=\"Close\">&times;<\/button><\/div>\n<\/form>\n<\/div>\n<div class=\"fusion-clearfix\"><\/div><\/div><\/div>\n<\/div><\/div><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Data is the new gold: it&#8217;s valuable and required for a range of processes, applications and developments. Generative AI in particular is showing once again what large amounts of data can create. But data is also the new rubbish: it appears in a wide variety of places and in large quantities, accumulates quickly, rarely shrinks and sometimes grows uncontrollably. And the bigger the mountain of data, the more difficult it becomes to use it meaningfully. Our oneCleanup service takes on this challenge and helps to uncover the shimmering gold beneath the layer of dirt. 
We present the background information and details, and we demonstrate why it&#8217;s high time that databases are seen not as a tangled mess but as treasure troves.<\/p>\n","protected":false},"author":16,"featured_media":35449,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[13],"tags":[1459],"class_list":["post-35672","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-data-clean-up"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Cleaning up data made easy with oneCleanup \u2013 oneword Blog<\/title>\n<meta name=\"description\" content=\"Cleaning up data with oneCleanup: We explain the most important aspects of cleaning up data and present our oneCleanup solution. | Get the information now!\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cleaning up data made easy with oneCleanup \u2013 oneword Blog\" \/>\n<meta property=\"og:description\" content=\"Cleaning up data with oneCleanup: We explain the most important aspects of cleaning up data and present our oneCleanup solution. 
| Get the information now!\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\" \/>\n<meta property=\"og:site_name\" content=\"oneword Fach\u00fcbersetzungen\" \/>\n<meta property=\"article:published_time\" content=\"2024-06-20T07:30:57+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-06T11:36:30+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup-fallback.jpeg\" \/>\n\t<meta property=\"og:image:width\" content=\"1536\" \/>\n\t<meta property=\"og:image:height\" content=\"768\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sara Cantaro\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sara Cantaro\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"21 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\"},\"author\":{\"name\":\"Sara Cantaro\",\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2\"},\"headline\":\"oneCleanup: cleaning up data made 
easy\",\"datePublished\":\"2024-06-20T07:30:57+00:00\",\"dateModified\":\"2025-10-06T11:36:30+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\"},\"wordCount\":4295,\"publisher\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg\",\"keywords\":[\"Data clean-up\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\",\"url\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\",\"name\":\"Cleaning up data made easy with oneCleanup \u2013 oneword Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg\",\"datePublished\":\"2024-06-20T07:30:57+00:00\",\"dateModified\":\"2025-10-06T11:36:30+00:00\",\"description\":\"Cleaning up data with oneCleanup: We explain the most important aspects of cleaning up data and present our oneCleanup solution. 
| Get the information now!\",\"breadcrumb\":{\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage\",\"url\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg\",\"contentUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg\",\"width\":400,\"height\":428,\"caption\":\"Datenbereinigung oneCleanup; Illustration von 2 Besen die Daten bereinigen\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Startseite\",\"item\":\"https:\/\/www.oneword.de\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"oneCleanup: cleaning up data made easy\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.oneword.de\/en\/#website\",\"url\":\"https:\/\/www.oneword.de\/en\/\",\"name\":\"oneword Fach\u00fcbersetzungen\",\"description\":\"oneword Fach\u00fcbersetzungen\",\"publisher\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.oneword.de\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.oneword.de\/en\/#organization\",\"name\":\"oneword 
GmbH\",\"url\":\"https:\/\/www.oneword.de\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png\",\"contentUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png\",\"width\":360,\"height\":70,\"caption\":\"oneword GmbH\"},\"image\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/de.linkedin.com\/company\/oneword-gmbh\",\"https:\/\/www.youtube.com\/channel\/UCmC10VvZbP2IueXEZuH3Suw\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2\",\"name\":\"Sara Cantaro\",\"url\":\"https:\/\/www.oneword.de\/en\/author\/sara_cantaro\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Cleaning up data made easy with oneCleanup \u2013 oneword Blog","description":"Cleaning up data with oneCleanup: We explain the most important aspects of cleaning up data and present our oneCleanup solution. | Get the information now!","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/","og_locale":"en_US","og_type":"article","og_title":"Cleaning up data made easy with oneCleanup \u2013 oneword Blog","og_description":"Cleaning up data with oneCleanup: We explain the most important aspects of cleaning up data and present our oneCleanup solution. 
| Get the information now!","og_url":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/","og_site_name":"oneword Fach\u00fcbersetzungen","article_published_time":"2024-06-20T07:30:57+00:00","article_modified_time":"2025-10-06T11:36:30+00:00","og_image":[{"width":1536,"height":768,"url":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup-fallback.jpeg","type":"image\/jpeg"}],"author":"Sara Cantaro","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Sara Cantaro","Est. reading time":"21 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#article","isPartOf":{"@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/"},"author":{"name":"Sara Cantaro","@id":"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2"},"headline":"oneCleanup: cleaning up data made easy","datePublished":"2024-06-20T07:30:57+00:00","dateModified":"2025-10-06T11:36:30+00:00","mainEntityOfPage":{"@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/"},"wordCount":4295,"publisher":{"@id":"https:\/\/www.oneword.de\/en\/#organization"},"image":{"@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage"},"thumbnailUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg","keywords":["Data clean-up"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/","url":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/","name":"Cleaning up data made easy with oneCleanup \u2013 oneword 
Blog","isPartOf":{"@id":"https:\/\/www.oneword.de\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage"},"image":{"@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage"},"thumbnailUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg","datePublished":"2024-06-20T07:30:57+00:00","dateModified":"2025-10-06T11:36:30+00:00","description":"Cleaning up data with oneCleanup: We explain the most important aspects of cleaning up data and present our oneCleanup solution. | Get the information now!","breadcrumb":{"@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#primaryimage","url":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg","contentUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/06\/datenbereinigung-onecleanup.jpeg","width":400,"height":428,"caption":"Datenbereinigung oneCleanup; Illustration von 2 Besen die Daten bereinigen"},{"@type":"BreadcrumbList","@id":"https:\/\/www.oneword.de\/en\/cleaning-up-data-onecleanup\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Startseite","item":"https:\/\/www.oneword.de\/en\/"},{"@type":"ListItem","position":2,"name":"oneCleanup: cleaning up data made easy"}]},{"@type":"WebSite","@id":"https:\/\/www.oneword.de\/en\/#website","url":"https:\/\/www.oneword.de\/en\/","name":"oneword Fach\u00fcbersetzungen","description":"oneword 
Fach\u00fcbersetzungen","publisher":{"@id":"https:\/\/www.oneword.de\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.oneword.de\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.oneword.de\/en\/#organization","name":"oneword GmbH","url":"https:\/\/www.oneword.de\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/","url":"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png","contentUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png","width":360,"height":70,"caption":"oneword GmbH"},"image":{"@id":"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/de.linkedin.com\/company\/oneword-gmbh","https:\/\/www.youtube.com\/channel\/UCmC10VvZbP2IueXEZuH3Suw"]},{"@type":"Person","@id":"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2","name":"Sara 
Cantaro","url":"https:\/\/www.oneword.de\/en\/author\/sara_cantaro\/"}]}},"_links":{"self":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts\/35672","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/comments?post=35672"}],"version-history":[{"count":2,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts\/35672\/revisions"}],"predecessor-version":[{"id":35674,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts\/35672\/revisions\/35674"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/media\/35449"}],"wp:attachment":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/media?parent=35672"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/categories?post=35672"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/tags?post=35672"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}