{"id":35653,"date":"2024-06-20T09:00:48","date_gmt":"2024-06-20T07:00:48","guid":{"rendered":"https:\/\/www.oneword.de\/termextraktion-ki\/"},"modified":"2024-06-20T09:06:00","modified_gmt":"2024-06-20T07:06:00","slug":"term-extraction-ai","status":"publish","type":"post","link":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/","title":{"rendered":"Term extraction and AI: how to get it right."},"content":{"rendered":"<p><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-padding-top:0px;--awb-padding-right:0px;--awb-padding-bottom:0px;--awb-padding-left:0px;--awb-background-color:#82a0a7;--awb-flex-wrap:wrap;\" id=\"opener\" ><div class=\"fusion-builder-row fusion-row\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_2_3 2_3 fusion-two-third fusion-column-first\" style=\"--awb-padding-top:105px;--awb-padding-bottom:106px;--awb-bg-size:cover;width:66.666666666667%;width:calc(66.666666666667% - ( ( 4% ) * 0.66666666666667 ) );margin-right: 4%;\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-text fusion-text-1\"><p><small> 27\/05\/2024<\/small><\/p>\n<\/div><div class=\"fusion-title title fusion-title-1 fusion-sep-none fusion-title-text fusion-title-size-one\" style=\"--awb-sep-color:#82a0a7;\"><h1 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Term extraction and AI: how to get it right.<\/h1><\/div><div class=\"fusion-text fusion-text-2\"><p>Term extraction is the first step in extracting specialised terminology and creating a terminology database. How the extraction is carried out depends largely on the database and the time and personnel resources available. This is because working manually quickly becomes extremely time-consuming when the source files are extensive. And, at the moment, AI is always called upon if manual tasks need to be supported or automated. But what does term extraction look like? Is it enough to formulate a prompt to extract all the specialised terms from a text? We pitted human, machine and AI against each other in several tests with different texts and prompting strategies. Here we present the results and show the advantages and disadvantages of each of the three options.<\/p>\n<\/div><div class=\"fusion-button-wrapper\"><a class=\"fusion-button button-flat fusion-button-default-size button-custom fusion-button-default button-1 fusion-button-default-span fusion-button-default-type button-white\" style=\"--button_accent_color:#ffffff;--button_accent_hover_color:#676362;--button_border_hover_color:#676362;--button_gradient_top_color:rgba(249,157,28,0);--button_gradient_bottom_color:rgba(249,157,28,0);--button_gradient_top_color_hover:rgba(182,106,0,0);--button_gradient_bottom_color_hover:rgba(182,106,0,0);\" target=\"_self\" href=\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#information\"><span class=\"fusion-button-text\">Read more here<\/span><i class=\"fa-chevron-right fas button-icon-right\" aria-hidden=\"true\"><\/i><\/a><\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-2 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-padding-top:90px;--awb-flex-wrap:wrap;\" id=\"information\" ><div class=\"fusion-builder-row fusion-row\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-1 fusion_builder_column_2_3 2_3 fusion-two-third fusion-column-first\" style=\"--awb-padding-right:12%;--awb-bg-size:cover;width:66.666666666667%;width:calc(66.666666666667% - ( ( 4% ) * 0.66666666666667 ) );margin-right: 4%;\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-title title fusion-title-2 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">The preparation: materials and manual reference results<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-3\"><p>What AI is to text creation, the sponge city concept is to urban planning: innovative and a buzzword on everyone&#8217;s lips. So it seems appropriate that for our tests, we selected two texts about sponge cities and climate-adapted urban development. Both texts contained a high density of terminology with specialised terms from the environmental, urban development and climate adaptation domains. Texts of significantly different lengths were chosen to take into account how text length may impact the results: text 1 comprised 1658 words, text 2 more than five times as many at 9803 words.<\/p>\n<p>With increasing text lengths, manual extraction is not really an option day to day: it is too complex, too time-consuming and too expensive. While terms from a short text can be quickly entered manually (and doing so is sometimes even quicker than using software), for longer texts, this creates quality issues and takes a lot of time. When a text reaches a certain length, the concentration of those working on the text wanes and they become unsure whether they have already recorded a term or not. This often leads to terms being extracted several times, which can be easily detected and eliminated at the end of the process, but causes additional work during extraction.<\/p>\n<p>The human result is considered the gold standard for extraction, particularly when you consider that results extracted by software always have to be checked manually or suggested candidates have to be validated. It was therefore important for our tests to obtain reference values by first performing manual extractions, which would then serve as a benchmark for the results of the tools used.<\/p>\n<p>To keep this benchmark as objective as possible, each manual extraction was carried out by two people, and the results were compared with each other and then combined. 113 terms were extracted from text 1; from text 2 there were 299 terms. As there was a high density of terms in the texts overall, the figures matched what happens in practice: in short specialised texts terminology is often densely packed, whereas in longer texts it is often repeated.<\/p>\n<p>The results served not only as a quantitative reference, but also as a qualitative one: during the subsequent tool-based extraction, the only terms that were validated were those also extracted in the manual process. This meant that the tool results were compared with the reference values.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-3 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Challenger 1: Extraction software<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-4\"><p>The first tool-assisted extraction was carried out in the \u2018traditional\u2019 way, with extraction software. Several runs were carried out for both texts with different settings for data noise and maximum term length.<\/p>\n<p>In the short text 1, terms usually only appeared once, which led to inadequate results with the default settings. This was also reflected in the raw extraction result, which found between 67 and 612 terms depending on the setting. This was followed by manual validation to obtain the actual relevant technical terms from the proposed candidate terms. When run with the optimum settings, 98 terms were validated. This is 86.73% of the reference value of 113 terms from the manual extraction.<\/p>\n<p>Text 2 was also processed in several runs with different settings. With raw results of between 538 and 2526 terms, the number of identified terms was significantly higher. In the best result, 259 of the 299 terms, i.e. 86.62% of the reference result, were validated. Depending on the settings, the suggested candidate terms still had to be cleaned up linguistically, for example to correct plural forms to singular.<\/p>\n<p>Even though the aim of the validation was only to compare the results with the reference result, additional terms were quickly detected in the raw result that were classified as terminologically relevant. A further run was therefore carried out in which the extraction was performed independently of the manual reference values, resulting in 30 additional validated terms. Let us explain. The extraction software delivers the raw result in list form and, therefore, it is much more concentrated than in the full text. This also reveals terms that may have been overlooked in the document because they appear in captions or footnotes. However, the list form also risks including and validating terms that are only part of a company name or appear in the bibliography.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-4 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Challenger 2: Generative AI<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-5\"><p>As there is more and more of a call for the use of AI for extraction, Large Language Models (LLMs) have been introduced as a third option. The question and hope behind this was: is it possible to carry out extractions with a clear instruction in the form of a prompt and with little effort and to obtain a list of results within seconds or a few minutes?<\/p>\n<p>When using generative AI, prompting, i.e. formulating work instructions for the LLM, is of particular importance. In our tests, we therefore tried out various strategies, such as task-specific prompting, domain-specific prompting, combinations of both and also reverse prompting. The latter involves showing or describing the desired result to the system and asking it to formulate a prompt, with the aim of this leading to the result. A total of ten different prompts were used, some of which differed greatly in terms of their general instructions and the level of detail of these instructions. The best prompts were used several times, for example on different occasions. Three different models of ChatGPT (GPT-3.5, GPT-4 Turbo, GPT-4o) were used as the Large Language Model, as this is currently the most widely used tool. The text was entered directly into the input window and also transferred to the system as a file.<\/p>\n<p>The low number of extracted terms was clear in all runs, especially in relation to the reference result. For the shorter text 1, ChatGPT delivered between 20 and 53 terms as a raw result, between 12 and 35 of which could be validated. The coverage was therefore 30.97% of the reference value. For the longer text 2, the AI suggested between 25 and 157 terms as a raw result, depending on the prompt, of which 5 to 75 (at most) were validated. Therefore, ChatGPT achieved a maximum coverage of only 25.08% for text 2. This means that only a quarter of the terms that a person had classified as relevant in the same text were detected and extracted.<\/p>\n<p>The ten different prompts produced large differences in the raw and final results, as the following overview of text 1 shows.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-image-element in-legacy-container\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\" fusion-imageframe imageframe-none imageframe-1 hover-type-none\"><a href=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts.jpg\" class=\"fusion-lightbox\" data-rel=\"iLightbox[6d29a61afe050aa29e1]\" data-title=\"term-extraction-ai-differences-different-prompts\" title=\"term-extraction-ai-differences-different-prompts\"><img decoding=\"async\" width=\"2008\" height=\"2008\" alt=\"Term extraction and AI: differences with different prompts\" src=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts.jpg\" class=\"img-responsive wp-image-35665\" srcset=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts-200x200.jpg 200w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts-400x400.jpg 400w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts-600x600.jpg 600w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts-800x800.jpg 800w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts-1200x1200.jpg 1200w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-different-prompts.jpg 2008w\" sizes=\"(max-width: 1023px) 100vw, 800px\" \/><\/a><\/span><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-6\"><p class=\"pic-text\">However, even identical prompts with identical input (same source text, same form of data input) never delivered the same result twice, but sometimes three times as many terms. Both the raw result and the number of validated terms fluctuated, even though the same prompt was used four times on different occasions. This is a good illustration of the lack of reproducibility that is often being discussed at the moment in relation to generative AI. It also puts the influence of the prompt formulation somewhat into perspective, as even the same input can produce very different results.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-image-element in-legacy-container\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\" fusion-imageframe imageframe-none imageframe-2 hover-type-none\"><a href=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts.jpg\" class=\"fusion-lightbox\" data-rel=\"iLightbox[5be671429a670f16e37]\" data-title=\"term-extraction-ai-differences-identical-prompts\" title=\"term-extraction-ai-differences-identical-prompts\"><img decoding=\"async\" width=\"2008\" height=\"2008\" alt=\"Term extraction and AI: differences with identical prompts\" src=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts.jpg\" class=\"img-responsive wp-image-35660\" srcset=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts-200x200.jpg 200w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts-400x400.jpg 400w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts-600x600.jpg 600w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts-800x800.jpg 800w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts-1200x1200.jpg 1200w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-differences-identical-prompts.jpg 2008w\" sizes=\"(max-width: 1023px) 100vw, 800px\" \/><\/a><\/span><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-7\"><p>In addition to the lack of reproducibility, we encountered another weakness with AI in the tests that limits the reliability of the results: hallucinations. In a total of five runs, ChatGPT delivered terms that appeared technically correct but did not appear in the text at all. The newer ChatGPT-4o model hallucinated more frequently than the other two models.<\/p>\n<p>Since the result of ChatGPT was significantly lower than that of the other methods, we began to assume that perhaps only high-frequency terms are extracted by AI. A further prompt instructed that, for each term, the LLM should also indicate how many times the term appeared. With the same text input, the system gave a frequency of 2 for the term \u2018Bodenfunktion\u2019 on the first attempt and a frequency of 8 on the next attempt. A manual check revealed that the term appeared a total of 29 times in the text. Even though it is not new knowledge that LLMs are not calculating machines, this shows that there is potential for errors when querying additional information about the extraction result. After further analyses, it could not be confirmed that the terms extracted by AI occurred more frequently in the texts than other, non-extracted terms.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-5 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">The finishing line \u2013 and what to watch out for<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-8\"><p>Depending on the method used, term extraction can lead to very different results, as we have shown. However, a decision cannot be made on the final result alone, as, in day-to-day work, time and costs and any potential for errors and process risks must always be considered alongside the number of extracted terms.<\/p>\n<p>In our tests, we documented how long the extraction took using each method. In all cases, however, it depends on how experienced you are with the tasks or the tool being used and, when using ChatGPT, whether you first need to formulate and test a prompt or whether a tried-and-tested prompt is already available.<\/p>\n<p>For the longer text 2 (9803 words), the time comparison showed very clear results.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-image-element in-legacy-container\" style=\"--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);\"><span class=\" fusion-imageframe imageframe-none imageframe-3 hover-type-none\"><a href=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2.jpg\" class=\"fusion-lightbox\" data-rel=\"iLightbox[0d4590f4e9c039a94c9]\" data-title=\"term-extraction-ai-test2\" title=\"term-extraction-ai-test2\"><img decoding=\"async\" width=\"2008\" height=\"2008\" alt=\"Term extraction and AI; Test 2; Comparison duration, Diagramme\" src=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2.jpg\" class=\"img-responsive wp-image-35655\" srcset=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2-200x200.jpg 200w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2-400x400.jpg 400w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2-600x600.jpg 600w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2-800x800.jpg 800w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2-1200x1200.jpg 1200w, https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/term-extraction-ai-test2.jpg 2008w\" sizes=\"(max-width: 1023px) 100vw, 800px\" \/><\/a><\/span><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-9\"><p>While using extraction software saves almost half the time of manual extraction, ChatGPT is the unbeaten fastest method, taking just one eighth of the time required to do the task manually. Formulating and entering a prompt is much faster than creating an extraction project in the tool or perusing several pages of a document. With ChatGPT, the results are also processed and output within a few seconds. The results obtained were all terminologically clean and could have been used without further editing, at least from a linguistic point of view. With both manual and software-assisted extraction, however, terms often had to be converted into their basic form.<\/p>\n<p>Using ChatGPT for term extraction also has the advantage that the result can be requested in different formats (e.g. as a list, in columns, etc.) or file formats (e.g. Excel) for optimal further processing. This enables automation of the process to immediately fill a database based on the extraction results. And, in a work context, less time means lower costs.<\/p>\n<p>Time, effort and costs are precisely the reasons why manual extraction, which was considered an important benchmark in our tests, is usually not a realistic option in day-to-day work. After all, <a href=\"https:\/\/www.oneword.de\/en\/term-mining\/\">term extraction<\/a> is mainly used for large amounts of text. In these cases, human labour alone, which takes a lot more effort, is disproportionate to the larger quantity of terms that could be found.<\/p>\n<p>Although the manual result was also used as a qualitative benchmark, human extraction also has potential for errors. As well as the additional terms which, as mentioned above, were also validated with tool extraction, and the susceptibility of errors due to the person\u2019s dwindling concentration when there are large amounts of text, subjectivity plays a major role in the extraction. Two terminologists will probably never provide the exact same validation results, because there are always grey areas as to which words are really considered technical terms. In addition, different validation strategies influence the result: depending on how broadly you define a domain, you will validate more or fewer terms. In our example, the general domain is \u2018environment and climate\u2019 and the specific domain is \u2018climate-adapted urban planning\u2019. Including only the latter leads to significantly fewer validations than including the general subject area. In our test, we tried to relativise this subjectivity and influence by comparing the results of two terminologists and by using agreements and guidelines.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-title title fusion-title-6 fusion-sep-none fusion-title-text fusion-title-size-four\"><h4 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Conclusion: Many roads lead to term extraction<\/h4><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:25px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-text fusion-text-10\"><p>Although the three analysed methods of term extraction show clear differences, they can all be useful in different cases. Manual extraction as the gold standard may be the best-in-class in theory, but in day-to-day business it simply often fails due to its feasibility, and, as shown, it is not infallible.<\/p>\n<p>AI delivers very fast, but sometimes incorrect, results, which \u2013 like everything that comes directly from a machine \u2013 must be checked with a critical eye. As it extracts significantly fewer terms, it is arguably unsuitable for use in a large extraction project or when a comprehensive terminology structure is required. However, the speed and automation of the process enables terminology to be built up \u2018on the fly\u2019, which means that a database can be built up gradually without much effort. The terms invented by AI that creep into the database and its lack of reproducibility, as seen in the different results obtained from the exact same input, must be viewed critically.<\/p>\n<p>Software specialising in term extraction proved to be the optimal middle ground in the test. This solution is the standard in day-to-day terminology work and is certainly always a good choice. In our comparisons, the software achieved good results or even exceeded the reference value with less time spent. When using the tools, it&#8217;s important to choose the right settings for each project in order to increase the yield and reduce the amount of rework required.<\/p>\n<p>Terms can therefore be extracted in different ways and with different results. Therefore, the key question is what you want to achieve and how many resources can be used to achieve it. In the end, however, term extraction is only possible with a human in the loop, as it is up to humans to check and validate the raw results from software or AI.<\/p>\n<\/div><div class=\"fusion-text fusion-text-11\"><p><em>Would you like to find out more about term extraction or build up a terminology database based on your data? Our terminology team will be happy to help you: <a href=\"mailto:terminologie@oneword.de\">terminologie@oneword.de<\/a><\/em><\/p>\n<\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-2 fusion_builder_column_1_3 1_3 fusion-one-third fusion-column-last\" style=\"--awb-bg-size:cover;width:33.333333333333%;width:calc(33.333333333333% - ( ( 4% ) * 0.33333333333333 ) );\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-widget-area awb-widget-area-element fusion-widget-area-1 fusion-content-widget-area\" style=\"--awb-title-color:#676362;--awb-padding:0px 0px 0px 0px;\">\n\t\t<section id=\"recent-posts-2\" class=\"widget widget_recent_entries\">\n\t\t<div class=\"heading\"><h4 class=\"widget-title\">Recent Posts<\/h4><\/div>\n\t\t<ul>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/5-reasons-why-users-love-onesuite\/\">5 reasons why users love oneSuite<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/christmas-donation-nph-kinderhilfe-lateinamerika-2025\/\">A portion of hope: Christmas donation for children&#8217;s charity nph<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/glossary-creation-at-the-touch-of-a-button\/\">Glossary creation at the touch of a button?<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/language-data-for-llm-use\/\">Language data for LLM use: making AI an expert on your company<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t<li>\n\t\t\t\t\t<a href=\"https:\/\/www.oneword.de\/en\/tcworld-conference-2025\/\">oneword at the 2025 tcworld conference<\/a>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t<\/ul>\n\n\t\t<\/section><div class=\"fusion-additional-widget-content\"><\/div><\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div><\/div><\/div><div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-3 hundred-percent-fullwidth non-hundred-percent-height-scrolling fusion-equal-height-columns\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-padding-top:0px;--awb-padding-right:0px;--awb-padding-bottom:0px;--awb-padding-left:0px;--awb-flex-wrap:wrap;\" id=\"closer\" ><div class=\"fusion-builder-row fusion-row\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-3 fusion_builder_column_2_5 2_5 fusion-two-fifth fusion-column-first competence\" style=\"--awb-padding-top:8%;--awb-padding-right:10%;--awb-padding-bottom:5%;--awb-padding-left:38%;--awb-bg-color:#82a0a7;--awb-bg-color-hover:#82a0a7;--awb-bg-image:url(&#039;https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/10\/8-gruende-footer-icon.png&#039;);--awb-bg-position:left center;--awb-bg-repeat:repeat-y;width:40%;width:calc(40% - ( ( 4% ) * 0.4 ) );margin-right: 4%;\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy fusion-column-has-bg-image\" data-bg-url=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/10\/8-gruende-footer-icon.png\"><div class=\"fusion-title title fusion-title-7 fusion-sep-none fusion-title-text fusion-title-size-five\" style=\"--awb-margin-bottom:32px;\"><h5 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">8 good reasons to choose oneword.<\/h5><\/div><div class=\"fusion-text fusion-text-12\"><p>Learn more about what we do and what sets us apart from traditional translation agencies.<\/p>\n<p>We explain 8 good reasons and more to choose oneword for a successful partnership.<\/p>\n<\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-separator fusion-full-width-sep\" style=\"margin-left: auto;margin-right: auto;margin-bottom:30px;width:100%;\"><\/div><div class=\"fusion-sep-clear\"><\/div><div class=\"fusion-button-wrapper\"><a class=\"fusion-button button-flat fusion-button-default-size button-custom fusion-button-default button-2 fusion-button-default-span fusion-button-default-type button-white\" style=\"--button_accent_color:#ffffff;--button_accent_hover_color:#676362;--button_border_hover_color:#676362;--button_gradient_top_color:rgba(249,157,28,0);--button_gradient_bottom_color:rgba(249,157,28,0);--button_gradient_top_color_hover:rgba(182,106,0,0);--button_gradient_bottom_color_hover:rgba(182,106,0,0);\" target=\"_self\" href=\"https:\/\/www.oneword.de\/en\/translation-company-stuttgart\/\"><span class=\"fusion-button-text\">Explore reasons<\/span><i class=\"fa-chevron-right fas button-icon-right\" aria-hidden=\"true\"><\/i><\/a><\/div><div class=\"fusion-clearfix\"><\/div><\/div><\/div>\n<div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-4 fusion_builder_column_3_5 3_5 fusion-three-fifth fusion-column-last contact\" style=\"--awb-padding-top:8%;--awb-padding-right:30%;--awb-padding-bottom:5%;--awb-padding-left:10%;--awb-bg-color:#f99d1c;--awb-bg-color-hover:#f99d1c;--awb-bg-size:cover;width:60%;width:calc(60% - ( ( 4% ) * 0.6 ) );\"><div class=\"fusion-column-wrapper fusion-flex-column-wrapper-legacy\"><div class=\"fusion-title title fusion-title-8 fusion-sep-none fusion-title-text fusion-title-size-five\" style=\"--awb-margin-bottom:39px;\"><h5 class=\"fusion-title-heading title-heading-left\" style=\"margin:0;\">Request a quotation<\/h5><\/div>\n<div class=\"wpcf7 no-js\" id=\"wpcf7-f6607-o1\" lang=\"de-DE\" dir=\"ltr\" data-wpcf7-id=\"6607\">\n<div class=\"screen-reader-response\"><p role=\"status\" aria-live=\"polite\" aria-atomic=\"true\"><\/p> <ul><\/ul><\/div>\n<form action=\"\/en\/wp-json\/wp\/v2\/posts\/35653#wpcf7-f6607-o1\" method=\"post\" class=\"wpcf7-form init\" aria-label=\"Kontaktformular\" novalidate=\"novalidate\" data-status=\"init\">\n<fieldset class=\"hidden-fields-container\"><input type=\"hidden\" name=\"_wpcf7\" value=\"6607\" \/><input type=\"hidden\" name=\"_wpcf7_version\" value=\"6.1.4\" \/><input type=\"hidden\" name=\"_wpcf7_locale\" value=\"de_DE\" \/><input type=\"hidden\" name=\"_wpcf7_unit_tag\" value=\"wpcf7-f6607-o1\" \/><input type=\"hidden\" name=\"_wpcf7_container_post\" value=\"0\" \/><input type=\"hidden\" name=\"_wpcf7_posted_data_hash\" value=\"\" \/>\n<\/fieldset>\n<div class=\"contact-form\">\n\t<div class=\"fusion-builder-row fusion-row\">\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_2  fusion-one-half fusion-column-first 1_2\">\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-anliegen\"><select class=\"wpcf7-form-control wpcf7-select wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" name=\"anfrage-anliegen\"><option value=\"Your enquiry *\">Your enquiry *<\/option><option value=\"Translation\/Localisation\">Translation\/Localisation<\/option><option value=\"Terminology\">Terminology<\/option><option value=\"Machine Translation\">Machine Translation<\/option><option value=\"Post-editing (MTPE)\">Post-editing (MTPE)<\/option><option value=\"International SEO\">International SEO<\/option><option value=\"Transcreation\">Transcreation<\/option><option value=\"Technologies\/Processes\">Technologies\/Processes<\/option><option value=\"Consulting\">Consulting<\/option><option value=\"Price list\">Price list<\/option><option value=\"Other\">Other<\/option><\/select><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-name\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Full name *\" value=\"\" type=\"text\" name=\"anfrage-name\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-firma\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Company *\" value=\"\" type=\"text\" name=\"anfrage-firma\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-mail\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-email wpcf7-validates-as-required wpcf7-text wpcf7-validates-as-email\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"E-mail address *\" value=\"\" type=\"email\" name=\"anfrage-mail\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-telefon\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Tel. no. *\" value=\"\" type=\"text\" name=\"anfrage-telefon\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_2  fusion-one-half 1_2\">\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-text\"><textarea cols=\"40\" rows=\"10\" maxlength=\"2000\" class=\"wpcf7-form-control wpcf7-textarea wpcf7-validates-as-required\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Your message (please state required language)\" name=\"anfrage-text\"><\/textarea><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span id=\"wpcf7-69d1127548b6c-wrapper\" class=\"wpcf7-form-control-wrap vogelnest-wrap\" style=\"display:none !important; visibility:hidden !important;\"><label for=\"wpcf7-69d1127548b6c-field\" class=\"hp-message\">Bitte lasse dieses Feld leer.<\/label><input id=\"wpcf7-69d1127548b6c-field\"  class=\"wpcf7-form-control wpcf7-text\" type=\"text\" name=\"vogelnest\" value=\"\" size=\"40\" tabindex=\"-1\" autocomplete=\"new-password\" \/><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n\t<div class=\"fusion-builder-row fusion-row privacy\">\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_1  fusion-one-full 1_1\">\n\t\t\t<div class=\"fusion-column-wrapper\">\n\t\t\t\t<p><span class=\"wpcf7-form-control-wrap\" data-name=\"anfrage-datenschutz\"><span class=\"wpcf7-form-control wpcf7-checkbox wpcf7-validates-as-required\"><span class=\"wpcf7-list-item first last\"><input type=\"checkbox\" name=\"anfrage-datenschutz[]\" value=\"I agree that oneword GmbH may contact me and store the data that I provide.\" \/><span class=\"wpcf7-list-item-label\">I agree that oneword GmbH may contact me and store the data that I provide.<\/span><\/span><\/span><\/span>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n\t<div class=\"fusion-builder-row fusion-row\">\n\t\t<div class=\"fusion-layout-column fusion_builder_column fusion_builder_column_1_5  fusion-one-fifth 1_5\">\n\t\t\t<div class=\"fusion-column-wrapper send-form\">\n\t\t\t\t<p><input class=\"wpcf7-form-control wpcf7-submit has-spinner\" type=\"submit\" value=\"Submit request\" \/>\n\t\t\t\t<\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/div><div class=\"fusion-alert alert custom alert-custom fusion-alert-center wpcf7-response-output fusion-alert-capitalize awb-alert-native-link-color alert-dismissable awb-alert-close-boxed\" style=\"--awb-border-size:1px;--awb-border-top-left-radius:0px;--awb-border-top-right-radius:0px;--awb-border-bottom-left-radius:0px;--awb-border-bottom-right-radius:0px;\" role=\"alert\"><div class=\"fusion-alert-content-wrapper\"><span class=\"fusion-alert-content\"><\/span><\/div><button type=\"button\" class=\"close toggle-alert\" data-dismiss=\"alert\" aria-label=\"Close\">&times;<\/button><\/div>\n<\/form>\n<\/div>\n<div class=\"fusion-clearfix\"><\/div><\/div><\/div>\n<\/div><\/div><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Term extraction is the first step in extracting specialised terminology and creating a terminology database. How the extraction is carried out depends largely on the database and the time and personnel resources available. This is because working manually quickly becomes extremely time-consuming when the source files are extensive. And, at the moment, AI is always called upon if manual tasks need to be supported or automated. But what does term extraction look like? Is it enough to formulate a prompt to extract all the specialised terms from a text? We pitted human, machine and AI against each other in several tests with different texts and prompting strategies. Here we present the results and show the advantages and disadvantages of each of the three options.<\/p>\n","protected":false},"author":16,"featured_media":35340,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[13],"tags":[1457],"class_list":["post-35653","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-term-extraction-and-ai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Term extraction tested. Which is best? Human, software or AI?<\/title>\n<meta name=\"description\" content=\"We had AI, humans and an extraction tool perform a term extraction. Read the blog to find out which came out on top.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Term extraction tested. Which is best? Human, software or AI?\" \/>\n<meta property=\"og:description\" content=\"We had AI, humans and an extraction tool perform a term extraction. Read the blog to find out which came out on top.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\" \/>\n<meta property=\"og:site_name\" content=\"oneword Fach\u00fcbersetzungen\" \/>\n<meta property=\"article:published_time\" content=\"2024-06-20T07:00:48+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-06-20T07:06:00+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki-fallback.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1536\" \/>\n\t<meta property=\"og:image:height\" content=\"768\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sara Cantaro\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sara Cantaro\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"32 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\"},\"author\":{\"name\":\"Sara Cantaro\",\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2\"},\"headline\":\"Term extraction and AI: how to get it right.\",\"datePublished\":\"2024-06-20T07:00:48+00:00\",\"dateModified\":\"2024-06-20T07:06:00+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\"},\"wordCount\":6504,\"publisher\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg\",\"keywords\":[\"Term extraction and AI\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\",\"url\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\",\"name\":\"Term extraction tested. Which is best? Human, software or AI?\",\"isPartOf\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg\",\"datePublished\":\"2024-06-20T07:00:48+00:00\",\"dateModified\":\"2024-06-20T07:06:00+00:00\",\"description\":\"We had AI, humans and an extraction tool perform a term extraction. Read the blog to find out which came out on top.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage\",\"url\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg\",\"contentUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg\",\"width\":400,\"height\":428,\"caption\":\"Termextraktion KI; Illustration eines menschlichen und eines Roboterkopfes die am Hinterkopf verschmelzen\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Startseite\",\"item\":\"https:\/\/www.oneword.de\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Term extraction and AI: how to get it right.\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.oneword.de\/en\/#website\",\"url\":\"https:\/\/www.oneword.de\/en\/\",\"name\":\"oneword Fach\u00fcbersetzungen\",\"description\":\"oneword Fach\u00fcbersetzungen\",\"publisher\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.oneword.de\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.oneword.de\/en\/#organization\",\"name\":\"oneword GmbH\",\"url\":\"https:\/\/www.oneword.de\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png\",\"contentUrl\":\"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png\",\"width\":360,\"height\":70,\"caption\":\"oneword GmbH\"},\"image\":{\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/de.linkedin.com\/company\/oneword-gmbh\",\"https:\/\/www.youtube.com\/channel\/UCmC10VvZbP2IueXEZuH3Suw\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2\",\"name\":\"Sara Cantaro\",\"url\":\"https:\/\/www.oneword.de\/en\/author\/sara_cantaro\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Term extraction tested. Which is best? Human, software or AI?","description":"We had AI, humans and an extraction tool perform a term extraction. Read the blog to find out which came out on top.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/","og_locale":"en_US","og_type":"article","og_title":"Term extraction tested. Which is best? Human, software or AI?","og_description":"We had AI, humans and an extraction tool perform a term extraction. Read the blog to find out which came out on top.","og_url":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/","og_site_name":"oneword Fach\u00fcbersetzungen","article_published_time":"2024-06-20T07:00:48+00:00","article_modified_time":"2024-06-20T07:06:00+00:00","og_image":[{"width":1536,"height":768,"url":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki-fallback.jpg","type":"image\/jpeg"}],"author":"Sara Cantaro","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Sara Cantaro","Est. reading time":"32 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#article","isPartOf":{"@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/"},"author":{"name":"Sara Cantaro","@id":"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2"},"headline":"Term extraction and AI: how to get it right.","datePublished":"2024-06-20T07:00:48+00:00","dateModified":"2024-06-20T07:06:00+00:00","mainEntityOfPage":{"@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/"},"wordCount":6504,"publisher":{"@id":"https:\/\/www.oneword.de\/en\/#organization"},"image":{"@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage"},"thumbnailUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg","keywords":["Term extraction and AI"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/","url":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/","name":"Term extraction tested. Which is best? Human, software or AI?","isPartOf":{"@id":"https:\/\/www.oneword.de\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage"},"image":{"@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage"},"thumbnailUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg","datePublished":"2024-06-20T07:00:48+00:00","dateModified":"2024-06-20T07:06:00+00:00","description":"We had AI, humans and an extraction tool perform a term extraction. Read the blog to find out which came out on top.","breadcrumb":{"@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.oneword.de\/en\/term-extraction-ai\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#primaryimage","url":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg","contentUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2024\/05\/termextraktion-ki.jpg","width":400,"height":428,"caption":"Termextraktion KI; Illustration eines menschlichen und eines Roboterkopfes die am Hinterkopf verschmelzen"},{"@type":"BreadcrumbList","@id":"https:\/\/www.oneword.de\/en\/term-extraction-ai\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Startseite","item":"https:\/\/www.oneword.de\/en\/"},{"@type":"ListItem","position":2,"name":"Term extraction and AI: how to get it right."}]},{"@type":"WebSite","@id":"https:\/\/www.oneword.de\/en\/#website","url":"https:\/\/www.oneword.de\/en\/","name":"oneword Fach\u00fcbersetzungen","description":"oneword Fach\u00fcbersetzungen","publisher":{"@id":"https:\/\/www.oneword.de\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.oneword.de\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.oneword.de\/en\/#organization","name":"oneword GmbH","url":"https:\/\/www.oneword.de\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/","url":"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png","contentUrl":"https:\/\/www.oneword.de\/wp-content\/uploads\/2018\/05\/oneword-logo.png","width":360,"height":70,"caption":"oneword GmbH"},"image":{"@id":"https:\/\/www.oneword.de\/en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/de.linkedin.com\/company\/oneword-gmbh","https:\/\/www.youtube.com\/channel\/UCmC10VvZbP2IueXEZuH3Suw"]},{"@type":"Person","@id":"https:\/\/www.oneword.de\/en\/#\/schema\/person\/e5cb951cb96ef68846fced17e472bdc2","name":"Sara Cantaro","url":"https:\/\/www.oneword.de\/en\/author\/sara_cantaro\/"}]}},"_links":{"self":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts\/35653","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/comments?post=35653"}],"version-history":[{"count":2,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts\/35653\/revisions"}],"predecessor-version":[{"id":35670,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/posts\/35653\/revisions\/35670"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/media\/35340"}],"wp:attachment":[{"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/media?parent=35653"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/categories?post=35653"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.oneword.de\/en\/wp-json\/wp\/v2\/tags?post=35653"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}