{"id":285,"date":"2022-12-12T23:27:55","date_gmt":"2022-12-12T23:27:55","guid":{"rendered":"https:\/\/inten.to\/machine-translation-university\/?p=285"},"modified":"2023-03-25T22:54:43","modified_gmt":"2023-03-25T22:54:43","slug":"mastering-custom-machine-translation-a-practical-guide","status":"publish","type":"post","link":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/","title":{"rendered":"Mastering Custom Machine Translation: A Practical Guide"},"content":{"rendered":"<h3><span style=\"font-weight: 400;\">Understand what is under the hood<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">It is important to understand the training and evaluation processes in MT systems. Those systems are not foolproof, meaning they do not prevent users from making mistakes.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For instance, inexperienced users may simply drag and drop a TMX file into the training interface, press a button, and see their BLEU score decrease due to poor data. When actions do not yield desired outcomes, users may repeat the same process, potentially updating their existing model (if the provider has such an option) with the same data without retraining from scratch. This can cause training, validation, and test sets to overlap, resulting in a poorly performing model, while the BLEU score increase may seem impressive (exactly because of this overlap). A high BLEU score paired with gibberish translations in production can lead users to believe machine translation is ineffective.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">To address this issue, MT providers must design tools to enforce proper processes. Although MT involves a certain level of artistry, adhering to a specific process is crucial. Unfortunately, many tools do not enforce this process, leading to subpar results. In this chapter, we will guide you through the training process on an example of training a custom model using Google Cloud, which is similar to other cloud MT providers with automatic domain adaptation.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Google Cloud example<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">When training an MT model in Google Cloud steps involved are<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Configure Google Cloud: Set up projects and permissions, and manage costs.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Upload data and initiate training: Add your data to the platform and start the training process.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Monitor successful training and quality score changes: Track the progress and observe improvements in the quality score.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Analyze training and dive deeper: Evaluate the training process and explore further ways to improve your custom model.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Update models based on data: Adjust and enhance your models according to the data and its performance.<\/span><\/li>\n<\/ol>\n<h3><span style=\"font-weight: 400;\">Domain adaptation. Google Cloud. Configuration.<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Getting started with Google Cloud can initially seem overwhelming, but the process is similar to other cloud systems. To set up your project, follow these steps:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Create a Google Cloud project.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Set up billing for the project.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Enable AutoML and Cloud Storage API to use machine learning and upload training data.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Create a Service Account and download the key (JSON).<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Configure permissions using Google Cloud CLI.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Create a Google Cloud Storage bucket in the desired region using gsutil CLI.<\/span><\/li>\n<\/ol>\n<h3><span style=\"font-weight: 400;\">Domain adaptation. Google Cloud. Training<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Once you have created the storage bucket, the next step is to prepare your data. The requirements may vary between providers, so it is essential to follow the specific guidelines of the platform you are using. For Google Cloud, you can refer to their documentation for data preparation: <\/span><a href=\"https:\/\/cloud.google.com\/translate\/automl\/docs\/prepare\"><span style=\"font-weight: 400;\">https:\/\/cloud.google.com\/translate\/automl\/docs\/prepare<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">After preparing your data according to the guidelines, create a dataset in the user interface (UI) dataset and import your files into the dataset. This will enable you to utilize the data for training your custom machine translation model within the Google Cloud platform.<\/span><\/p>\n<figure id=\"attachment_456\" aria-describedby=\"caption-attachment-456\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img fetchpriority=\"high\" decoding=\"async\" class=\"wp-image-456 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-1024x302.png\" alt=\"\" width=\"800\" height=\"236\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-1024x302.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-300x88.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-768x226.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-1536x453.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-2048x604.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-456\" class=\"wp-caption-text\">Figure 8. Training Google Model. Step 1<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">At this stage, you can import your translation files into the dataset you created in the user interface (UI). Ensure your translation files are appropriately formatted and prepared according to the platform guidelines. <\/span><\/p>\n<figure id=\"attachment_457\" aria-describedby=\"caption-attachment-457\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" class=\"wp-image-457 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/9-1024x363.png\" alt=\"\" width=\"800\" height=\"284\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/9-1024x363.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/9-300x106.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/9-768x272.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/9-1536x544.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/9.png 1600w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-457\" class=\"wp-caption-text\">Figure 9. Training Google Model. Step 2<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Once your translation files are imported, you will see statistics and an automatic split between train, validation, and test sets. While the default split is convenient, there may be instances where you would want more control over the division of your data, particularly for the validation set.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When dealing with different content types, ensure that all types are represented in the validation and test sets. This helps to evaluate the model&#8217;s performance accurately across all content types. Instead of training separate models for each content type, assessing how a single model performs for all content types is better.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In such cases, you can manually split your data during the preparation to maintain the desired distribution of content types in the training, validation, and test sets. This approach can provide better insights into your model&#8217;s performance and help you fine-tune it for optimal results.<\/span><\/p>\n<figure id=\"attachment_458\" aria-describedby=\"caption-attachment-458\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" class=\"wp-image-458 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/10-1024x242.png\" alt=\"\" width=\"800\" height=\"189\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/10-1024x242.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/10-300x71.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/10-768x181.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/10-1536x363.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/10.png 1600w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-458\" class=\"wp-caption-text\">Figure 10. Training Google Model. Step 3<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Once the model has completed its training, you will notice an improvement in the BLEU score, indicating the enhanced quality of your machine translation model. Additionally, you can review various statistics related to the training process, such as loss, accuracy, and other performance metrics. These insights will help you understand how well your model performs and identify potential areas for further optimization and improvement.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Remember that Google Translate allows you to run models in specific regions. <\/span><\/p>\n<figure id=\"attachment_462\" aria-describedby=\"caption-attachment-462\" style=\"width: 219px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-462 size-medium\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-219x300.png\" alt=\"\" width=\"219\" height=\"300\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-219x300.png 219w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-748x1024.png 748w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-768x1052.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score.png 974w\" sizes=\"(max-width: 219px) 100vw, 219px\" \/><figcaption id=\"caption-attachment-462\" class=\"wp-caption-text\">Figure 11. Google Training Analysis. BLEU Score<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_464\" aria-describedby=\"caption-attachment-464\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-464 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-Interpretation-1024x542.png\" alt=\"\" width=\"800\" height=\"423\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-Interpretation-1024x542.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-Interpretation-300x159.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-Interpretation-768x407.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-Interpretation-1536x814.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Google-Training-Analysis.-BLEU-Score-Interpretation.png 1820w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-464\" class=\"wp-caption-text\">Figure 12. Google Training Analysis. BLEU Score Interpretation<\/figcaption><\/figure>\n<h3><span style=\"font-weight: 400;\">Domain adaptation. Google cloud. Using the model.<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Keep in mind that Google Translate runs models in certain regions.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">To integrate your custom model with your Translation Management System (TMS), follow these steps:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Choose the appropriate TMS connector: If your TMS has a built-in Google AutoML connector, use it. Otherwise, you can use the Intento connector as an alternative.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Authenticate using the service project JSON: In the TMS connector, authenticate by providing the service project JSON downloaded earlier.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Use the model ID: Input your custom model&#8217;s ID into the TMS connector to ensure the correct translation model is used.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Opt for Google Cloud Advanced API: We recommend using it instead of the AutoML API, as it supports batching, resulting in faster translation speeds.<\/span><\/li>\n<\/ol>\n<h3><span style=\"font-weight: 400;\">Other platforms and considerations.\u00a0<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Different platforms have varying dataset requirements, such as data size, segment length, number of uploaded TMs, file formats, encoding, and escaping. Adhere to these requirements to ensure optimal translation results.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">There are two primary types of model adaptation: deep (static) and lightweight (dynamic). Deep adaptation, like Google Cloud, is easier to build and validate as the model is trained once and then used over a long time. Lightweight adaptation, which is dynamic and adjusts on-the-fly, works well with continuously changing data but has limitations, such as, for example, constant data quality verification to prevent feeding a model bad data.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">As your dataset increases, you might need to switch between custom machine translation platforms to ensure the best performance. When updating models, you often need to combine datasets and retrain the model, as most platforms no longer support training on top of existing models.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For large organizations, migrating models between environments can be crucial, especially in industries with strict regulations, such as pharmaceuticals. Some platforms, like Google Cloud, allow for easy model migration, enabling you to train models in one environment and use them in another. However, not all platforms offer this flexibility, and you may need to retrain your model in the new environment.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In summary, understanding the different dataset requirements, types of adaptation, model updating methods, and migration capabilities are crucial when working with custom machine translation platforms to ensure the best results and compatibility with your organization&#8217;s needs.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Was the training successful?<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">To streamline and simplify the process of training and evaluating custom machine translation models across multiple vendors, the AI Curation team at Intento uses an internal tool with a unified interface. This tool, <a href=\"https:\/\/inten.to\/mt-studio\/\">Intento MT Studio<\/a>, helps Intento analysts effectively handle the differences between various platforms. With this tool, our team can streamline the model training process, save time, and quickly compare the performance of models trained on different platforms, enabling us to choose the best-performing solution for specific translation needs.<\/span><\/p>\n<figure id=\"attachment_463\" aria-describedby=\"caption-attachment-463\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-463 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-1024x960.png\" alt=\"\" width=\"800\" height=\"750\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-1024x960.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-300x281.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-768x720.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-1536x1439.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.png 1749w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-463\" class=\"wp-caption-text\">Figure 13. Intento MT Studio. Training<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_465\" aria-describedby=\"caption-attachment-465\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-465 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.-Providers-Accounts-1024x321.png\" alt=\"\" width=\"800\" height=\"251\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.-Providers-Accounts-1024x321.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.-Providers-Accounts-300x94.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.-Providers-Accounts-768x241.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.-Providers-Accounts-1536x481.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training.-Providers-Accounts.png 1794w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-465\" class=\"wp-caption-text\">Figure 14. Intento MT Studio. Training. Providers &amp; Accounts<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Intento MT Studio<\/span><span style=\"font-weight: 400;\"> displays stock models alongside custom models, making it convenient for analysts to compare the performance of both models. This side-by-side comparison enables a more comprehensive understanding of the training process and its impact on translation quality beyond just the BLEU score.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">By comparing stock and custom models, we can better evaluate the effectiveness of the custom model and identify areas for potential improvement or optimization.<\/span><\/p>\n<figure id=\"attachment_467\" aria-describedby=\"caption-attachment-467\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-467 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-Status-1-1024x543.png\" alt=\"\" width=\"800\" height=\"424\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-Status-1-1024x543.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-Status-1-300x159.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-Status-1-768x408.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-Status-1-1536x815.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Training-Status-1-2048x1087.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-467\" class=\"wp-caption-text\">Figure 15. Intento MT Studio. Training Status<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">While comparing scores for each model, confidence intervals to help us assess the stability of the translations. Wide confidence intervals may indicate that although the score improved, the translation quality became less consistent.<\/span><\/p>\n<figure id=\"attachment_468\" aria-describedby=\"caption-attachment-468\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-468 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Scoring-1024x495.png\" alt=\"\" width=\"800\" height=\"387\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Scoring-1024x495.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Scoring-300x145.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Scoring-768x371.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Scoring-1536x743.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Scoring-2048x990.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-468\" class=\"wp-caption-text\">Figure 16. Intento MT Studio. Scoring<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">However, reviewing scores and confidence intervals alone may only partially understand the model&#8217;s performance. To gain deeper insights, examine the aspects behind these scores. This may involve analyzing the training process, data preparation, and other factors that could influence the model&#8217;s performance. By delving into the details, analysts can identify potential areas for improvement and ensure more reliable and accurate translations.<\/span><\/p>\n<figure id=\"attachment_469\" aria-describedby=\"caption-attachment-469\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-469 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models-1024x515.png\" alt=\"\" width=\"800\" height=\"402\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models-1024x515.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models-300x151.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models-768x386.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models-1536x772.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models-2048x1030.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-469\" class=\"wp-caption-text\">Figure 17. Intento MT Studio. Comparing Models<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">We usually also take a deeper look at what is behind these scores by selecting the custom model we want to compare, a baseline model for reference, and the desired score for the training analysis. MT Studio then generates a scatter plot that visually represents the performance of each model compared to the baseline.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This scatter plot provides an easy way to observe the translation quality differences between custom and baseline models. By analyzing this data, we can identify which models perform better and decide whether further optimization or improvements are needed for the custom models. This valuable insight can help us achieve better translation results tailored to specific needs.<\/span><\/p>\n<figure id=\"attachment_471\" aria-describedby=\"caption-attachment-471\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-471 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot-1-1024x686.png\" alt=\"\" width=\"800\" height=\"536\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot-1-1024x686.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot-1-300x201.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot-1-768x515.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot-1-1536x1029.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot-1-2048x1372.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-471\" class=\"wp-caption-text\">Figure 18. Intento MT Studio. Comparing Models. Scatter Plot<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">In the scatter plot, if a data point is above the diagonal line, it indicates an improvement in the score compared to the baseline model. Conversely, if a data point is below the diagonal, it signifies a decrease in the score. The average score for the custom model is greatly influenced by the most distant segments from the diagonal line.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If the score did not improve significantly for a model, the segments below the diagonal are likely the cause. Intento MT Studio allows our team to examine these specific segments, providing insight into areas where the model underperforms compared to the baseline. By analyzing these segments, we can identify opportunities for further optimization to enhance the custom model&#8217;s overall performance.<\/span><\/p>\n<figure id=\"attachment_472\" aria-describedby=\"caption-attachment-472\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-472 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Details-1024x637.png\" alt=\"\" width=\"800\" height=\"498\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Details-1024x637.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Details-300x187.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Details-768x478.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Details-1536x956.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Details-2048x1275.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-472\" class=\"wp-caption-text\">Figure 19. Intento MT Studio. Comparing Models. Scatter Plot. Details<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Examining the details of segments with decreased scores may help analysts understand the reasons behind the decline. They can request that language reviewers assess these segments and provide comments or feedback on the quality of the translation. This information can then be forwarded to the machine translation provider.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">By communicating this feedback to the MT provider, they can better understand the specific issues affecting that particular case. This collaboration enables both parties to work together in addressing the underlying problems, ultimately leading to improvements and optimizations in the custom translation model. Such a targeted approach can help ensure that the custom model consistently achieves high-quality translations across various content types.<\/span><\/p>\n<figure id=\"attachment_473\" aria-describedby=\"caption-attachment-473\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-473 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Segments-Review-1024x633.png\" alt=\"\" width=\"800\" height=\"495\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Segments-Review-1024x633.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Segments-Review-300x185.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Segments-Review-768x474.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Segments-Review-1536x949.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Comparing-Models.-Scatter-Plot.-Segments-Review-2048x1265.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-473\" class=\"wp-caption-text\">Figure 20. Intento MT Studio. Comparing Models. Scatter Plot. Segments Review<\/figcaption><\/figure>\n<h2><span style=\"font-weight: 400;\">Glossary adaptation. Google Cloud. Configuration<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Whether you have enough training data or not, you could improve the quality of MT even more by using glossaries.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">To use a glossary with your custom translation model, follow these steps:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Create object storage and a bucket: Use the platform&#8217;s user interface (UI) to set up and create a new storage bucket.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Upload the glossary: The following steps require using an API. With the command line API tools, upload your glossary in a TSV, CSV, or TMX format to the storage bucket you created earlier.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\">After completing these steps, your glossary will be available for translation tasks. You can access it through the API or the platform&#8217;s console. By incorporating a glossary, you can ensure that specific terminology is consistently and accurately translated across all your projects, enhancing the overall quality of your custom translation model.<\/span><\/p>\n<figure id=\"attachment_474\" aria-describedby=\"caption-attachment-474\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-474 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-1--1024x192.png\" alt=\"\" width=\"800\" height=\"150\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-1--1024x192.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-1--300x56.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-1--768x144.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-1--1536x288.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-1--2048x384.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-474\" class=\"wp-caption-text\">Figure 21. Creating Glossary. Step 1<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_476\" aria-describedby=\"caption-attachment-476\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-476 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-2-1-1024x323.png\" alt=\"\" width=\"800\" height=\"252\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-2-1-1024x323.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-2-1-300x95.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-2-1-768x243.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-2-1-1536x485.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Creating-Glossary.-Step-2-1-2048x647.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-476\" class=\"wp-caption-text\">Figure 22. Creating Glossary. Step 2<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Once you have added the glossary through the API, Google provides two translation results: one with the glossary applied and the other without it. This allows you to compare the translations and assess the impact of using the glossary on translation quality.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Consider that updating the glossary can lead to downtime, as it typically requires you to delete the existing glossary, update it, and then add it again. To minimize downtime, Intento uses a green-blue deployment scheme. This approach involves creating a shadow copy of the glossary, updating it, and adding it to the system without disrupting the translation process.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Using the green-blue deployment scheme ensures seamless integration of updated glossaries, allowing you to maintain consistent and accurate translations while avoiding potential disruptions to your translation workflows.<\/span><\/p>\n<h3><span style=\"font-weight: 400;\">Glossary adaptation. Other platforms and considerations.<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Various machine translation providers support glossaries, though they may differ in glossary types, features, and limitations. Some providers, like DeepL and SYSTRAN, support morphology, while others may impose size limits on glossaries, which can be an issue for companies with large glossaries. Intento offers an on-the-top glossary search and replacement feature that can handle glossaries of any size.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Here is an overview of some popular providers&#8217; glossary capabilities and limitations (as of 2023):<\/span><\/p>\n<ol>\n<li><span style=\"font-weight: 400;\"> Amazon: no morphology, limits on term length, languages, number of glossaries per region, and glossary size (10 MB).<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> DeepL: morphology support, limitations on some language pairs, glossary size (5K), and special characters.<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> IBM: one glossary per custom model, a limit on glossary size (10 MB).<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> Google: a limit on glossary size (10 MB).<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> Microsoft: dynamic or trained-in glossaries.<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> Systran: 20K terms per glossary.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\">When selecting a machine translation provider, consider the glossary features and limitations that best align with your company&#8217;s needs, ensuring that your translations are consistent and accurate.<\/span><\/p>\n<h3><span style=\"font-weight: 400;\">Intento glossary management.<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Intento simplifies using a single glossary across different machine translation providers. You can create, edit, and add glossaries to your MT configurations in the user interface. Behind the scenes, Intento compiles glossaries to meet the specific requirements of each provider. To avoid downtimes during glossary updates, Intento employs the green-blue deployment scheme.<\/span><\/p>\n<figure id=\"attachment_477\" aria-describedby=\"caption-attachment-477\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-477 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318716-y-1024x827.jpg\" alt=\"\" width=\"800\" height=\"646\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318716-y-1024x827.jpg 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318716-y-300x242.jpg 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318716-y-768x620.jpg 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318716-y.jpg 1280w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-477\" class=\"wp-caption-text\">Figure 23. Intento Glossaries. Translation Preview<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">For providers that support glossaries, such as Google and DeepL, Intento utilizes their built-in glossary features. For other providers without native glossary support, Intento implements glossaries on top of the translation using their proprietary technology.<\/span><\/p>\n<figure id=\"attachment_478\" aria-describedby=\"caption-attachment-478\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-478 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Glossary-1024x327.png\" alt=\"\" width=\"800\" height=\"255\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Glossary-1024x327.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Glossary-300x96.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Glossary-768x245.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Glossary-1536x491.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/Intento-MT-Studio.-Glossary-2048x654.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-478\" class=\"wp-caption-text\">Figure 24. Intento Glossaries<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_479\" aria-describedby=\"caption-attachment-479\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-479 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318713-y-1024x820.jpg\" alt=\"\" width=\"800\" height=\"641\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318713-y-1024x820.jpg 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318713-y-300x240.jpg 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318713-y-768x615.jpg 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/telegram-cloud-photo-size-2-5255702380505318713-y.jpg 1280w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-479\" class=\"wp-caption-text\">Figure 25. Intento Glossaries. Glossary Applied<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Once the glossary is added to your MT configuration, your translations will adhere to the terminology specified in the glossary.<\/span><\/p>\n<figure id=\"attachment_480\" aria-describedby=\"caption-attachment-480\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-480 size-large\" src=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/image-7-1024x678.png\" alt=\"\" width=\"800\" height=\"530\" srcset=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/image-7-1024x678.png 1024w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/image-7-300x199.png 300w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/image-7-768x508.png 768w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/image-7-1536x1017.png 1536w, https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/image-7.png 1556w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-480\" class=\"wp-caption-text\">Figure 26. Intento Glossaries. Adding glossary to the routing<\/figcaption><\/figure>\n<h2><span style=\"font-weight: 400;\">Key takeaways<\/span><\/h2>\n<ol>\n<li><span style=\"font-weight: 400;\"> Understanding the training and evaluation processes in machine translation systems is crucial for achieving high-quality translations. Do not rely on MT quality scores only. Following proper processes and guidelines can help prevent issues.\u00a0<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> Custom machine translation platforms allow users to train and optimize their models while incorporating glossaries for consistent terminology. Consider the differences in dataset requirements, types of adaptation, model updating methods, and migration capabilities when choosing a platform.<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> Use valuable tools for streamlining the training and evaluation of custom machine translation models across multiple vendors. Run side-by-side comparison of stock and custom models, to get insights into translation quality improvements and areas for potential optimization.<\/span><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Domain and glossary adaptation<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Mastering Custom Machine Translation: A Practical Guide - inten.to\/machine-translation-university\/<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Mastering Custom Machine Translation: A Practical Guide - inten.to\/machine-translation-university\/\" \/>\n<meta property=\"og:description\" content=\"Domain and glossary adaptation\" \/>\n<meta property=\"og:url\" content=\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/\" \/>\n<meta property=\"og:site_name\" content=\"inten.to\/machine-translation-university\/\" \/>\n<meta property=\"article:published_time\" content=\"2022-12-12T23:27:55+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-03-25T22:54:43+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-1024x302.png\" \/>\n<meta name=\"author\" content=\"sergei.polikarpov@inten.to\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"sergei.polikarpov@inten.to\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"16 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/\",\"url\":\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/\",\"name\":\"Mastering Custom Machine Translation: A Practical Guide - inten.to\/machine-translation-university\/\",\"isPartOf\":{\"@id\":\"https:\/\/inten.to\/machine-translation-university\/#website\"},\"datePublished\":\"2022-12-12T23:27:55+00:00\",\"dateModified\":\"2023-03-25T22:54:43+00:00\",\"author\":{\"@id\":\"https:\/\/inten.to\/machine-translation-university\/#\/schema\/person\/1aa9e5874e74cbf37313324ccc703af0\"},\"breadcrumb\":{\"@id\":\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"MT University\",\"item\":\"https:\/\/inten.to\/machine-translation-university\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Mastering Custom Machine Translation: A Practical Guide\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/inten.to\/machine-translation-university\/#website\",\"url\":\"https:\/\/inten.to\/machine-translation-university\/\",\"name\":\"inten.to\/machine-translation-university\/\",\"description\":\"Intento MT University\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/inten.to\/machine-translation-university\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/inten.to\/machine-translation-university\/#\/schema\/person\/1aa9e5874e74cbf37313324ccc703af0\",\"name\":\"sergei.polikarpov@inten.to\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/inten.to\/machine-translation-university\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/1fbab3532c586e5c65e28bb673c63bb7?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/1fbab3532c586e5c65e28bb673c63bb7?s=96&d=mm&r=g\",\"caption\":\"sergei.polikarpov@inten.to\"},\"url\":\"https:\/\/inten.to\/machine-translation-university\/author\/sergei-polikarpovinten-to\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Mastering Custom Machine Translation: A Practical Guide - inten.to\/machine-translation-university\/","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/","og_locale":"en_US","og_type":"article","og_title":"Mastering Custom Machine Translation: A Practical Guide - inten.to\/machine-translation-university\/","og_description":"Domain and glossary adaptation","og_url":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/","og_site_name":"inten.to\/machine-translation-university\/","article_published_time":"2022-12-12T23:27:55+00:00","article_modified_time":"2023-03-25T22:54:43+00:00","og_image":[{"url":"https:\/\/inten.to\/machine-translation-university\/wp-content\/uploads\/2022\/12\/8-1-1024x302.png"}],"author":"sergei.polikarpov@inten.to","twitter_card":"summary_large_image","twitter_misc":{"Written by":"sergei.polikarpov@inten.to","Est. reading time":"16 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/","url":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/","name":"Mastering Custom Machine Translation: A Practical Guide - inten.to\/machine-translation-university\/","isPartOf":{"@id":"https:\/\/inten.to\/machine-translation-university\/#website"},"datePublished":"2022-12-12T23:27:55+00:00","dateModified":"2023-03-25T22:54:43+00:00","author":{"@id":"https:\/\/inten.to\/machine-translation-university\/#\/schema\/person\/1aa9e5874e74cbf37313324ccc703af0"},"breadcrumb":{"@id":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/inten.to\/machine-translation-university\/mastering-custom-machine-translation-a-practical-guide\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"MT University","item":"https:\/\/inten.to\/machine-translation-university\/"},{"@type":"ListItem","position":2,"name":"Mastering Custom Machine Translation: A Practical Guide"}]},{"@type":"WebSite","@id":"https:\/\/inten.to\/machine-translation-university\/#website","url":"https:\/\/inten.to\/machine-translation-university\/","name":"inten.to\/machine-translation-university\/","description":"Intento MT University","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/inten.to\/machine-translation-university\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/inten.to\/machine-translation-university\/#\/schema\/person\/1aa9e5874e74cbf37313324ccc703af0","name":"sergei.polikarpov@inten.to","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/inten.to\/machine-translation-university\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/1fbab3532c586e5c65e28bb673c63bb7?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/1fbab3532c586e5c65e28bb673c63bb7?s=96&d=mm&r=g","caption":"sergei.polikarpov@inten.to"},"url":"https:\/\/inten.to\/machine-translation-university\/author\/sergei-polikarpovinten-to\/"}]}},"_links":{"self":[{"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/posts\/285"}],"collection":[{"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/comments?post=285"}],"version-history":[{"count":3,"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/posts\/285\/revisions"}],"predecessor-version":[{"id":481,"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/posts\/285\/revisions\/481"}],"wp:attachment":[{"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/media?parent=285"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/categories?post=285"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/inten.to\/machine-translation-university\/wp-json\/wp\/v2\/tags?post=285"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}