This site uses cookies.
Some of these cookies are essential to the operation of the site,
while others help to improve your experience by providing insights into how the site is being used.
For more information, please see the ProZ.com privacy policy.
Fragment repetition count without translation memory
Thread poster: Mar Jiménez Quesada
Mar Jiménez Quesada United Kingdom Local time: 17:44 English to Spanish + ...
Apr 11, 2017
Let´s assume that we have a file to translate that contains repetitions but there is no translation memory, so we are looking at the number of repeated fragments within the file. Does the software include in the repeated fragment count the first instance of a repeated fragment (i.e., the one that you will have to translate from scratch). For example, if you have the fragment "xxxxab", which appears 3 times, although the repeat count is 3, one of those will have to be translated from scra... See more
Let´s assume that we have a file to translate that contains repetitions but there is no translation memory, so we are looking at the number of repeated fragments within the file. Does the software include in the repeated fragment count the first instance of a repeated fragment (i.e., the one that you will have to translate from scratch). For example, if you have the fragment "xxxxab", which appears 3 times, although the repeat count is 3, one of those will have to be translated from scratch so it should not be counted as a repetition. Will the software then place 2 of those fragments in the repeated count and the other one in the "new fragment" count? ▲ Collapse
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Lianne van de Ven United States Local time: 12:44 Member (2008) English to Dutch + ...
It depends
Apr 11, 2017
on the software and on how you define a "fragment". Repeated words should not be counted as repeated content (although some software seems to "think" so). Repeated fragments should follow certain rules, although I don't know which ones - it depends on the software.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Emma Goldsmith Spain Local time: 18:44 Member (2004) Spanish to English
Segment or fragment?
Apr 12, 2017
Mar Jiménez Quesada wrote:
Does the software include in the repeated fragment count the first instance of a repeated fragment
By default, a segment is a standalone unit that usually ends with a full stop. So a sentence is the most common-sized segment. A fragment is a chunk or group of words within a segment. It isn't a standalone unit.
In the above explanation there are 4 segments. "a standalone unit" is a fragment that appears in two of those segments.
I expect you're referring to segments. In that case, where there are four identical segments, Studio considers the first as new content and the other three as repetitions. If you're talking about fragments, then if the fragment is long enough (supposing it accounts for 80% of the segment), then it will be picked up as a fuzzy match. To get Studio to account for repeated fragments (fuzzy matches) within a file and without a translation memory, simply mark "report internal fuzzy match leverage" in the Analyze Files batch task. Again, Studio will consider the first segment as new content, and subsequent "fuzzies" as such.
Translate faster & easier, using a sophisticated CAT tool built by a translator / developer.
Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools.
Download and start using CafeTran Espresso -- for free
Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.