Report statistics about fuzzy/translated/approved strings copied by TM Merge


Language service providers apply different rates for fuzzy matches in different percentage ranges.
To be able to calculate the cost, the project statistics need to be broken down.

Generate a summary page after TM merge for a project version that outlines the words copied in different state and breakdown of fuzzy strings copied with matching percentage. The fuzzy band can be static, or configurable depending on development time.

1) A configurable range of percentage by admin.
2) Be able to export the summary page into a shareable format. e.g. PDF


  • This only applies to TM merge function in the project version page. Will apply to the TM Merge in the editor if they share the same backend code. UPDATE: bad news: they don't!

  • Sends report through email to the requester at the end of the process

  • Notify requestor (e.g. a message in the modal) that a report will be sent to their email when the process is complete.

  • Fuzzy band configuration

    • Global set of band that can be configured by admin

    • allow the requester to customise the band in the TM merge modal (use the global config as default)

    • -Or project level band configuration - maintained by project maintainer.-

  • Future enhancement

    • Report in PDF format

    • Archive reports

    • Enhance the frontend to keep the states of the process and display the report when the process is complete.

    • Multiple vendors configuration (different band), allows project maintainer to select from vendors list


Sean Flanigan
May 21, 2018, 2:33 AM

Current limitations:

  1. If any "translation requests" are skipped due to validation errors or concurrent changes, these will still be reported as "copied" in the report FIXED

  2. Notify requestor that a report will be sent to their email when the process is complete. DONE

  3. No validation for band definitions on server configuration page

Michelle Kim
May 18, 2018, 2:55 AM

Yes Typos there. I fixed them to discount

Sean Flanigan
May 18, 2018, 2:41 AM

Thanks @mkim33, I guess "discuss" should be "discount"?

Michelle Kim
May 18, 2018, 2:22 AM

I have more detailed information on discount pricing for different fuzzy matches for your information :

0-74% match: New words, no discount

75% - 99% match: partial match, discount can be applied as following

Tier 1 75% - 84% match

Tier 2 85% - 94% match

Tier 3 95% - 99% match

and Tier 1 has the smallest discount and Tier 3 has the biggest discount. Since the Zanata can't provide fine-grained match rate, so only the Tier1 discount rate (the smallest discount) applied to our partially
matched contents.

Hope this helps,

Ready for Release


Sean Flanigan




Tested Version/s




Fix versions