Wednesday, 4 June
Sovee Smart Engine 2.0: A Leap Beyond Base Moses Technology
Scott Gaskill, Sovee
TAUS Machine Translation Showcase 2014
Dublin (Ireland)
The research within the project MosesCore leading to these results has received funding from the European Union 7th Framework Programme, grant agreement no 288487
Presented by: Scott Gaskill
Christopher Klapp
June 4, 2014
MT Showcase
3	
  
I skate to where the puck is going to be, not where it has been.
Wayne Gretzky, Hockey Star
4	
  
Where is the world going?
CNNTech, “Google boss: Entire world will be online by 2020,” April 2013 http://www.cnn.com/2013/04/15/tech/web/eric-schmidt-internet
Kenya stat from ITU, 2-13. Photo used by permission of Deseret News.
2016 the world will have internet connectivity
By the end of this decade everyone in the world will be on the Web, with mobile access growing as the preferred interface
In Kenya, 99% of Internet connections are mobile
5	
  
We are entering the Convergence era: translation will be a utility embedded in every app, device and screen. Businesses will prosper by finding new customers in new markets…. Consumers will become world-wise, communicating as if language barriers never existed.
Jaap van der Meer, Director of TAUS, 2013
6	
  
Translation Memory – Is More Better?
If we simply add an additional 1,000 TM lines to a database of 40-60 billion, will we see better translations?
Knowing how to use the data is key
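To put the question above in numbers, here is a rough back-of-the-envelope calculation in Python. It only restates the slide's own figures (1,000 added TM lines, a database of 40-60 billion lines); the function name `relative_growth` is an illustrative assumption, not something from the presentation.

```python
def relative_growth(added_lines: int, database_lines: int) -> float:
    """Return the fraction by which the database grows when lines are added."""
    return added_lines / database_lines


# The slide's range for the existing database size.
for size in (40_000_000_000, 60_000_000_000):
    growth = relative_growth(1_000, size)
    print(f"{size:,} lines: relative growth {growth:.2e}")
```

The added lines change the database size by only a few hundred-millionths, which fits the slide's point that how the data is used matters more than raw volume.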
  
7	
  
Challenges
Technology, approach & process
Progress in first 60 years | Progress Needed by 2016
Engines for < 150 languages | Engines for > 6000 languages
< 3% of the world’s content translated | All content translated
Cloud-based speed providing more servers for translation | 92 billion servers
Statistical translation introduced, but “fuzzy logic” does not deliver quality businesses need | Quality improvement to standards required to meet world commerce demand
  
8	
  
Technology Challenge
6800 languages
$4\left(\frac{n(n-1)}{2}\right)$
MT Assets (cascades) | Minimum Server Requirements
Generic SMT | 92 million
Generic SMT, Domain | 9.2 billion – based on 100 businesses
Generic SMT, Domain, Customer | 92 billion – based on 1000 customers
Generic SMT, Domain, Customer, Project | Not valued as practical – infinite servers required
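The server figures on this slide can be reproduced from its formula. The sketch below is a minimal Python check, assuming n is the number of languages (6800 on the slide) and that the business and customer figures are simple multiples of the generic figure. The slide does not say what the factor 4 stands for, so it is kept as a plain parameter; the function names are illustrative.

```python
def language_pairs(n: int) -> int:
    """Number of unordered language pairs among n languages: n(n-1)/2."""
    return n * (n - 1) // 2


def generic_engines(n: int, factor: int = 4) -> int:
    """The slide's 4 * n(n-1)/2 figure; the meaning of the factor is not stated."""
    return factor * language_pairs(n)


base = generic_engines(6800)
print(f"Generic SMT:      {base:,}")          # about 92 million
print(f"x 100 businesses: {base * 100:,}")    # about 9.2 billion
print(f"x 1000 customers: {base * 1000:,}")   # about 92 billion
```

The exact values round to the 92 million, 9.2 billion and 92 billion shown on the slide.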
  
9	
  
Accuracy Challenge
Relevant Segments | General Corpus
Adequacy
Accuracy
General MT (30-40%)
TM (40-60%)
Post Editing (up to 100%)
Preparing new project / import
TM / CAT
Leverage Exact Fuzzy Match
Post Edit
Review
Deliver to customer
Gather past TM
Package and send TM to SMT provider
Clean, tokenize data (prepare data)
Train-Tune-Test (3Ts)
Repeat until viewed as acceptable (repeat with customer data each time)
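The data preparation and Train-Tune-Test steps listed above can be sketched roughly as follows. This is a minimal illustration in Python, not the tooling used by any SMT provider: the cleaning rules, the regular-expression tokenizer, the 10% split fractions, and the names `clean`, `tokenize` and `prepare` are all assumptions made for the example. The actual training and tuning would be done by an SMT toolkit and are not shown.

```python
import random
import re


def clean(pair):
    """Normalize whitespace; drop pairs that are empty or badly length-mismatched."""
    src, tgt = (re.sub(r"\s+", " ", s).strip() for s in pair)
    if not src or not tgt:
        return None
    ratio = len(src) / len(tgt)
    if ratio > 3 or ratio < 1 / 3:
        return None
    return src, tgt


def tokenize(text):
    """Very rough tokenizer: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text)


def prepare(pairs, seed=0, tune_frac=0.1, test_frac=0.1):
    """Clean and tokenize TM pairs, then split them into train, tune and test sets."""
    cleaned = [c for c in map(clean, pairs) if c is not None]
    tokenized = [(tokenize(s), tokenize(t)) for s, t in cleaned]
    random.Random(seed).shuffle(tokenized)
    n_test = max(1, int(len(tokenized) * test_frac))
    n_tune = max(1, int(len(tokenized) * tune_frac))
    return {
        "test": tokenized[:n_test],
        "tune": tokenized[n_test:n_test + n_tune],
        "train": tokenized[n_test + n_tune:],
    }


# Hypothetical past TM, including one unusable pair with an empty source.
past_tm = [
    ("Good   morning.", "Buenos días."),
    ("Thank you.", "Gracias."),
    ("", "orphan target"),
] * 5
splits = prepare(past_tm)
print({name: len(rows) for name, rows in splits.items()})
```

Because the slide says the cycle is repeated with customer data each time, the whole `prepare` step would be rerun for every new batch in this conventional workflow.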
  
10
Post Editing | Learning Engine | SMT Workflow
Segments are not just a string of text – they are living, learning entities
Process
Real-time Automation and Integration
Sovee Smart Engine 2.0
  
11	
  
Smart Engine Advantages
Language from Scratch
Seamless integration to Post Editing workflow
Training / Learning
Efficiency Gains (what we have seen)
Post editing – 50%+ improvement
TM/MT management and training – 100% improvement
Update MT on the fly
Watch it learn before your eyes
Never leave the post editing environment
  
12	
  
Learned Translations
Relevant Segments | General Corpus
Adequacy
Accuracy
General MT
Domain
Organization
Customer
Project
Asset Tags
Cascading Assets | Sovee Smart Engine MT
Learned Segments
Segment output
Asset Synchrony (CAT Tools)
Post editing interface
Smart Engine
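One way to picture the cascading assets on this slide is a lookup that tries the most specific level first and falls back toward general MT. The Python sketch below is only an illustration under that assumption. The slide names the levels but does not state their precedence, and `CASCADE`, `translate`, the sample data and the stand-in `generic_mt` function are all hypothetical.

```python
# Assumed precedence: most specific asset level first.
CASCADE = ["project", "customer", "organization", "domain"]


def translate(segment, assets, generic_mt):
    """Return (translation, level) from the most specific asset that knows the segment."""
    for level in CASCADE:
        learned = assets.get(level, {})
        if segment in learned:
            return learned[segment], level
    # No learned segment at any level: fall back to general MT.
    return generic_mt(segment), "general_mt"


# Hypothetical assets: a general-domain entry overridden by a project entry.
assets = {
    "domain": {"driver": "conductor"},
    "project": {"driver": "driver"},
}


def generic_mt(segment):
    """Stand-in for a general MT engine."""
    return f"[MT] {segment}"


print(translate("driver", assets, generic_mt))
print(translate("putt", assets, generic_mt))
```

Under this ordering a project-level learned segment overrides a domain-level one, and segments unknown at every level fall through to the general engine.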
  
13	
  
1 Asset Push (Past TM)
2 Real-time progressive translation cycle (Sovee MT, save/push post edits)
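The two steps on this slide, pushing past TM and then cycling between suggestions and saved post-edits, can be sketched as a small in-memory store. This is a hedged illustration in Python, not Sovee's implementation: the class `LearningStore`, the 0.85 fuzzy threshold, the use of `difflib.SequenceMatcher` for fuzzy matching and the stand-in MT function are all assumptions. The sample sentence pair is the one shown later in the deck.

```python
from difflib import SequenceMatcher


class LearningStore:
    """Keeps learned segments and prefers them over raw MT output."""

    def __init__(self, mt, fuzzy_threshold=0.85):
        self.mt = mt
        self.learned = {}
        self.fuzzy_threshold = fuzzy_threshold

    def push_assets(self, tm_pairs):
        """Step 1: load past TM as learned segments."""
        self.learned.update(tm_pairs)

    def suggest(self, source):
        """Step 2: exact match first, then fuzzy match, then MT."""
        if source in self.learned:
            return self.learned[source], "exact"
        best, best_score = None, 0.0
        for known in self.learned:
            score = SequenceMatcher(None, source, known).ratio()
            if score > best_score:
                best, best_score = known, score
        if best is not None and best_score >= self.fuzzy_threshold:
            return self.learned[best], "fuzzy"
        return self.mt(source), "mt"

    def save_post_edit(self, source, edited):
        """Step 2, continued: store the edit so the next suggestion uses it."""
        self.learned[source] = edited


store = LearningStore(mt=lambda s: f"[MT] {s}")
store.push_assets({"ホールインワンを決めたよ!": "I just scored a hole-in-one!"})

print(store.suggest("ホールインワンを決めたよ"))   # near match to the pushed TM
print(store.suggest("ナイスショット"))             # unknown segment, goes to MT
store.save_post_edit("ナイスショット", "Nice shot!")
print(store.suggest("ナイスショット"))             # now served from the saved edit
```

The point of the sketch is the ordering: a post-edit saved in one call is available to the very next suggestion, with no separate retraining pass.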
14	
  
Demo	
  
15	
  
Seamless Integration
“Convergence Era”
Apps
Websites
eCommerce
elearning
Videos
Podcasts
Software
Live chat
Text Messages
email
Japan | Sovee Smart Engine Translation | USA
Yukiko (Japan): ホールインワンを決めたよ!
Robert (USA): I just scored a hole-in-one!
Original: ホールインワンを決めたよ!
Japan
SNAG
  
17	
  
Jack Nicklaus Learning Leagues
Languages: Spanish and Japanese
In Process: 10 more languages
Video and Training Materials for Golf Instruction
  
18	
  
R.E. Michel
  
19	
  
Questions?
  

Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 

TAUS MT Showcase, Sovee Smart Engine 2.0, A Leap Beyond Base Moses Technology, Scott Gaskill, Sovee

  • 1. Wednesday, 4 June. Sovee Smart Engine 2.0: A Leap Beyond Base Moses Technology. Scott Gaskill, Sovee. TAUS Machine Translation Showcase 2014, Dublin (Ireland). The research within the project MosesCore leading to these results has received funding from the European Union 7th Framework Programme, grant agreement no 288487.
  • 2. Presented by: Scott Gaskill, Christopher Klapp. June 4, 2014. MT Showcase.
  • 3. "I skate to where the puck is going to be, not where it has been." Wayne Gretzky, Hockey Star
  • 4. Where is the world going? 2016: the world will have internet connectivity. By the end of this decade everyone in the world will be on the Web, with mobile access growing as the preferred interface. In Kenya, 99% of Internet connections are mobile. Sources: CNNTech, “Google boss: Entire world will be online by 2020,” April 2013, http://www.cnn.com/2013/04/15/tech/web/eric-schmidt-internet. Kenya stat from ITU, 2-13. Photo used by permission of Deseret News.
  • 5. "We are entering the Convergence era: translation will be a utility embedded in every app, device and screen. Businesses will prosper by finding new customers in new markets…. Consumers will become world-wise, communicating as if language barriers never existed." Jaap van der Meer, Director of TAUS, 2013
  • 6. Translation Memory: Is More Better? If we simply add an additional 1,000 TM lines to a database of 40-60 billion, will we see better translations? Knowing how to use the data is key.
  • 7. Challenges: technology, approach & process. Progress in first 60 years versus progress needed by 2016:
      - Engines for < 150 languages / Engines for > 6000 languages
      - < 3% of the world’s content translated / All content translated
      - Cloud-based speed providing more servers for translation / 92 billion servers
      - Statistical translation introduced, but “fuzzy logic” does not deliver the quality businesses need / Quality improvement to the standards required to meet world commerce demand
  • 8. Technology Challenge: 6800 languages. Formula shown: 4 × n(n-1)/2. Minimum server requirements by MT asset cascade:
      - Generic SMT: 92 million
      - Generic SMT + Domain: 9.2 billion (based on 100 businesses)
      - Generic SMT + Domain + Customer: 92 billion (based on 1000 customers)
      - Generic SMT + Domain + Customer + Project: not valued as practical (infinite servers required)
  • 9. Accuracy Challenge. Relevant Segments; General Corpus; Adequacy; Accuracy. General MT (30-40%), TM (40-60%), Post Editing (up to 100%).
  • 10. Sovee Smart Engine 2.0: process real-time automation and integration. Diagram labels: Post Editing; Learning Engine; SMT Workflow.
      - Steps, first sequence: Preparing new project / import; TM / CAT; Leverage Exact and Fuzzy Match; Post Edit; Review; Deliver to customer
      - Steps, second sequence: Gather past TM; Package and send TM to SMT provider; Clean and tokenize data (prepare data); Train, Tune, Test (3Ts); Repeat until viewed as acceptable (repeat with customer data each time)
      - Segments are not just a string of text: they are living, learning entities
  • 11. Smart Engine Advantages: Language from Scratch; Seamless integration to Post Editing workflow; Training / Learning. Efficiency Gains (what we have seen): Post editing, 50%+ improvement; TM/MT management and training, 100% improvement. Update MT on the fly. Watch it learn before your eyes. Never leave the post editing environment.
  • 12. Learned Translations. Relevant Segments; General Corpus; Adequacy; Accuracy. Cascading Assets: General MT; Domain; Organization; Customer; Project; Asset Tags. Sovee Smart Engine MT; Learned Segments; Segment output.
  • 13. Asset Synchrony (CAT Tools). Components: Post editing interface; Smart Engine. Step 1: Asset Push (Past TM). Step 2: Real-time progressive translation cycle (Sovee MT, save / push post edits).
  • 15. Seamless Integration, “Convergence Era”: Apps, Websites, eCommerce, eLearning, Videos, Podcasts, Software, Live chat, Text Messages, email.
  • 16. Japan / Sovee Smart Engine Translation / USA. Yukiko (Japan): ホールインワンを決めたよ! Robert (USA): I just scored a hole-in-one! Original: ホールインワンを決めたよ! Japan SNAG
  • 17. Jack Nicklaus Learning Leagues. Languages: Spanish and Japanese. In Process: 10 more languages. Video and Training Materials for Golf Instruction.
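
The server estimate on slide 8 can be checked with a few lines of arithmetic. The sketch below evaluates the slide's expression 4 × n(n-1)/2 for 6800 languages and then scales it by the 100-business and 1000-customer figures. The slide does not say what the factor 4 represents, and treating the cascade tiers as simple multiplication is an assumption made here because it matches the rounded figures on the slide. The function names are hypothetical and not part of any Sovee or Moses API.

```python
from math import comb

LANGUAGES = 6800  # language count quoted on slide 8
FACTOR = 4        # constant from the slide's formula; its meaning is not stated there


def generic_engines(n_languages: int, factor: int = FACTOR) -> int:
    """Evaluate factor * n(n-1)/2, i.e. factor times the number of unordered language pairs."""
    return factor * comb(n_languages, 2)


def cascaded_engines(n_languages: int, multiplier: int) -> int:
    """Scale the generic count by a tier multiplier (assumed to be plain multiplication)."""
    return generic_engines(n_languages) * multiplier


if __name__ == "__main__":
    print(f"Generic SMT:            {generic_engines(LANGUAGES):,}")
    print(f"+ Domain (100 firms):   {cascaded_engines(LANGUAGES, 100):,}")
    print(f"+ Customer (1000):      {cascaded_engines(LANGUAGES, 1000):,}")
```

Under these assumptions the three values come out at roughly 92 million, 9.2 billion and 92 billion, which agrees with the rounded numbers on the slide. The project tier has no finite multiplier on the slide, so it is left out.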