Research & Innovation


View pages in this document

We propose a hybrid architecture for high quality machine translation which combines the strengths of both approaches and minimizes their weaknesses: At the core is a rule-based MT system which provides morphology, declarative grammars, semantic categories, and small dictionaries, but which avoids all expensive kinds of intellectual knowledge acquisition. Instead of manually working out large dictionaries and compiling information on disambiguation preference, we suggest a novel corpus-based bootstrapping method for automatically expanding dictionaries, and for training the analytical performance and the choice of transfer alternatives.

This is a Marie Curie FP7 project in collaboration with Lingenio, Heidelberg, a small company developing and selling rule based MT systems (Translate) for English/German/French (Spanish and Italian under development) and also Office Dictionaries based on the context sensitive Intellidict technology. The underlying technology was originally developed at the IBM Heidelberg research centre in a long term project.

Contact: Bogdan Babych

Pages in this document

  1. Research & Innovation
  2. Track record in funded projects
  5. eColo Family projects
  6. EvIDence
  7. Intellitext
  8. HyghTra
  9. Kelly
  10. LangCorp
  11. Mellange
  12. MITRAS
  13. MULIMO
  14. MyExhibition
  15. NNI
  16. ORCIT
  17. ReadingCorp
  18. TAUS
  19. TTC
  20. WebDoc