GoogleSoC2008

From IndLinux
(Redirected from GoogleSoc2008)
Jump to: navigation, search

Google Summer of Code 2008


Latest update - IndLinux is not in here

Dates

  • Applications for mentoring organizations begin 3rd March 20008
  • Student applications start 24th March 2008

Guidelines for proposals

  • Some non-trivial task/activity, taking 3-4 months of devel work.Gsoc involves 8-9 weeks of fulltime activity.Lawgoff 12:06, 14 March 2008 (IST)
  • Ideas for tools to improve/enhance Indic I18N/L10N tasks.
  • Along with proposal, please weigh the benefits & importance of it.

Proposed Ideas

List ideas/proposals here. We could fine tune them.

  1. Indic Translation Management Framework
    1. Aim of this tool is to improve the translation workflow, which is currently mostly manual to a more automated system, where manual work is limited to commiting new translations and reviewing existing ones.
    2. It would allow tracking of all kind of work for a particular language/team it is configured for. Primary component would be the backend engine - involving
      1. Translation memory (database of translations)
      2. Dictionary (to keep a log of all english, and target language words)
      3. Spellchecker (allow spellchecking of translations)
      4. Validator (check for translation errors, when importing/exporting to other formats)
      5. Automated translations (for new strings coming in, based on existing translations).
    3. This system is to be developed in Python/C etc. Existing tools could be used where required (eg Translate Toolkit)
    4. Translation Frontend like Entrans/Pootle could be used.
    5. For more details refer TranslationFramework, TranslationDatabase, TFRequirements
  2. Hindi optical character recognition system.
    1. Aim for this project is to create a set of libraries and applications for character recognition for (Devanagari script) for Hindi using computers.
    2. This project will have two parts: (i) Character recognition from scanned images, (ii) handwriting recognition. Character recognition from scanned image can be restricted to fonts, but handwriting recognition has to be generic. Read more OCR.
  3. Dhvani Indian Langauge TTS related projects. More info
  4. Hindi Grammar Checker More Info
  5. "Bhaiyyaa" - a spoken dialog system for learning basics of internet More Info
Personal tools
Namespaces
Variants
Actions
Site
communication
Development
Resources
Activities
Toolbox