Note: Currently new registrations are closed, if you want an account Contact us

Difference between revisions of "SMC/SoC/2008"

From FSCI Wiki
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
Participation of SMC in GSOC 2008 is not confirmed. Use this page for collecting the Project Ideas
[[SMC/SoC/2007|SMC in Google Summer of Code 2007]]
 
[[User:Santhosh|Santhosh Thottingal]] will be lead admin this year, and will deal with the administrative stuff with Google for SMC to be a mentoring organisation
 
==Ideas for Google Summer of Code 2008==
==Ideas for Google Summer of Code 2008==
===Tokenizer/Lemmatiser for malayalam for GATE===
===Tokenizer/Lemmatiser for malayalam for GATE===
Write a Lemmatiser for Malayalam. See whether we can do a  plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. IGoogle search GATE,download and install GATE , and in the plugins directory a hindi tokenizer and lemmatiser is available.
Write a Lemmatiser for Malayalam. See whether we can do a  plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. Google search GATE,download and install GATE , and in the plugins directory a hindi tokenizer and lemmatiser is available.
 
=== Functional Optical character Recognition system===
=== Functional Optical character Recognition system===
Add malayalam Support for tesseract OCR.
Add malayalam Support for tesseract OCR.
Line 18: Line 22:


===Rewrite the Dhvani sound system with SDL===
===Rewrite the Dhvani sound system with SDL===
#Rewrite the ALSA sound system of dhvani with [http://www.libsdl.org/ SDL] to make it a cross platform application
#Rewrite the ALSA sound system of [[Dhvani|Dhvani]] with [http://www.libsdl.org/ SDL] to make it a cross platform application
#Packaging for different platforms
#Packaging for different platforms
#Bug fixes for langauge modules and Code clean up
#Bug fixes for langauge modules and Code clean up
#Adding pitch/volume/pause support for the generated speech
#Adding pitch/volume/pause support for the generated speech


===Localization of Free Content Management Systems to Malayalam-Drupal/Joomla===
===Localization of Free Content Management Systems to Malayalam-Drupal &Joomla ===
100% localization of Drupal and Joomla CMS systems to Malayalam
100% localization of Drupal and Joomla CMS systems to Malayalam
===Speech recognition system for Malayalam===
#Develop a speech recognition system for Malayalam using the concepts of memory prediction framework


==How to Apply ==
==How to Apply ==


see http://code.google.com/soc/2008/faqs.html
#see http://code.google.com/soc/2008/faqs.html
#[http://wiki.debian.org/SummerOfCode2008/StudentApplicationTemplate Student Application Template]


==Selection procedure ==
==Selection procedure ==
http://code.google.com/soc/2008/faqs.html
==Guidelines for Students ==
==Guidelines for Students ==
==Guidelines for Mentors ==
==Guidelines for Mentors ==
[http://www.gnome.org/~federico/docs/summer-of-code-mentoring-howto/index.html Summer of Code Mentoring HOWTO]

Revision as of 19:21, 12 March 2008

SMC in Google Summer of Code 2007

Santhosh Thottingal will be lead admin this year, and will deal with the administrative stuff with Google for SMC to be a mentoring organisation

Ideas for Google Summer of Code 2008

Tokenizer/Lemmatiser for malayalam for GATE

Write a Lemmatiser for Malayalam. See whether we can do a plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. Google search GATE,download and install GATE , and in the plugins directory a hindi tokenizer and lemmatiser is available.

Functional Optical character Recognition system

Add malayalam Support for tesseract OCR.

  • Study tesseract OCR system
  • Recogntion of all characters
  • Layout recogization using ocropus (optional ?)

http://code.google.com/p/tesseract-ocr/ http://code.google.com/p/ocropus/

Write a Gnome Speech Driver for Dhvani and Integrate it with Orca

  1. Orca for visually impaired users uses gnome speech for speech engines. Currently Festival, Espeak, freetts etc have drivers for gnome speech. We need to write a driver for dhvani.
  2. Develop plugins for KTTS/Gedit/Firefox

Rewrite the Dhvani sound system with SDL

  1. Rewrite the ALSA sound system of Dhvani with SDL to make it a cross platform application
  2. Packaging for different platforms
  3. Bug fixes for langauge modules and Code clean up
  4. Adding pitch/volume/pause support for the generated speech

Localization of Free Content Management Systems to Malayalam-Drupal &Joomla

100% localization of Drupal and Joomla CMS systems to Malayalam

Speech recognition system for Malayalam

  1. Develop a speech recognition system for Malayalam using the concepts of memory prediction framework

How to Apply

  1. see http://code.google.com/soc/2008/faqs.html
  2. Student Application Template

Selection procedure

http://code.google.com/soc/2008/faqs.html

Guidelines for Students

Guidelines for Mentors

Summer of Code Mentoring HOWTO