11:32:26 --> mary (~mary@203.200.123.2) has joined #indlinux 11:34:11 --> pamri (~guest@203.145.183.210) has joined #indlinux 12:11:26 --- You are now known as karunakar 12:15:04 karunakar: is the meet there, today? 12:19:52 sort of lets start discussing 12:20:10 --- karunakar has changed the topic to: INDEV meet - 12-6PM 12:20:21 lets begin with locales 12:20:40 esp collation sequence - thats missing 12:22:30 does ICU define collation rules also? 12:45:56 <-- mary has quit (Ping timeout: 600 seconds) 12:45:56 --> mary (~mary@203.200.123.2) has joined #indlinux 12:46:30 sorry guys... really bad conn today... 12:46:30 btw is this chat thingy being logged ? 12:47:28 pamri: alive ? 12:48:11 mary: yeah. 12:51:15 --- mary is now known as mary_at_lunch 12:55:31 --> ding0 (~ding0@216.236.98.130) has joined #indlinux 13:01:07 mary_at_lunch: yeah 13:01:20 pamri: yeah 13:03:34 <-- ding0 has quit (Client exiting) 13:06:38 --- mary_at_lunch is now known as mary 13:07:07 whatever happened to the thread on collation sequences ? 13:14:25 its on 13:14:37 mary: seem to be pretty impatient today.? 13:14:53 pamri: was wondering if i had missed something and all.. 13:15:07 ICU has some definitions for hindi, telugu etc 13:15:28 can someone summarize the current situation with collation sequences for the lesser mortals :-) 13:15:34 can the same locale be used for bsd? 13:15:47 may be we can then take it and make a todo thingy ... for each topic.. 13:15:53 arun has created a hindi locale for bsd 13:20:50 mary: ya will do that 13:20:58 pamri: links? 13:21:21 mary: to? 13:21:42 pamri: link to the hindi locale for bsd 13:22:01 hindi locale is there in glibc but lacks collation 13:22:52 http://www.sharma-home.net/~adsharma/projects/hi_IN/ 13:23:11 it is for iscii..shd convert it to utf-8. 13:23:43 mary: http://www.sharma-home.net/~adsharma/projects/hi_IN/iscii-dev/ 13:25:35 a good intro to creating locales on freebsd: http://www.topolis.lt/squirrelmail/freebsd/ 13:25:57 pamri: use the script .. hehe 13:27:18 mary: yup. we need a script which converts fr. dev utf-8 to other indic utf-8. eg: unicode val of dev A to corresponding A of any indic language 13:27:25 pamri: here were are dealing with locales on linux - glibc ones 13:27:53 pamri: why that script 13:28:58 pamri: iscii -> utf8 is tho straight forward.. and also converting to interconverting between indian languages is also easy because they are based on iscii and differ mostly only by multiples of constant offsets .. 13:29:16 for eg: you can convert hindi locale to kannada locale...or a dev keymap to kan keymap.. 13:29:38 currently glibc has - hindi, telugu, tamil & arabic + english for indic 13:30:29 this script does it fr. dev to kannanda http://www.sharma-home.net/~adsharma/projects/kn_IN/localify.py creating one isn't a rocket-science 13:30:40 malayalam, kannada also developed , gujrati & punjabi untested 13:32:50 why has kannada not been submitted to glibc..only thing it lacks is the collation seq. 13:33:33 ok, but it can still go in 13:33:55 considering the slow release cycles of glibc 13:34:13 if you have it included now - it may turn up in production ones next yr 13:36:52 ok. how to do it 13:37:24 we cld have done it last october. anyway, it was ready. & no changes have been made. 13:39:51 i too donno - will chek that out 13:40:58 Ok there are three issues here 1) getting the proper sort order (from diff sources) 13:41:01 other languages cld use this script: http://www.sharma-home.net/~adsharma/projects/kn_IN/localify.py 13:41:40 2: writing the collation data in actual form required ( in LANG_IN file ) 13:42:07 3: Testing 13:43:42 pamri: actually its a easy job once you write down the data on paper 13:44:29 the way the month names & weekday names differ slightly in diff langs 13:44:50 pamri: does kannada locale has collation seq now? 13:45:49 nopr 13:46:24 actually, there are diff sorting orders in kannada. the same confusion that happened in font-encoding has also happened here. 13:47:06 the kgp collation seq seems to be sound. 13:48:37 pamri: then can ya create a unicode seq of it 13:48:58 kind of this way http://www.unicode.org/charts/collation/chart_Bengali.html 13:49:17 that will serve as startin pt for it 13:50:32 mary: can ya chek up with hindi guys there if this is good enough http://www.unicode.org/charts/collation/chart_Devanagari.html 13:52:19 shdn't consonants with halanth be first. 13:54:30 --- You are now known as karunakarLUNCH 13:54:41 will be back in 10mins 14:25:15 --- You are now known as karunakar 14:25:33 --> ding0 (~ding0@216.236.98.130) has joined #indlinux 14:25:48 karunakar: none of them are around .. will check and let you know.. 14:26:20 --- ding0 is now known as ding0-Away 14:27:45 mary: ok 14:33:38 --> goli (~sunil@203.145.183.210) has joined #indlinux 14:37:22 hi mary and karunakar! 14:37:33 here is a small update 14:38:40 a friend of joseph koshy called "Cherry" has joined MAHITI as a volunteer. he is working on a package management system so that we will be able to deploy indic solutions with multiple dependencies more easily. 14:39:12 i am not sure i completely understand this project or its relevance to the cause of indic computing. 14:39:26 but koshy thinks that this is a non negotiable and i am happy to go with what he says at the moment. 14:39:32 end of small update 14:40:13 goli: package management system ? 14:40:48 goli: whats it based on? 14:40:53 mary: not package mgt. but on freebsd, he is using the ports system to deliver indic packages. 14:41:02 ok 14:43:18 see this: http://sourceforge.net/projects/indic-computing 14:43:18 karunakar: not sure. pramod will have more details. 14:43:31 karunakar: i guess shell programming plus some python scripts 14:45:12 from the rel notes: "A FreeBSD 'port' that installs the necessary tools to process Indic Computing documentation. " 14:47:01 goli: so its not nything debian based or linux based ? 14:47:32 we can use this s/w http://abeni.sourceforge.net/ to create a similar system for gentoo. 14:48:05 karunakar: i am sure the toolkit/chain will be generic .. except that the primary development platform would be freebsd .. 14:48:58 mary: yup, correct. testing will be done on debian & hurd. :-) 14:50:25 okie.. for starters .. regarding the collation sequences and all that 14:50:37 can we start with introductory links / texts .. and issues (links) and all... 14:50:37 i will compile them. 14:50:48 <-- pamri has quit (Client Exiting) 14:51:20 ya, was writing something on it will send that 14:52:46 for starters, http://oss.software.ibm.com/icu/userguide/Collate_Intro.html 14:53:49 --- mary is now known as mary_brb 14:57:11 more so the format of locale file : http://www.opengroup.org/onlinepubs/7908799/xbd/locale.html 15:00:50 --- mary_brb is now known as mary 15:07:47 [OT] is anyone able to access indic-computing cvs ? 15:18:51 anonymous nyone can, write access only to developers 15:19:18 karunakar: grrr.. i am unable to get anything via cvs.. wondering why .. 15:20:43 mary: contact jkoshy 15:23:14 more on collation stuff like currently available locales etc ? .. guys wakie wakie 15:24:21 mary: you can do a anonymous check out 15:24:55 mary: there is one pdf doc which describes it all - i have it with me but cant get its online link 15:25:24 basically the locale definition guidelines / specification 15:25:38 a confusing doc at that 15:32:58 --> Bunny (~Bunny@216.236.98.130) has joined #indlinux 15:34:09 karunakar: is it the one from sun ? 15:34:30 --- karunakar has changed the topic to: INDEV meet - 12-6PM - Indic locale + collation 15:34:36 nope, some ISO doc 15:35:34 --> alok_kumar (~chatzilla@61.1.144.145) has joined #indlinux 15:35:44 (.8M$G 15:36:09 hello indlinux 15:36:17 Alok from Bangalore 15:37:05 its this http://wwwold.dkuug.dk/jtc1/sc22/wg20/docs/n897-14652w25.pdf 15:37:24 from http://wwwold.dkuug.dk/jtc1/sc22/wg20/docs/projects.html 15:37:44 Are you folks able to see &G5(>0@? 15:38:00 yeah 15:38:16 alok_kumar: welcome to indlinux 15:38:47 Thanks Bunny. 15:39:54 linux locales at http://www.openi18n.org/download/docs/locale/index.htm 15:40:46 Hi karunakar 15:40:51 alok_kumar: hi 15:40:53 Bunny: hi 15:41:15 karunakar: hi 15:41:18 the discussion are for completing the Indic locale definitions 15:41:38 kok_IN ? They have 3 letter codes as well? 15:41:45 some locales are defined in glibc, but none has sort order ( collation data ) 15:41:53 yeah 15:41:58 Ok, great. 15:42:21 We need to compile locale data for languages not yet available 15:42:23 how many hours of work is it? 15:42:28 and collation data for all langs 15:42:54 not much - all could be done in a week , weith one person looking into each lang 15:43:12 Collations can get quite complex 15:43:30 main work is compiling the locale data, basic data & then collation data 15:43:49 alok_kumar: yeah, but we need to start somewhere & have atleast he basic stuff 15:44:10 I was scared off by http://www.unicode.org/reports/tr10/tr10-8.html if that's what we would have to do for collation. 15:44:17 you cant do a sort on a hindi utf-8 file 15:44:42 karunakar: Correct. 15:45:28 Actually grep works on RH8, but I think it does a byte by byte comparision. 15:45:49 What's the work left for hi_IN.UTF-8? Is there a todo somewhere on http://indlinux.org ? 15:46:28 alok_kumar: writing that only :) 15:46:49 Oh 15:47:21 Do I have to be good at the language to do basic data (with out collation data)? I am planning to help for telugu (if no one else takes up the task) 15:47:26 i do hav modified file - from which rough gujarati & punjabi ones were made 15:47:44 Bunny: was just abt to ask you for same 15:50:46 Perhaps I could help with sa_IN.UTF-8 15:52:21 --- ding0-Away is now known as ding0 15:52:36 hi guys 15:52:54 Hi ding0 15:53:16 I'm going off to login from the linux side. See you in a couple of minutes. 15:53:17 alok_kumar: ya 15:53:24 <-- alok_kumar has quit (Read error: 104 (Connection reset by peer)) 15:56:58 --> alok_kumar (~chatzilla@61.1.144.145) has joined #indlinux 15:57:21 Back 15:58:59 Say, isn't Milan supposed to have the hi_IN.UTF-8 locale? 15:59:15 it has 15:59:31 locale -a|grep IN gives me ar, en, mr, ta and te but not hi. Funny 15:59:51 Maybe that's why I'm getting the gkb error. 15:59:58 hi, ta, te, mr are already available 16:00:09 ok, maybe 16:05:23 hi_IN was already around(timestamped 13 May in /usr/lib/locale) so I think inslocale.sh skipped localedef for hi_IN. 16:05:27 hmm, a bug in the install script ! 16:05:32 I installed Milan on 10 Jun 16:05:40 yeah 16:06:02 Does redhat 8 ship with hi_IN? Maybe that's why it was skipped. 16:07:14 redhat does I guess 16:07:48 yeah , it does since rh 7.3 , but only gets installed is you select it while installation (or select all langs) 16:08:56 so in the script it checks if its already installed , if not its built 16:19:19 --- ding0 is now known as ding0-Away 16:23:24 Reinstalled Milan, no hi_IN is also there. 16:23:39 s/no /now / 16:29:27 ok 16:29:42 all: got some more clue to collation stuff 16:32:28 in http://www.openi18n.org/download/ in locale/src it has LC_COLLATE.ISO14651 16:32:39 karunakar: are you asking us or trying to say something ? :-) 16:32:42 collation table which include indic stuff 16:32:52 saying something 16:33:40 a iso14652_t1 exists /usr/share/i18n/locales which doesnt include indic ranges 16:34:03 & all indic locales at present copy this file for LC_COLLATE categ. 16:37:00 <-- alok_kumar has quit (Ping timeout: 600 seconds) 16:37:02 this data seems to come from UCA & ibm univ locales 16:52:13 i am extracting the indic ones from the LC_COLLATE file 16:52:29 we will have to chek if they are proper & acceptable 17:01:36 nyone listening? 17:03:12 sorry... was trying to google for more info and all 17:05:26 karnunakar: what exactly do you need from the hindi guys .. regarding collation ? 17:06:01 read abv, there is some collation info, need to verify if its ok & acceptable 17:06:26 if it conforms to any standard available 17:07:20 or find if we have any standard for hindi on collation 17:07:50 am i being to vague? 17:09:35 karunakar: how do i convey it to someone who doesnt really understand the technicalities .. thats all 17:09:42 karunakar: i will try the sorting angle to explain things 17:09:53 karunakar: will be back in 15 mins 17:10:02 --- mary is now known as mary_away 17:18:55 mary_away: ok i will take care of that - will list what information is needed & the simplest form to give in 17:33:24 --> upendra (~upn@202.153.33.158) has joined #indlinux 17:35:19 <-- upendra (~upn@202.153.33.158) has left #indlinux 17:53:20 --- mary_away is now known as mary 18:02:16 anybody home .. 18:02:28 current devel directions for indlinux ? 18:02:45 anything about the new developments in Qt ? 18:14:59 yup, installing debian 18:15:21 --> pamri (~pamri@PPP-219-65-143-155.bng.vsnl.net.in) has joined #indlinux 18:15:24 locales is a long pendng issue - need to get that fixed 18:17:54 pamri: got one table having UCA data 18:18:45 ok. update me more. 18:19:16 anyways ... 18:19:28 gotta leave .. 18:19:28 ciao 18:19:49 http://www.openi18n.org/download/ in locale/src it has LC_COLLATE.ISO14651 18:21:36 <-- mary (~mary@203.200.123.2) has left #indlinux 18:25:20 ok. does it contain collation info for indic languages too. 18:26:50 yeah - all those in unicode 18:27:47 ok. we shd strip info for each language & put it on indlinux 18:28:01 yeah almost done that 18:28:34 what i will do is add them in locale def files & build & test 18:29:21 ok. also put them seperately too. 18:31:12 yeah 18:40:01 <-- goli has quit (Client Exiting) 18:58:14 <-- pamri has quit (Ping timeout: 600 seconds) 18:58:21 I have put some locale stuff at http://www.indlinux.org/downloads/locale/ 18:58:47 Will have the docs & howto ready by tuesday