Mailing List Archive: Lucene: Java-User

Free software for language detection

 

 



Lisheng.Zhang at BroadVision

Mar 27, 2009, 9:54 AM

Post #1 of 6 (2547 views)
Free software for language detection

Hi,

Are you aware of any free software for language detection (given some
text, determine whether it is French, Japanese, and so on)? I saw Bob
Carpenter's previous mail, which explained the principle nicely, but I
could not locate any free tools.

Thanks very much for your help, Lisheng

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe [at] lucene
For additional commands, e-mail: java-user-help [at] lucene
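
The question refers to the principle behind language detection without restating it. One common family of approaches compares the character n-grams of the input against per-language profiles; whether that is exactly what the earlier mail described is an assumption. The sketch below is a deliberately simplified trigram-overlap scorer, not the algorithm or API of any tool named in this thread. The class name, method names, and training sentences are all hypothetical, and a real detector would need far larger training samples.

```java
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical helper for illustration only; not the API of lc4j, Nutch, or Lucene.
class TrigramLanguageGuesser {
    // One set of known character trigrams per language label.
    private final Map<String, Set<String>> profiles = new LinkedHashMap<>();

    // Lowercase the text, collapse runs of non-letters into single spaces,
    // pad with spaces, and collect every window of three characters.
    static Set<String> trigrams(String text) {
        String normalized = " "
                + text.toLowerCase(Locale.ROOT).replaceAll("[^\\p{L}]+", " ").trim()
                + " ";
        Set<String> grams = new HashSet<>();
        for (int i = 0; i + 3 <= normalized.length(); i++) {
            grams.add(normalized.substring(i, i + 3));
        }
        return grams;
    }

    // Add the trigrams of a sample text to the profile of the given language.
    public void train(String language, String sample) {
        profiles.computeIfAbsent(language, k -> new HashSet<>()).addAll(trigrams(sample));
    }

    // Return the language whose profile shares the most distinct trigrams with
    // the input, or null if nothing has been trained. Ties go to the language
    // that was trained first.
    public String guess(String text) {
        Set<String> input = trigrams(text);
        String best = null;
        int bestScore = -1;
        for (Map.Entry<String, Set<String>> entry : profiles.entrySet()) {
            int score = 0;
            for (String gram : input) {
                if (entry.getValue().contains(gram)) {
                    score++;
                }
            }
            if (score > bestScore) {
                bestScore = score;
                best = entry.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        TrigramLanguageGuesser guesser = new TrigramLanguageGuesser();
        // Toy training data (assumption): one sentence per language.
        guesser.train("en", "the quick brown fox jumps over the lazy dog and then runs into the garden");
        guesser.train("fr", "le renard brun saute par dessus le chien paresseux et court dans le jardin");
        System.out.println(guesser.guess("the dog runs over the fox"));     // prints "en"
        System.out.println(guesser.guess("le chien court dans le jardin")); // prints "fr"
    }
}
```

Counting shared trigrams keeps the example short; rank-based or probabilistic scoring over n-gram frequencies is the more usual choice and copes better with short or noisy input.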


jochen at scoutlabs

Mar 27, 2009, 10:03 AM

Post #2 of 6 (2491 views)
Re: Free software for language detection

Lisheng,

Here's a package you could take a look at. I have used it in the past and it
worked reasonably well. Let me know what else you find and how it works for
you.

http://www.olivo.net/software/lc4j/

Good luck!
Jochen Frey



--
Jochen Frey . CTO
Scout Labs
415.366.0450


Lisheng.Zhang at BroadVision

Mar 27, 2009, 10:22 AM

Post #3 of 6 (2494 views)
RE: Free software for language detection

Thanks very much!



baleksan at gmail

Mar 27, 2009, 10:35 AM

Post #4 of 6 (2486 views)
Re: Free software for language detection

Lisheng,

You might want to look at the Nutch LanguageID plugin
(http://wiki.apache.org/nutch/LanguageIdentifier) too.

Cheers,
Boris




--
Thanks,

Boris



karl.wettin at gmail

Mar 29, 2009, 2:02 AM

Post #5 of 6 (2441 views)
Re: Free software for language detection

You can also look at https://issues.apache.org/jira/browse/LUCENE-1039,
which I've successfully used for language detection of user queries.

karl



viniciusccarvalho at gmail

Jul 6, 2009, 5:37 PM

Post #6 of 6 (2044 views)
Re: Free software for language detection

You can also check Google's language API. I'm writing a blog entry on this
and hope to post it tomorrow:

http://code.google.com/apis/ajaxlanguage/documentation/reference.html

Here is a snippet of it working (using JSON.simple to decode the response:
http://code.google.com/p/json-simple/):

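import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.UnsupportedEncodingException;
import java.net.MalformedURLException;
import java.net.URL;
import java.net.URLEncoder;

import org.json.simple.JSONObject;
import org.json.simple.JSONValue;

try {
    // URL-encode the text whose language should be detected
    String s = URLEncoder.encode("Você sabe como implementar um serviço", "UTF-8");
    URL url = new URL(
            "http://ajax.googleapis.com/ajax/services/language/detect?v=1.0&q=" + s);
    // Read the whole JSON response into a buffer
    BufferedReader in = new BufferedReader(
            new InputStreamReader(url.openStream(), "UTF-8"));
    String str;
    StringBuilder buffer = new StringBuilder();
    while ((str = in.readLine()) != null) {
        buffer.append(str);
    }
    in.close();
    // JSONValue.parse returns Object, so cast before calling get()
    JSONObject response = (JSONObject) JSONValue.parse(buffer.toString());
    JSONObject obj = (JSONObject) response.get("responseData");
    System.out.println(obj.get("language"));
} catch (UnsupportedEncodingException e) {
    e.printStackTrace();
} catch (MalformedURLException e) {
    e.printStackTrace();
} catch (IOException e) {
    e.printStackTrace();
}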



--
The intuitive mind is a sacred gift and the
rational mind is a faithful servant. We have
created a society that honors the servant and
has forgotten the gift.
