Hi,
I would like to filter all the sentence in my dataset which is not in English language. I already take a look to remove for non-ASCII character but sometime the character for some language are the same with English.
Do you know any Perl library or dictionary to filter non-english word.
any...
Hi all,
I would like to extract the message that contain questions indicators like question mark and string 'what, where, who, how' at the beginning of the sentence.
herewith my code
use strict;
#use warnings;
use diagnostics;
open( INFILE, "tweets.data" )
or die("Can not open input...
Hi all,
i was wondering if anyone has any experience using Twitter API? Do you have any idea how i can crawl the Twitter messages using Twitter API with perl?
Actually i need to crawl the tweets and replies to the respective tweet(GET statuses/show/:id). This data i will use it to do some data...
Hi,
i have data as below:
-DATA-
tweetid, workerid
10115, user1
10115, user2
10190, user1
10190, user2
10193, user3
10320, user2
10320, user1
I have no idea, how i can write a code to transform this data to the 2D matrix.
My desire output is:
tweetid user1, user2, user3
10115 1...
Hi,
i found a lot of code to remove duplicate data. in my case, i would like to print only none duplicate data. any suggestion how to do it?
example data.
aaaaaaaaaaa
bbbbbbbbbbb
bbbbbbbbbbb
ccccccccccc
ddddddddddd
ddddddddddd
the desire output:
aaaaaaaaaaa
ccccccccccc
any help is much...
Hi,
i have data in csv file. i want to split it base on line and column.
my code so far:
#!/usr/bin/perl use strict;
use strict;
use warnings;
use Text::CSV_XS;
# Store our CSV file name
my $file = 'input.csv';
open( CSV_XS, '<', $file )
or die( 'Unable to open csv file ', $file...
Hi,
i want to split the sentences into words and count its frequency. i have tried using the code below but it doesn't worked. This code actually from the text book.
----Data----
If milk goes bad if not refrigerated
Jones in rush to contribute for Bears (AP): Kevin Jones burst through the...
Hi,
i have 1200 line of data as below:
user1 date time @Glebe2037 Ha! but of course.....
user2 date time @aerynea It mostly missed Fort Collins
user3 date time Lansons is 20 years old this year
....
current output:
"@Glebe2037 Ha! but of course.....","10001","@aerynea It mostly missed Fort...
Hi all,
I have a problem to count the majority selectedresult for each pair of string (sysA,sysB). for each query, i have 3 different combination of string comparision.
* comparison("lucene-std-rel","lucene-noLen-rr");
* comparison("lucene-noLen-rr","lucene-std-rel");
*...
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.