Blogspot - jmgomezhidalgo.blogspot.com - Nihil Obstat
General Information:
Latest News:
Data Mining for Political Elections, and Isaac Asimov 23 Aug 2013 | 02:47 pm
Using Data Mining, Data Science and Big Data is cool in political elections, and in political decision-making. Well, not sure if cool, but it is a trending topic in Data Science in the latest years. ...
More Clever Tokenization of Spanish Text in Social Networks 27 Jul 2013 | 08:06 pm
Text written by users in Social Networks is noisy: emoticons, chat codes, typos, grammar mistakes, and moreover, explicit noise created by users as a style, trend or fashion. Consider the next utteran...
Negobot is in the news! 22 Jul 2013 | 07:54 pm
... And I must say, it is quite popular out there. Negobot is a conversational agent posing as a 14 year old girl, intended to detecting paedophilic intentions and adapting to them. Negobot is based ...
Performance Analysis of N-Gram Tokenizer in WEKA 8 Jul 2013 | 11:47 am
The goal of this post is to analyze the WEKA class NGramTokenizer in terms of performance, as it depends on the complexity of the regular expression used during the tokenization step. There is a poten...
Chat or What: Approaching Text Normalization in Chats and Social Networks 5 Jul 2013 | 12:45 am
It is not strange that, with the overload of user-generated content, there is an increasing interest on processing chat/SMS-like language. Social Networks, virtual worlds, MMORPGs and chat rooms are p...
Sample Code for Text Indexing with WEKA 23 Jun 2013 | 04:18 am
Following the example in which I demonstrated how to develop your own classifier in Java based on WEKA, I propose an additional example on how to index a collection of texts in you Java code. This pos...
Comparing baselines of keyword and learning based sentiment analysis 19 Jun 2013 | 08:13 pm
In my previous post, I have presented a simple example of using WEKA for Sentiment Analysis (or Opinion Mining). As most of my blog posts on text mining with WEKA, I approach interesting, hot or easy ...
Baseline Sentiment Analysis with WEKA 11 Jun 2013 | 04:21 pm
Sentiment Analysis (and/or Opinion Mining) is one of the hottest topics in Natural Language Processing nowadays. The task, defined in a simplistic way, consists of determining the polarity of a text u...
Baseline Sentiment Analysis with WEKA 11 Jun 2013 | 04:21 pm
Sentiment Analysis (and/or Opinion Mining) is one of the hottest topics in Natural Language Processing nowadays. The task, defined in a simplistic way, consists of determining the polarity of a text u...
Compilation of Resources for Text-based Age Detection 23 May 2013 | 09:07 pm
Text-based age detection consists of estimate the age of a user according to the kind of texts he/she writes. This task is atracting some attention in the latest years, as for instance it promises to ...