楼主: oliyiyi
1392 4

etcML Promises to Make Text Classification Easy [推广有奖]

版主

泰斗

0%

还不是VIP/贵宾

-

TA的文库  其他...

计量文库

威望
7
论坛币
271951 个
通用积分
31269.3519
学术水平
1435 点
热心指数
1554 点
信用等级
1345 点
经验
383775 点
帖子
9598
精华
66
在线时间
5468 小时
注册时间
2007-5-21
最后登录
2024-4-18

初级学术勋章 初级热心勋章 初级信用勋章 中级信用勋章 中级学术勋章 中级热心勋章 高级热心勋章 高级学术勋章 高级信用勋章 特级热心勋章 特级学术勋章 特级信用勋章

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

etcML is a new and free tool that allows even novice user use the power of machine learning and text classification.

By Ajay Ohri, Mar 5, 2014.

etcML.com is a new website that helps bring the power of machine learning to classifying text even to users who are not proficient in machine learning and text classification. It does so by creating an easy to use interface that helps a user upload data, create a classifier, and apply the classifier to predict. You can even use it to classify tweets for sentiment using an inbuilt classifier and integration with Twitter search.

The  website is backed by a young team led by Richard Socher out of Stanford with faculty adviser as the famed professor Andrew Ng, creator of Coursera. The website is free and is currently at the status of a research project at Stanford. The same team has been behind the creation of NASENTwhich is an improved version of sentiment analysis algorithms.

There are other websites that promise to help users with easy machine learning classification includinggoogle prediction api and   bigml.com, however the ease of usage helps make this a promising candidate to watch. This could potentially help widen the audience for the direct end users of machine learning classification (like marketing and product strategy) teams while creating a challenge for existing social media analysis tools like Radian6.

The basic process of text classification using etcml.com is as follows-

Step 1- choose among

  • upload dataset of text data, OR

  • choose existing text dataset,  OR

  • classify tweets (publicly available text data)   


Step 2 - choose among

  • existing classifiers models OR

  • your own classifier models


Step 3 predict the dataset using the chosen classifier.

You can then download the dataset with both the input  text and the output classification labels.

An additional point is while the interface makes it very easy to explain classification (thus being of some use to the academic world), it also allows you to create public and private datasets as well as public and private classifiers.

It could thus potentially bring in a marketplace of both datasets and predictive models, both of which have been tried and tested without resounding success in separate formats.

For a sentiment analysis of tweets, it shows top positive , top negative and top neutral tweets, a graphical description of when the tweets happened and the ability to change the classification manually.   

We noticed some caveats- while the tool allows manual intervention to change the labels, there is no way to feed this back automatically into the classifier that was originally used. This of course applies to the traditional text mining challenges of classifying sarcasm, slang or even double negatives. Another drawback is the lack of model diagnostics- especially the confusion matrix for the lift curve, Also needed is perhaps a paid version for tweets from a longer period (since Twitter sells the API data through resellers like Datasift). We noticed some API information and documentation herebut better bindings especially for Python, and R communities can only enhance the usage .

Overall better interfaces is something that should come to the data science world, and etcML.com is a great effort to make this possible.


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:promises Promise cation Mises ATION interface learning creating machine website

缺少币币的网友请访问有奖回帖集合
https://bbs.pinggu.org/thread-3990750-1-1.html
沙发
william9225 学生认证  发表于 2016-8-13 22:50:30 来自手机 |只看作者 |坛友微信交流群
谢谢分享

使用道具

藤椅
Kamize 学生认证  发表于 2016-9-1 00:58:08 来自手机 |只看作者 |坛友微信交流群
oliyiyi 发表于 2016-8-13 18:43
etcML is a new and free tool that allows even novice user use the power of machine learning and text ...
谢谢楼主
已有 1 人评分经验 收起 理由
oliyiyi + 20 精彩帖子

总评分: 经验 + 20   查看全部评分

使用道具

板凳
Kamize 学生认证  发表于 2016-9-2 00:35:55 来自手机 |只看作者 |坛友微信交流群
oliyiyi 发表于 2016-8-13 18:43
etcML is a new and free tool that allows even novice user use the power of machine learning and text ...
谢谢楼主的资料
已有 1 人评分经验 收起 理由
oliyiyi + 20 精彩帖子

总评分: 经验 + 20   查看全部评分

使用道具

报纸
Kamize 学生认证  发表于 2016-9-2 22:53:13 来自手机 |只看作者 |坛友微信交流群
oliyiyi 发表于 2016-8-13 18:43
etcML is a new and free tool that allows even novice user use the power of machine learning and text ...
谢谢楼主
已有 1 人评分论坛币 收起 理由
oliyiyi + 20 精彩帖子

总评分: 论坛币 + 20   查看全部评分

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-20 00:38