Produce code for Tidy Sentiment Analysis in R

- or -

Post a project like this

Ends in (days)

2126

Fixed Price

£40(approx. $50)

Posted: 6 years ago
Proposals: 5
Remote
#2041837
Awarded

have already sent a proposal.

Description

Experience Level: Intermediate

Hi all,
I am working on a natural language processing project and want to use the Tidy package from R. I would like some help to create an R script to run some natural language processing tools over a set of documents that I have so i can compare results and build some models.

The code will need to
1/ be able to be run over a series of documents provided (so I can compare the results between documents)
2/ perform tokenisation of the document
3/ clean up (removing uninteresting words) and create a "bag of words"
4/ run some basic descriptive statistics:
*word count
*run an analysis of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD)
*run some indexes for "readability" (e.g., Flesch, Simple Measure of Gobbledygook, LIX, Dale-Chall)
5/ run three sentiment lexicons: AFINN, Bing and NRC and make a table of results for comparison
6/ compare the match ratio of each lexicon to the document text (to see which is the best lexicon to use) eg results in a table or graph
if you can build this code using an example dataset available in R so I can re-run it myself and play around with it.

New Proposal

Clarification Board Ask a Question

There are no clarification messages.

Description

Jacqueline H.

New Proposal

Clarification Board Ask a Question