Produce code for Tidy Sentiment Analysis in R
- or -
Post a project like this2126
£40(approx. $50)
- Posted:
- Proposals: 5
- Remote
- #2041837
- Awarded
Description
Experience Level: Intermediate
Hi all,
I am working on a natural language processing project and want to use the Tidy package from R. I would like some help to create an R script to run some natural language processing tools over a set of documents that I have so i can compare results and build some models.
The code will need to
1/ be able to be run over a series of documents provided (so I can compare the results between documents)
2/ perform tokenisation of the document
3/ clean up (removing uninteresting words) and create a "bag of words"
4/ run some basic descriptive statistics:
*word count
*run an analysis of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD)
*run some indexes for "readability" (e.g., Flesch, Simple Measure of Gobbledygook, LIX, Dale-Chall)
5/ run three sentiment lexicons: AFINN, Bing and NRC and make a table of results for comparison
6/ compare the match ratio of each lexicon to the document text (to see which is the best lexicon to use) eg results in a table or graph
if you can build this code using an example dataset available in R so I can re-run it myself and play around with it.
I am working on a natural language processing project and want to use the Tidy package from R. I would like some help to create an R script to run some natural language processing tools over a set of documents that I have so i can compare results and build some models.
The code will need to
1/ be able to be run over a series of documents provided (so I can compare the results between documents)
2/ perform tokenisation of the document
3/ clean up (removing uninteresting words) and create a "bag of words"
4/ run some basic descriptive statistics:
*word count
*run an analysis of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD)
*run some indexes for "readability" (e.g., Flesch, Simple Measure of Gobbledygook, LIX, Dale-Chall)
5/ run three sentiment lexicons: AFINN, Bing and NRC and make a table of results for comparison
6/ compare the match ratio of each lexicon to the document text (to see which is the best lexicon to use) eg results in a table or graph
if you can build this code using an example dataset available in R so I can re-run it myself and play around with it.
Jacqueline H.
100% (12)Projects Completed
8
Freelancers worked with
8
Projects awarded
31%
Last project
29 May 2022
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies