Stata Projects
Looking for freelance Stata jobs and project work? PeoplePerHour has you covered.
I need a research assistant
The ideal candidate will possess strong analytical skills, attention to detail, and a passion for learning.
Key Responsibilities:
• Assist in literature reviews and background research on relevant topics.
• Screen search results in Covidence based on predefined criteria to select studies for inclusion in the review.
• Extract data from selected studies using standardized forms and protocols.
• Verify the accuracy and completeness of extracted data, resolving discrepancies through discussion with team members.
• Summarize and synthesize findings from included studies, highlighting key results and trends.
• Participate in team meetings, providing updates on progress, challenges, and insights gained during the review process.
• Perform other duties as assigned by the research supervisor or principal investigator.
Qualifications:
• Master's graduate, preferably in a relevant field such as public health, health economics, or a related discipline.
• Strong academic record with coursework in research methods, statistics, and quantitative data analysis, preferably with SAS or STATA.
• Excellent written and verbal communication skills.
• Proficiency in Microsoft Office Suite (Word, Excel, PowerPoint) and research software/tools.
• Ability to work independently and collaboratively in a fast-paced environment.
• Detail-oriented with strong organizational and time management skills.
• Prior research experience in systematic reviews, meta-analysis, and manuscript publication is a plus.
Duration and Compensation:
• This is a part-time position, between 10 and 20 hours per month. Compensation will be commensurate with experience and qualifications.
Application Process: Interested candidates should indicate their interest in the position and relevant experience.
Deadline: Applications will be reviewed on a rolling basis until the position is filled. Early applications are encouraged.
12 days ago · 34 proposals · Remote
Past "Stata" Projects
Data visualisation in R, STATA, or Tableau
I need a creative and experienced data scientist who can develop neat figures from a dataset that has already been analysed and from which conclusions have been drawn. The data are time estimates for several activities in a system, formatted as a .CSV file. You would need to run regressions and plot distributions and point estimates, but the form of analysis does not need to be discussed. Knowledge of standard data science methods and programming is needed.
Basic Analytics using STATA
Hi all, I need an expert in statistical analysis to run through the attached document and provide answers, using STATA.
Stata analysis
I need help with merging my data and doing propensity score matching in Stata. I need to match treated and nontreated units that are in the same industry/country/sector. Then, if multiple matches are still possible, I must choose the best match based on other data. I need help asap. We can discuss the budget once we have discussed how much you can help with; it could be quite a big project, or just helping me create the first matches.
To run random forest algorithm in R, Python or Stata
Support is needed to run a random forest algorithm in R, Python or a Stata package. It is expected that this will take no more than an hour.
Merge datasets using common and uncommon/time-varying variables
I am trying to merge multiple datasets based on subject IDs and dates of measurement. The subject IDs are common across datasets (but some datasets may contain subject IDs that others do not). The dates of measurement are uncommon, i.e. they differ between datasets. I am trying to match entries for the same subjects between datasets that were recorded within 14 days of each other. Please see the uploaded samples for an example of a pair of datasets I am trying to merge (Dataset 1.txt and Dataset 2.txt) as well as the end result (Merged Dataset.txt). I assume that I need to match two datasets at a time, as is customary in data matching functions such as merge in Stata or R. In the merged dataset, t1 reflects the date of measurement in dataset 1; t2 reflects the date of measurement in dataset 2; tdiff reflects the difference in days between t1 and t2. There should be no values of tdiff with |tdiff| > 14. The periods reflect missing values. As you can see, only those entries that were recorded within +/-14 days of each other for a given subject have been merged on a 1:1 basis. There is an instance where two entries in dataset 2 fall within 14 days of one entry in dataset 1 for subject ID 1. In cases like this, I would like to take the pair of entries that are closest in date (e.g., out of 26/03/2019 and 29/03/2019 in dataset 2, the latter is closer to 30/03/2019 in dataset 1). There may be more than two entries in one dataset that fall within 14 days of an entry in the other dataset; again I would be looking to save the pair of entries that are closest in time. There are some subjects that are not included in the merged dataset as they are not included in both datasets (e.g., subject 4 in dataset 1 and subject 8 in dataset 2). All variables including the dates of measurement (t) from each dataset have been carried over (e.g., x1-x3 in dataset 1 and y1 and x4). Each dataset has a different number of variables to merge.
There are 12 datasets to merge in total, which I envision doing in pairs. Also, subjects vary in how many data entries they have recorded within a dataset (e.g., subject 1 has 5 entries whereas subject 7 only has 2) and between datasets (e.g., subject 2 has three entries recorded in dataset 1, but only one entry in dataset 2). I have had a few thoughts so far but feel lost in how to implement something like this: The data is currently in long format but there is no reason why we cannot transpose to wide format. It might be easier if a subject's data was on one row? Ideally we would have a standardized time variable that could be used to match measurements across datasets. I have thought of creating a variable that reflects the difference between an absolute starting time point and the date for a given entry, and then converting this measure into months, but we still have the problem that the time variable is not the same across datasets. As for programs for implementation, I am using Stata, R and Excel for data management and analysis. Ideally it could be done in Stata but if not, in R. I am happy to negotiate prices. I am looking to get this done asap.
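The matching rule described in this posting (same subject, dates within 14 days, closest pair kept, each entry used at most once) could be sketched as follows. This is only an illustration in Python rather than Stata or R, using the standard library; the function name `match_within_window`, the tuple layout, the sign convention tdiff = t1 - t2, and the greedy closest-first strategy are all assumptions, not something the posting specifies.

```python
from datetime import date


def match_within_window(rows1, rows2, window_days=14):
    """Pair entries from two datasets by subject ID and closest date.

    rows1, rows2: lists of (subject_id, measurement_date) tuples.
    Returns (subject_id, t1, t2, tdiff) tuples with |tdiff| <= window_days,
    where tdiff = t1 - t2 in days (sign convention is an assumption).
    """
    # Collect every same-subject pair that falls inside the window.
    candidates = []
    for i, (id1, t1) in enumerate(rows1):
        for j, (id2, t2) in enumerate(rows2):
            if id1 != id2:
                continue
            tdiff = (t1 - t2).days
            if abs(tdiff) <= window_days:
                candidates.append((abs(tdiff), i, j, id1, t1, t2, tdiff))

    # Greedy 1:1 matching (assumption): take the closest pairs first and
    # skip any pair whose entry on either side is already used.
    candidates.sort(key=lambda c: (c[0], c[1], c[2]))
    used1, used2, matched = set(), set(), []
    for _, i, j, subject, t1, t2, tdiff in candidates:
        if i in used1 or j in used2:
            continue
        used1.add(i)
        used2.add(j)
        matched.append((subject, t1, t2, tdiff))

    matched.sort(key=lambda m: (m[0], m[1]))
    return matched


# Dates for subject 1 come from the posting; subject 2's dates are made up.
dataset1 = [(1, date(2019, 3, 30)), (4, date(2019, 1, 1)), (2, date(2019, 5, 1))]
dataset2 = [(1, date(2019, 3, 26)), (1, date(2019, 3, 29)),
            (8, date(2019, 1, 2)), (2, date(2019, 5, 21))]
merged = match_within_window(dataset1, dataset2)
```

Here subject 1's entry on 30/03/2019 pairs with 29/03/2019 rather than 26/03/2019, and subjects 4 and 8 drop out because they appear in only one dataset. For 12 datasets the same function would be applied pair by pair, carrying the other variables along with each row.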
pre-funded
Help with a quantitative data analysis using STATA
I am looking for someone who can help with a quantitative analysis on STATA (must be in STATA). This project must be completed by Friday.
STATA code
I need someone to write a program for me to estimate the time period for survival analysis and readmission as explained in this excel file.
SAS to STATA
I need someone who is proficient in SAS and STATA. I have code in SAS which I want to rewrite in STATA. Thanks
Data analysis & interpretation of 4 papers-500 words each paper
This work consists of 4 papers- Paper 1: read the below paper on environmental exposure and complete tasks 1 to 3 below. Task 1. Provide a summary of the data collected for this paper and why. Task 2. Comment on one result presented in the paper and the uncertainty in that result (e.g. confidence intervals, adjustment). Task 3. Comment critically on the take home message of the paper. Reference: Van Dijk-Wesselius JE, Maas J, Hovinga D, et al. The impact of greening schoolyards on the appreciation, and physical, cognitive and social-emotional well-being of schoolchildren: A prospective intervention study. Landscape and Urban Planning 2018;180:15-26. https://www.sciencedirect.com/science/article/pii/S0169204618307369?via%3Dihub Paper 2: read online data for Graveyard write up analysis. St James Cemetery (http://www.stjamescemetery.co.uk/) in Liverpool holds the remains of nearly 58,000 people. The gravestones that remain in the park give fascinating insight into the lives of Liverpudlians since the 1700s. Information that may be obtained from the gravestones include names, year of death, month of death, age at death, gender, marital status. Courtesy of Dr Karyn Morrissey we have a small set of data recorded from these gravestones. I WILL SEND YOU AN EXCEL FILE. Import the data into Stata (or SPSS/other stats package if you prefer) and perform a descriptive analysis presented via a selection of appropriate statistics, tables and charts. Please present this information in the style of a brief ‘methods and results’ section of a journal paper, completing tasks 1 to 3: Task 1: provide description of how data have been managed (import to stats package etc.) Task 2: Calculation and presentation of appropriate statistics Task 3: Interpretation of results You should include description of how you managed the data, and produce statistics, tables or graphs. 
For example, you could consider some of the following statistics: • Average age of death • Average age of death by gender • Average age of death if age at death was greater than 16 • Average age of death if married • Average age of death if married and male • Most common surname • Most common first name for women • Most common first name for men • Numbers of deaths by age category. Remember the tight word count; you may not want to do all of these, and may want to detail other analyses not listed here. Where appropriate, averages could be presented with 95% confidence intervals, and other appropriate statistics could be calculated. Paper 3: Peer review: Qualitative examination of walking groups in deprived communities - Kassavou A, Turner A, French DP. The role of walkers’ needs and expectations in supporting maintenance of attendance at walking groups: a longitudinal multi-perspective study of walkers and walk group leaders. PLoS One. 2015 Mar 16;10(3):e0118754. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0118754 Read this paper (Kassavou et al, 2015) with a critical viewpoint, and write a peer review as if you were writing for a journal editor, using the guidance below. • Sense about Science: Peer Review: The nuts and bolts. https://senseaboutscience.org/activities/peer-review-the-nuts-and-bolts/ • Rowan, M., Huston, P., 1997. Qualitative research articles: information for authors and peer reviewers. CMAJ: Canadian Medical Association Journal 157, 1442-1446. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1228487/ The review should include the following: 1. Summarise in a short paragraph the main aims, why this required a qualitative research design, and the findings of the study. FULL LIST OF KEY AREAS TO COVER WILL BE PROVIDED. Paper 4: Map data analysis - Task 1. A brief background summary describing the life expectancy data, specifically where they come from and what they represent. Task 2. Description and discussion of the regional geographies of life expectancy and why they might look like they do. Task 3. Concluding remarks.
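For the Paper 2 statistics mentioned above (average age at death, overall and by group, with 95% confidence intervals), a minimal sketch might look like this. It is written in Python with the standard library rather than Stata; the field names `age` and `gender`, the function names, and the normal-approximation interval (z = 1.96) are assumptions, and the example records are invented purely for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def mean_with_ci(values, z=1.96):
    """Return (mean, lower, upper) using a normal-approximation 95% CI.

    Needs at least two values; for small samples a t-based interval
    would be wider than this approximation.
    """
    m = mean(values)
    half_width = z * stdev(values) / sqrt(len(values))
    return m, m - half_width, m + half_width


def mean_age_by_group(records, key):
    """Average age at death for each value of a grouping field."""
    groups = {}
    for rec in records:
        groups.setdefault(rec[key], []).append(rec["age"])
    return {group: mean(ages) for group, ages in groups.items()}


# Invented records, only to show the call pattern.
records = [
    {"gender": "F", "age": 60},
    {"gender": "F", "age": 80},
    {"gender": "M", "age": 70},
]
overall = mean_with_ci([rec["age"] for rec in records])
by_gender = mean_age_by_group(records, "gender")
```

Conditional averages such as "age at death greater than 16" or "married and male" would follow the same pattern after filtering the records first.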
Need a STATA guru 4 analysis Done by 10/31 midnight Eastern Time
Please see data and pdf for instructions. I need the .do file with your code and any comments needed to answer the questions. (Scroll down a bit when you open the pdf).
urgent
Controlled interrupted time series regression interpretation
A one hour consultation to discuss regression outputs from a controlled interrupted time series regression produced using STATA and the itsa command. MUST BE TODAY, IMMEDIATE DISCUSSION PLEASE.
I need a 2500 word professional report on econometrics in 12hrs
I need a professional REPORT in a Microsoft Word document of approximately 2,500 words in length about econometrics. NB: No Plagiarism. Marking Criteria: Research (uncovering of information) 10 Marks: Systematic identification and use of a range of academic textbooks, many of which will be econometric textbooks, although not exclusively. Marks are awarded for the range of econometric and academic textbooks utilized and suitably referenced. Additional marks will be awarded to students who quote and utilize from econometric journals, although it is expected that at this study level this may be the exception rather than the rule. No marks are awarded for information gleaned from the Internet. Analysis 30 Marks: Analysis here refers to the examination, interpretation, and discussion of the results of your empirical econometric investigation as it relates to the assessment brief, where sources of material are both of a theoretical and empirical nature. By results is meant not just regression results but tables of frequencies, and graphs produced using the data supplied for the coursework. Subject Knowledge 30 Marks: Understanding and application of subject knowledge and underlying principles. Note that a student may state relevant knowledge in answering the coursework question, but may not apply such knowledge sufficiently in the analysis: having knowledge does not guarantee analysis. In addition, the application of subject knowledge will require technical competency which is explained below. Technical Competency 15 Marks: This criterion is defined as the demonstration of the skills to enable the evaluation and execution of ideas appropriate to the assessment; e.g. well-thought-out specifications are required for this econometric exercise; appropriate mathematical equations; appropriate theoretical diagrams which help answer the coursework question; logical analysis and discussion using appropriate theoretical concepts for discursive or quantitative style assessments; use of mathematical techniques where appropriate in more quantitative assessments. All tables and diagrams must be constructed by yourself and not copied and pasted. Under this criterion, the Stata do file and Stata log file submitted will also be evaluated. The do file must contain a "date stamp" in its list of commands. The preceding is not an exhaustive list.
Learn the basics of STATA PROGRAMMING LANGUAGE (ECONOMETRICS)
Good afternoon, I am looking for someone to teach me the basics of STATA in relation to econometrics. I need your hourly rate, the version of STATA that you use and also how you would proceed to teach STATA online. Best regards, Ananda
Fluent in STATA programming
I have survey data to which I am trying to apply propensity score commands. I have the commands and the output, but I'm not very clear about the output. I need someone who can explain the output to me. Propensity score matching on survey data is slightly different from matching on non-survey data, so only people who are well versed in working with survey data need apply. I do not anticipate it taking more than 15 minutes to half an hour of your time. Thanks. Only freelancers with the following talent need apply: good understanding of propensity score matching on survey data.
Statistician needed for STATA & EXCEL regression analysis on P&L
Hi there! I have collected betting data on horse racing. My aim is to maximize my P&L. I have been placing bets on 14 horses. The horses do not race in the same races. There are variables that are active in the bets. Sometimes only a few variables are active and other times more are active. I am trying to establish the relationship between the variables and P&L. I think the analysis will find that certain variables, or combinations of certain variables, affect the likelihood of a profitable outcome. The study may find that when certain variables are active, the results are worse. This would also help me because I would know to avoid placing bets when those variables are active, therefore minimizing losses and increasing my P&L. I have placed around 1000 bets on 14 horses, and there are around 36 variables on each bet. Most of the time the bet only used around 10 of the 36 variables. I am aware the ratio of the number of variables to the number of bets is not ideal. However, there should still be enough data to draw some conclusions at the least. I have attached a small version of the file so you can see how I have recorded the bets. If the characteristic/variable is true for the bet, it is marked in the cell with a "1". Do you think you can help me find if there is a relationship between a certain combination of bet variables and profitability, on each horse? You may find there are variables that work best on every horse, and you may also find that each horse has an optimum set of variables. I do not know and would love to find out. Thanks for your help! Notes: Columns "V and W" are specific to each horse so will be exempt from the variable correlation study. I would like to see [separately] if there is a correlation between profitability and the P1 number being above OR below its median for each time period. I also want to do the same but use the mean instead of the median, and compare the two sets of results.
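The median-versus-mean comparison in the notes could be sketched roughly as below, in Python with the standard library. The function name `pnl_by_split`, the (P1, P&L) tuple layout, the choice to drop bets whose P1 equals the cutoff, and the four example bets are all assumptions made for illustration; the posting's per-time-period grouping is left out for brevity.

```python
from statistics import mean, median


def pnl_by_split(bets, cutoff):
    """Average P&L for bets with P1 above vs. below a cutoff.

    bets: list of (p1, pnl) tuples. Bets with P1 exactly equal to the
    cutoff are left out of both groups (an assumption).
    Returns (mean_pnl_above, mean_pnl_below); None for an empty group.
    """
    above = [pnl for p1, pnl in bets if p1 > cutoff]
    below = [pnl for p1, pnl in bets if p1 < cutoff]
    return (mean(above) if above else None,
            mean(below) if below else None)


# Made-up bets: (P1 number, profit or loss).
bets = [(1.0, -10.0), (2.0, 5.0), (3.0, 20.0), (10.0, -5.0)]
p1_values = [p1 for p1, _ in bets]

by_median = pnl_by_split(bets, median(p1_values))  # cutoff 2.5
by_mean = pnl_by_split(bets, mean(p1_values))      # cutoff 4.0
```

With these made-up numbers the single large P1 value pulls the mean well above the median, so the two splits group the bets differently, which is the kind of difference the comparison is meant to reveal.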
STATA and Econometrics coaching
Are you able to help with STATA, ECONOMETRICS and META-ANALYSIS? I am looking for someone based in Europe who would help me over a long period (2 or 3 years) on these aspects, if I find the right person. The ideal person would be a university student in Economics who is very good at STATA and Econometrics. What would be your best rate per hour? If you need any clarification, please let me know. Thank you in advance for your response & best regards, Ananda
STATA programmer required
I need code to classify the column readmit (in this example) into two groups based upon hospital ID. Admission ID denotes an individual patient. The first admission is the index admission and each subsequent admission is a readmission. I want to form two subgroups according to whether readmitted patients went to the same hospital. So if the hospital ID is the same for the index admission and the readmission, I want to classify them in one group, while if they were readmitted to a different hospital, they will belong to another group. In this example the hospital IDs for the patient in rows 2 and 3 are different so they should be classified in one group, while rows 8 and 9 have the same hospital ID and will belong to another group. I want to create a separate column with groups A, B, and 0: A = same facility, B = different facility, and 0 = none. Index admissions should be assigned 0. The Excel file is just a sample. I will supply you with a dat file to check the code, which is 10% of the main file (26GB). Thanks
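The grouping rule in this posting could be sketched as follows, in Python with the standard library rather than Stata. The function name `classify_readmissions` and the (admission_id, hospital_id) row layout are hypothetical. It assumes rows arrive in chronological order within each patient, that each readmission is compared with the index admission's hospital (rather than the immediately preceding admission), and that the labels are A, B and 0.

```python
def classify_readmissions(rows):
    """Label each admission as "0" (index), "A" (same hospital), or "B".

    rows: iterable of (admission_id, hospital_id) pairs, assumed sorted so
    that a patient's index admission comes before their readmissions.
    """
    index_hospital = {}  # admission_id -> hospital of the index admission
    labels = []
    for admission_id, hospital_id in rows:
        if admission_id not in index_hospital:
            # First time this patient is seen: this is the index admission.
            index_hospital[admission_id] = hospital_id
            labels.append("0")
        elif hospital_id == index_hospital[admission_id]:
            labels.append("A")  # readmitted to the same facility
        else:
            labels.append("B")  # readmitted to a different facility
    return labels


# Invented sample rows: (admission_id, hospital_id).
sample = [("p1", "H1"), ("p1", "H2"), ("p2", "H3"), ("p2", "H3"), ("p3", "H9")]
groups = classify_readmissions(sample)
```

Because the function makes a single pass and only stores one hospital ID per patient, the same logic could be applied while streaming a large file row by row instead of loading it all at once.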
Hausman test in Stata (with SPSS data file)
I need someone to perform a Hausman test in Stata (in order to decide whether to use a fixed or random effects model). I have a panel data set in SPSS, but I can't perform this test in SPSS. I need it done fast as my deadline is close. I tried it myself but I get multiple errors, and I don't have the time to figure out how Stata really works.
Stata - regression analysis
I need help with Stata, performing multiple variable regression analysis, preferably from someone fluent in the language. I need to reproduce and validate results from a stats paper. This is urgent: it is needed within the next 48 to 72 hours.