Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add instructions for running the study from scratch #38

Closed
mikix opened this issue Jun 11, 2024 · 1 comment
Closed

Add instructions for running the study from scratch #38

mikix opened this issue Jun 11, 2024 · 1 comment
Labels
documentation Improvements or additions to documentation

Comments

@mikix
Copy link
Contributor

mikix commented Jun 11, 2024

Maybe in the readme or a separate markdown doc.

Instructions off the top of my head:

  1. Export DocRefs from EHR
    • If working with a limited set of DocRefs, make sure that they are relevant for the Covid study and are dated between March 2020 and June 2022
  2. Run the covid-symptom ETL task
  3. Run covid_symptom library study
  4. Export tables from the study in Athena with icd10 and ctakes labels into csv files (for chart-review later). I think these are correct:
    • select encounter_ref, symptom_display from covid_symptom__symptom_ctakes_negation
    • select encounter_ref, symptom_display from covid_symptom__symptom_icd10
  5. Configure Label Studio like:
<View>
  <Labels name="label" toName="text">
    <Label value="Congestion or runny nose" background="#100"/>
    <Label value="Cough" background="#040"/>
    <Label value="Diarrhea" background="#008"/>
    <Label value="Dyspnea" background="#b00"/>
    <Label value="Fatigue" background="#0f0"/>
    <Label value="Fever or chills" background="#40a"/>
    <Label value="Headache" background="#afa"/>
    <Label value="Loss of taste or smell" background="#f0f"/>
    <Label value="Muscle or body aches" background="#9bf"/>
    <Label value="Nausea or vomiting" background="#0aa"/>
    <Label value="Sore throat" background="#a44"/>
  </Labels>
  <Text name="text" value="$text"/>
</View>
  1. Identify notes to review
  2. Run cumulus-etl upload-notes --philter=disable --no-nlp on those notes
  3. Have humans review the charts
  4. Export the annotations from Label Studio
  5. Configure chart-review like:
labels:
  - Congestion or runny nose
  - Cough
  - Diarrhea
  - Dyspnea
  - Fatigue
  - Fever or chills
  - Headache
  - Loss of taste or smell
  - Muscle or body aches
  - Nausea or vomiting
  - Sore throat

annotators:
  human1: 1
  human2: 2
  icd10:
    filename: icd10.csv
  ctakes:
    filename: ctakes.csv
  1. Run chart-review accuracy
@mikix mikix added the documentation Improvements or additions to documentation label Jun 11, 2024
@mikix mikix changed the title Add instructions for re-running the study at a new place Add instructions for running the study from scratch Jun 11, 2024
@mikix
Copy link
Contributor Author

mikix commented Jun 17, 2024

Fixed by #39 - we can edit that doc as we see fit.

@mikix mikix closed this as completed Jun 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

1 participant