Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

...

  1.  For the Quarterly refresh only, update the latest RxNorm data for the medication subject area processing.
    1. Login using PuTTY terminal (or SSH GUI) to same server.
    2. Change directory to /dwhome/airflow_ci/master/dags/CDRN_ETL/RxNormLoad/
    3. Run file RxNormLoad.sh
  2. Login to the appropriate Airflow server via a web browser
    1. Development: http://plt-nosql01.dhe.duke.edu:8083/admin/
    2. Production: http://plp-nosql01.dhe.duke.edu:8083/admin/
  3. Edit the Airflow variable CDRN_ETL_Options_STAR
    1. Click Admin→Variables at the top of the Airflow web page.
    2. Click the Edit icon (pencil icon) for the entry CDRN_ETL_Options_STAR Image Modified
  4. Make the following changes to these entries in the Airflow variable CDRN_ETL_Options_STAR
    1. loadToDt to the previous day for the mini-CDM refresh or the last day of the previous month for the quarterly refresh in YYYY-MM-DD format
    2. network to CVD19 for the mini-CDM refresh or STAR for the quarterly refresh

    3. prevPrefix to CVDNEW_ for the mini-CDM refresh or the previous V##_ prefix used for the last STAR refresh

    4. newPrefix to CVDNEW_ for the mini-CDM refresh or the next number available for V##_ prefix for the the current STAR refresh

    5. tempPrefix to ETL_

  5. Start DAG process
    1. Click DAGs at the top of the Airflow web page.
    2. Scroll down the DAG list to the entry for STAR_ETL_Master.
    3. On the right side of the line, there are several icons. Click the Play button (Trigger DAG).
    4. Click OK on the confirmation popup window. This will start the DAG process.
  6. Monitor DAG process
    1. Click on the link for the DAG STAR_ETL_Master from the main Airflow dashboard. This will open the DAG details page.
    2. Click either Graph View or Tree View to see the progress status of the DAG.
      1. Graph View - Will display a graph representation of all task being run and status of each task (Running, Finished, Failed, etc.) Does not display Task Duration Time correctly when hovering over the task. 
      2. Tree View - Will display a tree representation of specific amount of executions, including current execution. Task Duration Time does display correctly here when hovering over the task.
    3. Usual run time for entire process is 10-12 hours.
  7. For quarterly refresh only, generate STAR hash token data.
    1. Login using PuTTY terminal (or SSH GUI) to same server.
    2. Change directory to /dwhome/airflow_ci/master/dags/CDRN_ETL/Datavant/
    3. Run file hash_token_load_master.sh STAR
  8. For quarterly refresh only, execute post-ETL process
    1. Click DAGs at the top of the Airflow web page.
    2. Scroll down the DAG list to the entry for STAR_ETL_PostOp
    3. Run following SQL to backfill any missing hash tokens from previous step:
      1. INSERT INTO HSC_PCOR.<<V##>>_HASH_TOKEN 
        SELECT PATID, 'XXX - S00000', 'XXX - S00000', 'XXX - S00000', 'XXX - S00000', 'XXX - S00000', 'XXX - S00000', 'Backfill'
        FROM HSC_PCOR.<<V##>>_DEMOGRAPHIC WHERE PATID IN
        (
        SELECT PATID FROM HSC_PCOR.<<V##>>_DEMOGRAPHIC vd
        minus
        SELECT PATID FROM HSC_PCOR.<<V##>>_HASH_TOKEN
        );
    4. Once DAG finishes, login using PuTTY terminal (or SSH GUI) to same server.
    5. Change directory to /dwhome/apps/cdrnLoad
    6. Run following command to generate optimization stats for PCORNet tables
      1. nohup sqlplus 'HSC_PCOR/<<PASSWORD>>'@eclrprd1 @gatherStats_HSC_PCOR.sql &

  9. For mini-CDM only, generate and send N3C data
    1. Login to the Windows server vwp-pmndmc.dhe.duke.edu via Remote Desktop Protocol (RDP).
    2. Open Windows File Explorer.
    3. Navigate to directory to D:\Apps\N3C\Project\
    4. Double click on file N3C_Main.bat to start the N3C data script.
    5. Usual run time for batch file is 8-10 hours.
    6. Can check for errors looking at log file H:\N3C\Project\N3C_debug.log
  10. For mini-CDM only, generate and send N3C hash tokens
    1. Go back to the Airflow server from step 1.
    2. Scroll down the DAG list to the entry for STAR_N3C_Hash_Tokens.
    3. Login using PuTTY terminal (or SSH GUI) to same server.
    4. Change directory to /dwhome/airflow_ci/master/dags/CDRN_ETL/Datavant/
    5. Run file hash_token_load_master.sh N3C
    6. Transfer ZIP file created from previous step to local computer
    7. Log into N3C hash token SFTP site (Setup required), 34.132.183.26:2222
      1. Need SFTP private key file
    8. Transfer ZIP file to N3C hash token SFTP site

...

diagnosisdiagnosis
Subject AreaSQL Scripts (Links to Git Repository Location)Notes

Converted to SQL Server?

Python DAGSTAR_ETL_Master.pyEmbedded SQL within Python code. May be able to pull out and place into own SQL file.

CVD19GetPatKeys.sql


STAR_create_etl_tables.sql


STAR_create_etl_tables_CDM61.sql


STAR_create_indexes.sql https://gitlab.dhe.duke.


STAR_replace_etl_views.sql


cdrn_addr_info.sql


cdrn_all_enc_clin.sql


cdrn_all_enc_hosp.sql


cdrn_all_enc_oa.sql


cdrn_meds_info.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/cdrn_meds_info.sql?ref_type=heads


compare_rpt.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/compare_rpt.sql?ref_type=heads


condition.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/condition.sql?ref_type=heads


death.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/death.sql?ref_type=heads


death_cause.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/death_cause.sql?ref_type=heads


demo_rpt.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/demo_rpt.sql?ref_type=heads


demographic.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/demographic.sql?ref_type=heads


diagnosis.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/diagnosis.sql?ref_type=heads


dispensing_per_year.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensing_per_year.sql?ref_type=heads


dispensing_per_year_Surescript_Step1.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensing_per_year_Surescript_Step1.sql?ref_type=heads


dispensing_per_year_Surescript_Step2.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensing_per_year_Surescript_Step2.sql?ref_type=heads


dispensing_use_inc_load.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensing_use_inc_load.sql?ref_type=heads


encounter.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/encounter.sql?ref_type=heads


encounter_dedup.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/encounter_dedup.sql?ref_type=heads


enrollment.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/enrollment.sql?ref_type=heads


harvest.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/harvest.sql?ref_type=heads


harvest_CDM61.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/harvest_CDM61.sql?ref_type=heads


hash_token.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/STARhash_create_indexestoken.sql?ref_type=heads


STARhash_replacetoken_etl_viewsCDM61.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/STARhash_replacetoken_etl_viewsCDM61.sql?ref_type=heads


cdrnhash_addrtoken_infoN3C.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/cdrnhash_addrtoken_infoN3C.sql?ref_type=heads


cdrnhash_all_enctoken_clinbackfill.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/cdrnhash_alltoken_enc_clinbackfill.sql?ref_type=headscdrn_all_enc_hosp


immunization.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/cdrn_all_enc_hospimmunization.sql?ref_type=headscdrn


lab_result_cm_all_enc_oayears.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/cdrnlab_result_cm_all_enc_oayears.sql?ref_type=heads


cdrnlds_medsaddress_infohistory.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/cdrnlds_medsaddress_infohistory.sql?ref_type=heads


comparemed_rptadmin.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/comparemed_rptadmin.sql?ref_type=heads


conditionobs_clin_all.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/conditionobs_clin_all.sql?ref_type=heads


deathobs_clin_flowsheet.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/deathobs_clin_flowsheet.sql?ref_type=headsdeath


obs_clin_causepft.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/deathobs_clin_causepft.sql?ref_type=heads


demoobs_clin_rptsmoking.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/demoobs_clin_rptsmoking.sql?ref_type=heads


demographicobs_clin_vitals.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/demographicobs_clin_vitals.sql?ref_type=heads


obs_clin_vitals_legacy.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obs_clin_vitals_legacy.sql?ref_type=heads


dispensingobs_per_yeargen.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensingobs_per_yeargen.sql?ref_type=headsdispensing


obs_per_year_Surescript_Step1gen_adt.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensingobs_per_year_Surescript_Step1gen_adt.sql?ref_type=headsdispensing


obs_per_year_Surescript_Step2gen_sdoh.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensingobs_per_year_Surescript_Step2gen_sdoh.sql?ref_type=heads


dispensing_useobs_incgen_loadvent.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/dispensingobs_usegen_inc_loadvent.sql?ref_type=heads


encounterpcornet_trial.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/encounterpcornet_trial.sql?ref_type=headsencounter_dedup


prescribing.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/encounter_dedupprescribing.sql?ref_type=headsenrollment


prescribing_mixture_broken_out.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/enrollmentprescribing_mixture_broken_out.sql?ref_type=heads


harvestprescribing_update_rx_basis.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/harvestprescribing_update_rx_basis.sql?ref_type=heads


harvestprivate_address_CDM61geocode.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/harvestprivate_address_CDM61geocode.sql?ref_type=heads




hashprivate_address_tokenhistory.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/hashprivate_address_tokenhistory.sql?ref_type=headshash


private_token_CDM61demographic.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/hashprivate_token_CDM61demographic.sql?ref_type=headshash


private_tokendemographic_N3Cdedup.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/hashprivate_tokendemographic_N3Cdedup.sql?ref_type=headshash


pro_token_backfillcm.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/hashpro_token_backfillcm.sql?ref_type=heads


immunizationpro_cm_sdoh.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/immunizationpro_cm_sdoh.sql?ref_type=headslab_result


_cm_all_yearsprocedures.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/lab_result_cm_all_yearsprocedures.sql?ref_type=heads


ldsprocedures_address_historycaboodle.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/ldsprocedures_address_historycaboodle.sql?ref_type=heads


med_adminprovider.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/med_adminprovider.sql?ref_type=heads


obs_clin_allreferral.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obs_clin_allreferral.sql?ref_type=headsobs


refresh_clinstats_flowsheetupdate.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obsrefresh_clinstats_flowsheetupdate.sql?ref_type=headsobs


remove_clindup_pftkeys.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obsremove_clindup_pftkeys.sql?ref_type=headsobs_clin_smoking


rename_etl_tables_no_drop.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obs_clin_smokingrename_etl_tables_no_drop.sql?ref_type=headsobs_clin


_vitalsrename_etl_tables_w_drop.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obs_clin_vitalsrename_etl_tables_w_drop.sql?ref_type=heads


obs_clin_vitals_legacyvital.sql https://gitlab.dhe.duke.edu/edw-foundations/cdrn/airflowetl/-/blob/master/projSql/CDRN_ETL/STAR/MainETL/obs_clin_vitals_legacyvital.sql?ref_type=heads