Dataset information
Available languages
English
Dataset description
National Cancer Registration and Analysis Service (NCRAS). (2019). Cancer Registration: Epidemiology of seven cancer sites: breast, colorectal, kidney, oesophagus, ovarian, pancreas, uterine (1985-2017) [Dataset]. Public Health England. https://doi.org/10.25503/b7aq-n559
1985-1994: 
•	PATIENT_PSEUDO_ID (Patient specific Pseudo ID) 
•	SEX (coded as 1=Male, 2=Female) 
•	LATERALITY 
•	DCO (Diagnosis of death certificate only coded as 0=No and 1=Yes) 
•	BASIS OF DIAGNOSIS (coded as 1= clinical, 2=clinical investigation, 3=unknown, 4= specific tumour markers, &=unknown)  
•	GRADE (coded as 1=well differentiated, 2=moderately differentiated, 3=poorly differentiated, 4=undifferentiated, &=unknown)
•	SEX_TEXT (coded as Male and Female) 
•	AGE_AT_DIAGNOSIS (grouped as 0-49 years and 50+ years)
•	TIME_PERIOD (grouped as 1985-1989 and 1990-1994)
•	TUMOUR_GROUP (coded as Breast female (C50), Colorectal (C18-C20), Kidney (C64), Oesophagus(C15), Pancreas(C25), Uterine (C54))
1995-2004: 
•	PATIENT_PSEUDO_ID (Patient specific Pseudo ID) 
•	SEX (coded as 1=Male, 2=Female) 
•	LATERALITY (coded as 9=Not known, B=Bilateral, L=Left, M=Midline, R=Right) 
•	HISTOLOGY_CODED
•	HISTOLOGY_CODED_DESC  
•	DCO (Diagnosis of death certificate only coded as N=No and Y=Yes)
•	BASIS OF DIAGNOSIS (1=clinical diagnosis made before death, 2=clinical investigation, 3=unknown, 4=specific tumour markers, 5=cytology, 6=histology of metastasis, 7=histology of primary, 8=unknown, 9=unknown) 
•	GRADE (coded as G1=well differentiated, G2=moderately differentiated, G3=poorly differentiated, G4=undifferentiated, GX= borderline malignancy) 
•	TUMOUR_GROUP (coded as Breast female (C50), Colorectal (C18-C20), Kidney (C64), Oesophagus(C15), Pancreas(C25), Uterine (C54))
•	SEX_TEXT (coded as Male or Female) 
•	AGE_AT_DIAGNOSIS (grouped as 0-49 years and +50 years)
•	TIME_PERIOD (grouped as 1995-1999 and 2000-2004)
•	IMDQUINTILE (coded as X=unknown) 
•	STAGE (coded as X=unknown) 
2005-2009:
•	PATIENT_PSEUDO_ID (Patient specific Pseudo ID) 
•	SEX (coded as 1=Male, 2=Female) 
•	LATERALITY (coded as 9=Not known, B=Bilateral, L=Left, M=Midline, R=Right)
•	HISTOLOGY_CODED 
•	HISTOLOGY_CODED_DESC 
•	DCO (Diagnosis of death certificate only coded as N=No and Y=Yes)
•	BASIS OF DIAGNOSIS (1=clinical diagnosis made before death, 2=clinical investigation, 3=unknown, 4=specific tumour markers, 5=cytology, 6=histology of metastasis, 7=histology of primary, 8=unknown, 9=unknown) 
•	GRADE (coded as G1=well differentiated, G2=moderately differentiated, G3=poorly differentiated, G4=undifferentiated, GX= borderline malignancy)
•	TUMOUR_GROUP (coded as Breast female (C50), Colorectal (C18-C20), Kidney (C64), Oesophagus(C15), Pancreas(C25), Uterine (C54))
•	SEX_TEXT (coded as Male or Female)
•	AGE_AT_DIAGNOSIS (grouped as 0-49 years and +50 years)
•	TIME_PERIOD (2005-2009)
•	IMDQUINTILE (coded as X=unknown) 
•	STAGE (coded as X=unknown) 
2010-2014: 
•	PATIENT_PSEUDO_ID (Patient specific Pseudo ID)
•	SEX (coded as 1=Male, 2=Female) 
•	LATERALITY (coded as 9=Not known, B=Bilateral, L=Left, M=Midline, R=Right)
•	HISTOLOGY_CODED 
•	HISTOLOGY_CODED_DESC 
•	DCO (Diagnosis of death certificate only coded as N=No and Y=Yes)
•	BASIS OF DIAGNOSIS (1=clinical diagnosis made before death, 2=clinical investigation, 3=unknown, 4=specific tumour markers, 5=cytology, 6=histology of metastasis, 7=histology of primary, 8=unknown, 9=unknown) 
•	GRADE (coded as G1=well differentiated, G2=moderately differentiated, G3=poorly differentiated, G4=undifferentiated, GX= borderline malignancy)
•	TUMOUR_GROUP (coded as Breast female (C50), Colorectal (C18-C20), Kidney (C64), Oesophagus(C15), Pancreas(C25), Uterine (C54))
•	SEX_TEXT (coded as Male or Female)
•	AGE_AT_DIAGNOSIS (grouped as 0-49 years and +50 years)
•	TIME_PERIOD (2010-2014)
•	IMDQUINTILE (coded as describing income deprivation where 1= least deprived to 5= most deprived, X=unknown)
•	STAGE (1=stage 1, 2=stage 2, 3=stage 3, 4=stage 4, X=unknown) 
2015-2017:
•	PATIENT_PSEUDO_ID (Patient specific Pseudo ID)
•	SEX (coded as 1=Male, 2=Female) 
•	LATERALITY (coded as 9=Not known, B=Bilateral, L=Left, M=Midline, R=Right)
•	HISTOLOGY_CODED  
•	HISTOLOGY_CODED_DESC 
•	DCO coded as (N=No, Y=Yes, X=unknown)
•	BASIS OF DIAGNOSIS (1=clinical diagnosis made before death, 2=clinical investigation, 3=unknown, 4=specific tumour markers, 5=cytology, 6=histology of metastasis, 7=histology of primary, 8=unknown, 9=unknown) 
•	GRADE (coded as G1=well differentiated, G2=moderately differentiated, G3=poorly differentiated, G4=undifferentiated, GX= borderline malignancy)
•	TUMOUR_GROUP (coded as Breast female (C50), Colorectal (C18-C20), Kidney (C64), Oesophagus(C15), Pancreas(C25), Uterine (C54))
•	SEX_TEXT (coded as Male or Female)
•	AGE_AT_DIAGNOSIS (grouped as 0-49 years and +50 years)
•	TIME_PERIOD (2015-2017)
•	IMDQUINTILE (coded as describing income deprivation where 1= least deprived to 5= most deprived)
•	STAGE (1=stage 1, 2=stage 2, 3=stage 3, 4=stage 4, X=unknown)
Build on reliable and scalable technology