Lecture – 10: Clinical epidemiology IInstructional Objectives:At the end of the lecture, the students would be able to:1- define Clinical epidemiology and its importance in individual patient care.2- demonstrate the role of Validity & Reliability in the clinical diagnosis.3- elicit the main components of validity of the diagnostic indicators.4- determine the "Gold Standard” ( Reference, Definitive) Test.5- construct 2x2 table for evaluating the validity of diagnostic indicators.6- estimate the sensitivity, specificity, (+)ve and (-)ve predictive values and (+)ve and (-)ve likelihood ratios and accuracy rate of the diagnostic indicators using the 2x2 table.7- detect the relation between (+)ve and (-)ve predictive values and disease prevalence.
Clinical epidemiology: Definition : It is " the science concerned with counting clinical events occurring in human beings, and it uses epidemiologic methods to carry out and analyze the counts obtained ". The basic purpose of clinical epidemiology is to develop and apply methods of clinical observations which will lead to valid clinical conclusions.
Clinical epidemiology has 3 main fields :
A . Diagnosis. B . Treatment. C . Prognosis.A clinical observation is valid if it corresponds to the true state in the person observed. Validity ( صلاحية)Two types of validity:Internal validity External validity The internal validity is the degree to which the results of an observation are correct for the patients being studied. The external validity (generalizability) is the degree to which the results of an observation are correct in other settings.
A . Diagnosis. A diagnostic test is valid if it detects most people with the target disorder and excludes most people without the disorder. A positive test usually indicates that the disorder is present and a negative test usually indicates that the disorder is absent . The performance of diagnostic indicators is assessed by epidemiologic methods to determine the: Validity & Reliability.
Validity (صلاحية) of a test expresses the magnitude of real values measured. e.g. when you measure the length of a child how much it is telling the real length? Reliability (مصداقية)expresses the consistency of the measurement if it is repeated many times. e.g. if you measure the systolic blood pressure it was 125 mm mercury if you repeat it another time or if two examiners measure it at the same time, would it be the same or will be other value? Validity is assessed by determining :Sensitivity, specificity, (+)ve and (-)ve predictive values and (+)ve and (-)ve likelihood ratios
The assessment of a diagnostic test could be the result of the components of the diagnosis; i.e.: * A symptom * A sign * A laboratory test Any one of theses indicators could be diagnostic by itself or one of them added to another. The Test: A test is anything that produces evidence from a patient at any stage in the clinical process, based on which a different clinical course will be taken depending on the different possible test outcomes (positive or negative, normal or abnormal, present or absent, high or low, ...). From the clinical epidemiology perspective, the following are examples of a "test": history taking (presence or absence of a component), clinical exam results (presence or absence of a sign), imaging findings (presence or absence of a feature on a radiograph), or response to therapy (as anticipated or not). Few if any tests in medicine are perfect; that is, produce results that can always be interpreted with absolute certainty on every patient to which the test is applied.
"Gold Standard” ( Reference, Definitive) Test: " The tests and procedures necessary to definitively establish to a high level of certainty the presence or absence of the disease in an individual" . e.g. of gold standard tests : biopsy for cancer , endoscope for duodenal ulcer , culture for bacterial infections.-Some disease have no gold standard test like psychosis . Some people have no disease but by these tests result is ( + )ve. The result is called false +ve . When the result of the test is ( - )ve but gold standard says there is disease, it is called false –ve .when the people is diseased and the result is +ve it is called true +ve , when there is no disease and the result is –ve called true –ve .
A 2x2 table:
a+b+c+db+d
a+c
Total
c+d
d True –ve c False –ve (-)ve
a+b
b False +ve
a True +ve
(+)ve
Absent
Present
Total
Disease ( according to gold standard)
Diagnostic indicator
e.g. In duodenal ulcer, we use endoscope (as a gold standard ) and we assess the validity of barium test in diagnosing duodenal ulcer :-
a+b+c+d
b+d
a+c
Total
c+d
d True –ve c False –ve (-)ve
a+b
b False +ve
a True +ve
(+)ve
Absent
Present
Total
Duodenal ulcer( according to endoscope)
Barium meal
SENSITIVITY OF THE TEST: Is the ability of the test to detect those who are test (+)ve among all diseased persons. True (+) ve *Sensitivity rate = ـــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــ x 100% Total persons with disease a Sensitivity rate = ــــــــــــــــــــ x 100% a + cThe sensitivity is the true positive rate True (+)ve rate of the test.
SPECIFICITY OF THE TEST Is the ability of the test to detect those who are test (-)ve among all who are not diseased persons. True (-) ve *Specificity rate = ــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــ x 100% Total persons without disease d Specificity rate = ــــــــــــــــــــ x 100% b + dThe Specificity is the true negative rate True (-)ve rate of the test
PREDECTIVE VALUES :-They are the ability of the test to uncover those who have (or have not) disease among all those with a positive ( or negative) test results.Positive predicative value : True positive Positive predicative (PPV) = ــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــ x100 value All those with positive test results It is the proportion of the diseased individuals among all those with a positive test result. It is also probability in % that individual with positive test result has the disease .
Negative predicative value : True Negative Negative predicative (NPV) = ــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــــ x100 value All those with negative test resultsIt is the proportion of the diseased-free individuals among all those with a negative test result . It is also probability in % that an individual with negative test result doesn't have the disease .Note :- Sensitivity and specificity are pre-test criteria , while the positive and negative predictive values are post-test criteria and these are considered when the test level ( result ) are available . The prevalence of disease = ( a + c ) / ( a +b +c +d )
Example In validating the use of chest X-rays (CXR) for the diagnosis of pulmonary TB. Against what can its accuracy be compared? The gold standard for diagnosing TB is the culture of Mycob TB from the sputum. To validate the use of CXR, we would have to select a certain number of TB suspects and perform both CXR & sputum cultures on them. Let us say 200 people were screened. Interpret the results given in the table:
200 a+b+c+d
100 ( b + d )
100 ( a + c )
Total
50(c+d)
30(d)
20(c )
(-)ve
150 (a+b)
70(b)
80(a)
(+)ve
Negative
Positive
Total
Sputum Culture for TB
Chest X-rays for TB
Using the 2x2 table, we could compute the sensitivity, specificity, positive predictive value and the negative predictive value of the test. Sensitivity: Sensitivity is the proportion of truly non-diseased person who are identified as diseased by the diagnostic test. It is :a/ (a + c) = 80/ (80 + 20) = 80 %. 80% of all those with TB are successfully picked up by chest X-rays. Specificity: Specificity is the proportion of truly non-diseased person who are so identified by the test. it is : d/ ( b + d ) = 30 / ( 30 + 70 ) = 30%. 30% of all those without TB are picked up by chest X-rays as negative.
The positive predictive value of a test is the probability of a test positive person truly having the disease. It is : a / ( a + b ) = 80 / ( 80 + 70 ) = 53% . In other words, only about half of those who are CXR positive are likely to have TB. The negative predictive value of a test is the probability of a test negative person truly not having the disease. It is: d / ( c + d) = 30 /(30+20( =60%In other words, 60% of those who are CXR negative are likely to be sputum culture negative
The more sensitive the test, the better its – ve predictive value.The more specific the test, the better its + ve predictive value.It is important that all new tests should be validated by comparison against a test which is established and considered a gold standard. Diagnostic tests are generally not 100% accurate. If the sensitivity is very high, the specificity tends to be low.
Likelihood Ratios: The newer and more powerful approach to evaluating new diagnostic tests is based on the concept of likelihood ratios. Basically the concept likelihood ratios takes into account both the pre-test probability of a disease (this is reflected by the baseline prevalence of the disease) and the post-test probability (this is reflected by the positive and negative predictive values of the test). a b Positive likelihood ratio = ــــــــــــــــــــ divided by ـــــــــــــــــــ a + c b + d i.e. true ( + ) ve rate / false ( + ) ve rate c d Negative likelihood ratio = ــــــــــــــــــــ divided by ـــــــــــــــــ a + c b + d i.e. false ( - ) ve rate / true ( - ) ve rate
Lecture – 11: Clinical epidemiology IIInstructional Objectives:At the end of the lecture, the students will be able to:1- list the main uses of a sensitive and a specific test.2- record the main Sources of Variability.3- define the Screening of diseases a- enumerate the Criteria of good screening test b- perform evaluation of a Screening Program.c- enumerate Types of Screeningd- list the Criteria For Instituting A screening program.
An ideal test of 100% sensitivity and 100% specificity does not exist. We generally have to choose from the available range of tests with varying sensitivities and specificities. How does one choose the right test? If very important not to miss a disease which is serious and potentially treatable (cancer, for example), it would be better to use a test which has greater sensitivity. One would like to pick up as many cases as possible doing this test. On the other hand, if making a positive diagnosis would result in much worry, stigma (HIV, for example) or cost, then it would be better to use a test which has high specificity.
Uses of a sensitive test: A sensitive test is used : 1- when there is a harmful effect of false negative results , e.g. missing a case of meningitis as a case of flue like disease is fatal in emergency clinics. 2- to rule out differential diagnoses e.g. tuberculin test is used to rule out a suspected TB case.( if negative we are sure there is no TB ) 3- low frequency of a disease in screening process.
Uses of a specific test: A specific test is used : 1- when there is a harmful effect of false positive results , e.g. wrong diagnosis of leukemia is very harmful emotionally and financially to a person with mild disease like iron deficiency anemia. Mostly useful in non-emergency situations. 2- to rule in differential diagnoses e.g. endoscope exam after barium meal testing is used to rule out a suspected gastric cancer case.( if positive confirms the diagnosis ).
Most useful result of a highly sensitive test is when it is negative. Most useful result of a highly specific test is when it is positive.
Accuracy: ( الدقة ) Is the degree to which, on average, a test represents the true value (that is, it is unbiased). a + d Accuracy rate = ـــــــــــــــــــــــــــــــــــ x 100% a +b +c +dReliability: ( المصداقية )The degree to which a test yields the same results when repeated under identical conditions on identical specimens. How good is a procedure when applied by different users. The degree to which different clinicians (observers) applying the procedure classify diseased individuals into the same diagnostic, prognostic or treatment categories.
Sources of Variability Biological variation : because the measures are not constant over time e.g. Bd Pressure, blood sugar, pulse rate, respiratory rate 2. Instrumental variation : due to calibration & standardization. 3. Intra-observer variability : among the same person. As in reading an X-ray which is not a 100% fixed. 4. Inter-observer variability : variation among different people.
Screening of diseases( استكشاف المرض ) Screening :- it is the process by which a specified diagnostic test is applied in order to sort out a group of people into two categories :-1.those who probably have the disease .2.those whop probably have not the disease . Screening of the disease is usually done to detect the disease process early in order to minimize its complications and consequences . The population being tested is comprised predominately of normal individuals that have not been identified as possibly having a clinical case of the disease. Thus, the probability that such an individual has the disease is the prevalence of the disease in the population being screened. Because the disease manifestations are likely minimal in affected individuals, the spectrum of disease is generally less severe in a screening than in a diagnostic setting.
Criteria of good screening test The screening test should have the following characteristics:- *It has high validity in term of sensitivity and specificity . *The test should be simple, and it should be accomplished easily and quickly . Sample of urine, prick of a needle *The test should be applicable and acceptable to a large no. of individuals . *The test should be a safe procedure and not producing harm to the individual being rested , also should be non-invasive. *Should be with beneficial effect for the people . *There should be detectable preclinical stage . e.g. for diseases which are suitable for screening :- essential hypertension , hypercholesterolemia , mammography for breast cancer.
Evaluation Of Screening ProgramsFeasibility: determined by :a- Acceptability (by the no. of people who accept to undergo the test).b- Cost-effectiveness (cost per detected case).c-provision of follow up d-Yield : no. of cases detected by a screening test in a screening program , it is measured by the Predictive Values ( + ve and – ve PV)
2. Effectiveness : It is impact of the screening program in decreasing mortality and morbidity from the disease. It is determined by: a- Severity of the disease at the time of diagnosis. (differences between screen detected cases vs symptoms detected cases). b- Comparison of cause-specific mortality rate among screen detected vs symptom detected. So if we detect a significant difference in the screening then it is effective’. e.g. Breast cancer mortality rates at different times after the start of the follow-up among women receiving screening (mammography) and controls.
Types of Screening Mass Screening : Screening the whole population. e.g. Hypertension, diabetes .. .etc 2. Multiple or Multiphase Screening. Using multiple screening tests at the same time as in Parallel & Series. 3. Targeted Screening : Screening of groups with specific exposures either: a-Occupational : lead factory workers should be checked for the level of lead in blood. b-Environmental : as in the Chernobyl Nuclear disaster. 4. Case-Finding (Opportunistic Screening): Screening of patients who consult a health facility for other purposes e.g. Malaria blood film.
Criteria For Instituting A screening program : * Disease : - serious - high prevalence of preclinical stage - natural history understood. long period between first signs & overt of the disease . *Diagnostic test: -sensitive & specific - simple & cheap - safe& acceptable reliable . *Diagnosis & Treatment: -facilities are adequate -effective, acceptable & safe treatment available.
Lead time is the amount of time by which screening advances the detection of the disease (i.e. the time between detection by a screening test and detection without a screening test). Even if the interval between the (unknown) biologic onset of the disease and death is unchanged, earlier detection will lengthen the interval between diagnosis and death so that survival appears lengthened. Lead time bias results when a screening program creates the appearance of delaying morbidity and mortality but in reality does not alter the natural history.
Suppose that we wish to explore the scope for reducing mortality from breast cancer by early diagnosis. One approach might be to compare the survival of patients whose tumors were detected at screening with that of women who only present once their disease has become symptomatic. However, this could be misleading. Survival might be longer in the screened women not because early treatment is beneficial, but simply because their tumors are being diagnosed earlier in the natural history of their disease (fig).
Lead time (with screening (a) disease is diagnosed earlier than without screening (b) and survival is longer from diagnosis, but this does not necessarily imply that the time course of the disease has been modified.)
Lecture – 12: Clinical epidemiology IIIInstructional Objectives:At the end of the lecture, the students will be able to:1- State the effect of combination of tests ( i.e. combination in parallel and combination in series ).2- determine the effectiveness of treatment.3- define the Prognosis of disease.
MULTIPLE TESTING (COMBINATION OF TESTS) It is the procedure by which we increase the validity criteria by applying two or more tests in a special sequence and consideration . There are two ways of combination :- 1.Combination in parallel : 2. Combination in series :
Combination in parallel :We apply two or more tests at the same time , if any one of them is +ve then the combination set is considered to be +ve . And it is considered –ve when all the tests are –ve . Should be used when rabid assessment is needed ( as in emergencies ) or for routine physical examination . A conclusion in this combination , the sensitivity will increase but the specificity will be decreased , because every ( + ) ve is considered as patient and ill while he is normal .2. Combination in series : We apply test thereafter , and the combination is considered to be positive (+ve) when all the tests are +ve , and considered to be –ve when any one of them is –ve . Serial tests should be used when rapid assessment is not necessary or when some of the available tests are expensive or risky . All tests must have positive results for the sequence to be considered positive for the disease. A conclusion: in this combination , the specificity will be increased because rarely normal persons considered to be ill, but the sensitivity will be decreased .
-
Not necessary
-
-
-
+
Series
+
+
+
-
-
-
+
+
-
Parallel
+
-
+
+
+
+
Interpretation
Test2
Test1
Tests done In :
“Schematic Diagram of the Combination of Screening tests”
B .Treatment : The effectiveness of treatment is measured through : Efficacy( theoretic effectiveness) 2. Effectiveness( practical effectiveness) Efficacy is established by restricting patients in a study ( those who will cooperate fully) with certain method of treatment under ideal conditions. The method will do a better performance is more efficacious. Effectiveness: is established by offering a treatment or a programme to patients and study the outcome under practical conditions.
C . Prognosis: When people they have many questions about their illness ? how it will affect them ? is it dangerous? Could it be fatal? Will there be pain? Prognosis is a prediction of the future course of disease following its onset. It can be measured in several ways e.g. case fatality rate or probability of survival( 5 year survival rate)