This class covers use cases of Bayesian methods in the medical domain. The first part of the class is based on the article “Local computations with probabilities on graphical structures and their application to expert systems” by Steffen L. Lauritzen and David J. Spiegelhalter. The second part is inspired by “An intercausal cancellation model for Bayesian-network engineering” (International Journal of Approximate Reasoning) by S. P. Woudenberg, L. C. van der Gaag, and C. M. Rademaker.

Medical Diagnosis

In this section we follow a simplified medical diagnosis use case, as defined in the following quote from the article.

Structure

The first task of the “knowledge engineer” is to find a Bayesian network structure that fits the story. Automatic tools exist for learning the structure from examples, but in this case the structure should be clear enough to create the network by hand.

Assignments

Draw (on paper?) a Bayesian network describing the story from the previous section.
Write the corresponding ProbLog program:
1. there is no need for first-order logic here
2. use arbitrary probabilities

Probabilities

The problem with the Bayesian model you have just created is that it doesn't provide any useful information, mostly because of the arbitrary prior probabilities you used. Reality is rather harsh: often you don't have access to any realistic priors (one of the arguments raised by critics of Bayesian methods). In this section we will try to make up for that and make the network useful.

Learning

The simplest way to have realistic priors is to not have any priors at all :) In other words, we assume we know nothing about the probabilities. In ProbLog you can state this by writing t(_) in place of the probability, e.g.

t(_)::smoker.

This says that you know nothing about the probability of the patient being a smoker.

Now that we have admitted our lack of knowledge, we can start learning! In ProbLog, learning can be done either with the command-line tool:

problog lfi

or in the online editor by simply choosing Learning from the list.

In both cases you have to provide learning examples. Each example is simply a set of evidence facts, and consecutive examples are separated by a dashed line; e.g. two different patients can be described as:

evidence(smoker).
evidence(\+visitedAsia).
evidence(\+tubercolosis).
evidence(\+lung_cancer).
evidence(\+dyspnea).
evidence(\+xray_positive).
----------------
evidence(\+xray_positive).
evidence(tubercolosis).
evidence(visitedAsia).
evidence(\+lung_cancer).
evidence(dyspnea).
evidence(\+smoker).
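Writing thousands of such examples by hand is impractical, so the file is usually generated. The following is a minimal sketch in Python, assuming each patient is available as a dictionary of true/false observations; the function name format_examples and the sample records are hypothetical and only illustrate the file layout shown above.

```python
def format_examples(patients):
    """Render patient records as ProbLog learning examples.

    Each patient is a dict mapping an atom name to True/False.
    """
    blocks = []
    for patient in patients:
        lines = []
        for atom, value in patient.items():
            # A false observation is written as a negated literal: \+atom
            literal = atom if value else "\\+" + atom
            lines.append(f"evidence({literal}).")
        blocks.append("\n".join(lines))
    # Examples are separated by a dashed line, as in the listing above.
    separator = "\n" + "-" * 16 + "\n"
    return separator.join(blocks) + "\n"


# Hypothetical records, for illustration only.
patients = [
    {"smoker": True, "visitedAsia": False, "dyspnea": False},
    {"smoker": False, "visitedAsia": True, "dyspnea": True},
]
examples_text = format_examples(patients)
print(examples_text)
```

The resulting text can be saved to a file and passed to the learner, or pasted into the online editor.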

Learning should produce a new model with new probabilities.

If you receive an “Inconsistent Evidence” error, include leak probabilities in the model. A leak probability states that a random variable can take a value without any particular reason. In the example below the leak is 0.0, so we state that the variable var can't be true if none of the reasons is true.

t(_)::var :- reason1.
t(_)::var :- reason2.
0.0::var. % leak probability

Make sure to include leak probabilities such that all possible evidence can be linked to a possible world (otherwise ProbLog will return an “Inconsistent Evidence” error).
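To see why the leak matters, recall that several probabilistic clauses with the same head behave like independent causes, so the head's probability is a noisy-OR of the clauses whose bodies hold. The Python sketch below reproduces that arithmetic outside ProbLog; the function noisy_or and the numeric values are assumptions chosen for illustration, not learned parameters.

```python
def noisy_or(leak, clause_probs, active):
    """Probability that the head is true under a noisy-OR combination.

    leak         -- probability of the bodiless fact (e.g. 0.0::var.)
    clause_probs -- probability attached to each clause, keyed by its reason
    active       -- set of reasons that are true in the current example
    """
    p_false = 1.0 - leak
    for reason, p in clause_probs.items():
        if reason in active:
            # Each active cause independently fails to trigger the head.
            p_false *= 1.0 - p
    return 1.0 - p_false


# Hypothetical clause probabilities, for illustration only.
probs = {"reason1": 0.8, "reason2": 0.5}

no_leak = noisy_or(0.0, probs, set())                    # no reason holds, no leak
with_leak = noisy_or(0.01, probs, set())                 # no reason holds, small leak
both = noisy_or(0.0, probs, {"reason1", "reason2"})      # both reasons hold
print(no_leak, with_leak, both)
```

With a leak of 0.0 and no active reason the probability of var is exactly zero, so an example containing evidence(var) without any reason cannot be explained by any possible world. A small positive leak makes such an example unlikely but possible.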

Assignments:

replace all probabilities in your model with t(_)
put some random learning data into the online IDE and check the results of learning
download the data of 10 000 patients
1. try to use it in the online IDE
2. try to use it offline (command line)
  1. you may have to ask the teacher to install ProbLog for you
  2. you may have to limit the number of iterations that learning takes
3. what are the learned probabilities?
4. what is the probability that a smoker with a positive x-ray has lung cancer?
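A rough way to cross-check the answer to the last question is to count directly in the data. The Python sketch below computes a relative frequency, assuming fully observed records held as dictionaries; the function conditional_frequency and the four toy records are hypothetical and are not the course data. The value inferred from the learned network can differ from this raw frequency, because the network structure imposes independence assumptions.

```python
def conditional_frequency(records, target, given):
    """Fraction of records satisfying `given` in which `target` is true.

    Returns None when no record matches the condition.
    """
    matching = [
        r for r in records
        if all(r.get(atom) == value for atom, value in given.items())
    ]
    if not matching:
        return None
    return sum(1 for r in matching if r.get(target)) / len(matching)


# Toy records, for illustration only.
records = [
    {"smoker": True, "xray_positive": True, "lung_cancer": True},
    {"smoker": True, "xray_positive": True, "lung_cancer": False},
    {"smoker": True, "xray_positive": False, "lung_cancer": False},
    {"smoker": False, "xray_positive": True, "lung_cancer": False},
]

estimate = conditional_frequency(
    records, "lung_cancer", {"smoker": True, "xray_positive": True}
)
print(estimate)
```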

Learning + Priors

The previous section was neat, but reality is really harsh and it's really difficult to get good learning data for 10 000 patients.

en/dydaktyka/problog/lab1.1496061932.txt.gz · Last modified: 2019/06/27 16:00 (external edit)


Probabilistic Programming --- Medical Cases