Resolution Theorem Proving

by Adam Pease and Chris Benzmueller

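Resolution theorem proving is an approach to automated reasoning that proves a query by refutation: the negated query is added to the theory, and clauses are resolved against each other until a contradiction (the empty clause) is derived. This document explains the basic algorithm.

The examples use TPTP notation, in which a name beginning with an upper-case letter is a variable, with the equivalent SUO-KIF notation to the right of each one. Pseudo-code for the algorithm follows the worked example.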

Imagine the following theory:

    TPTP notation                                                            SUO-KIF notation
    p(X)=>q(X)          ;; every p is a q                                   (=> 
                                                                              (instance ?X P) 
                                                                              (instance ?X Q))

    q(X)|r(X)=>t(X)     ;; if something is either                           (=> 
                        ;; a q or an r it is also a t                         (or 
                                                                                (instance ?X Q) 
                                                                                (instance ?X R)) 
                                                                              (instance ?X T))
                        
    p(a)                ;; a is a p                                         (instance A P)

    t(a)|r(a)           ;; query: a is a t or an r                          (or 
                                                                              (instance A T) 
                                                                              (instance A R))

    -p(X)|q(X)          ;; clausal form of p(X)=>q(X)                       (or 
                                                                              (not 
                                                                                (instance ?X P)) 
                                                                              (instance ?X Q))

    (-q(X)&-r(X))|t(X)  ;; needs more simplification                        (or 
                                                                              (and 
                                                                                (not 
                                                                                  (instance ?X Q)) 
                                                                                (not 
                                                                                  (instance ?X R))) 
                                                                              (instance ?X T))

    p(a)                                                                    (instance A P)

refutation

    -t(a)&-r(a) ;; can be simplified                                        (and 
                                                                              (not 
                                                                                (instance A T)) 
                                                                              (not 
                                                                                (instance A R)))

    (-q(X)|t(X))&(-r(X)|t(X))                                               (and 
                                                                              (or 
                                                                                (not 
                                                                                  (instance ?X Q)) 
                                                                                  (instance ?X T)) 
                                                                              (or 
                                                                                (not 
                                                                                  (instance ?X R)) 
                                                                                  (instance ?X T)))

    -q(X)|t(X)                                                              (or 
                                                                              (not 
                                                                                (instance ?X Q)) 
                                                                                (instance ?X T)) 

    -r(X)|t(X)                                                              (or 
                                                                              (not 
                                                                                (instance ?X R)) 
                                                                                (instance ?X T))

    -t(a)                                                                   (not 
                                                                              (instance A T)) 

    -r(a)                                                                   (not 
                                                                              (instance A R))
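
The clausification steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: formulas are nested tuples, atoms are plain strings such as "q(X)", all variables are taken to be universally quantified (so there is no quantifier handling or Skolemization), and the names `nnf` and `clauses` are hypothetical helpers, not part of any prover.

```python
def nnf(formula, negate=False):
    """Push negations inward; atoms are strings, other nodes are tuples."""
    if isinstance(formula, str):
        return "-" + formula if negate else formula
    op = formula[0]
    if op == "not":
        return nnf(formula[1], not negate)
    if op == "=>":
        # a => b is rewritten as (not a) or b
        return nnf(("or", ("not", formula[1]), formula[2]), negate)
    if negate:
        # De Morgan: a negated 'or' becomes 'and', and vice versa
        op = "and" if op == "or" else "or"
    return (op, nnf(formula[1], negate), nnf(formula[2], negate))


def clauses(formula):
    """Turn a negation-normal-form formula into a list of clauses."""
    if isinstance(formula, str):
        return [frozenset([formula])]
    op, left, right = formula
    if op == "and":
        return clauses(left) + clauses(right)
    # 'or': distribute over the conjunctions on either side
    return [l | r for l in clauses(left) for r in clauses(right)]


axiom = ("=>", ("or", "q(X)", "r(X)"), "t(X)")
query = ("or", "t(a)", "r(a)")

axiom_clauses = clauses(nnf(axiom))
negated_query = clauses(nnf(("not", query)))
print([sorted(c) for c in axiom_clauses])
print([sorted(c) for c in negated_query])
```

For the second axiom this produces the two clauses -q(X)|t(X) and -r(X)|t(X), and for the negated query the two unit clauses -t(a) and -r(a), matching the example.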

indexes: each symbol points to the clauses that mention it

    A --> p(a)                                                              (instance A P)

    P --> -p(X)|q(X), p(a)                                                  (or 
                                                                              (not 
                                                                                (instance ?X P)) 
                                                                              (instance ?X Q)), 

                                                                            (instance A P)

    Q --> -q(X)|t(X), -p(X)|q(X)                                            (or 
                                                                              (not 
                                                                                (instance ?X Q)) 
                                                                              (instance ?X T)), 

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X P)) 
                                                                              (instance ?X Q))

    R --> -r(X)|t(X)                                                        (or 
                                                                              (not 
                                                                                (instance ?X R)) 
                                                                              (instance ?X T))

    T --> -q(X)|t(X), -r(X)|t(X)                                            (or 
                                                                              (not 
                                                                                (instance ?X Q)) 
                                                                                (instance ?X T)), 

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X R)) 
                                                                              (instance ?X T))

proof search: TBU holds the clauses still waiting to be resolved

    TBU --> {-t(a), -r(a)}                                                  (not 
                                                                              (instance A T)), 

                                                                            (not 
                                                                              (instance A R))

    Candidates for -t(a) --> {p(a), -q(X)|t(X), -r(X)|t(X)}                 (instance A P), 

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X Q)) 
                                                                              (instance ?X T)), 

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X R)) 
                                                                              (instance ?X T))

    TBU --> {-r(a), -q(a), -r(a)}                                           (not 
                                                                              (instance A R)), 

                                                                            (not 
                                                                              (instance A Q)), 

                                                                            (not 
                                                                              (instance A R))

    TBU --> {-r(a), -q(a)}                                                  (not 
                                                                              (instance A R)), 

                                                                            (not 
                                                                              (instance A Q))

    Candidates for -r(a) --> {p(a), -t(a), -r(X)|t(X)}                      (instance A P), 

                                                                            (not 
                                                                              (instance A T)), 

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X R)) 
                                                                              (instance ?X T))

    TBU --> {-q(a)}                                                         (not 
                                                                              (instance A Q))

    Candidates for -q(a) --> {p(a), -t(a), -r(a), -q(X)|t(X), -p(X)|q(X)}   (instance A P), 

                                                                            (not 
                                                                              (instance A T)),

                                                                            (not 
                                                                              (instance A R)),

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X Q)) 
                                                                              (instance ?X T)), 

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X P)) 
                                                                              (instance ?X Q))

    TBU --> {-p(a)}                                                         (not 
                                                                              (instance A P))

    Candidates for -p(a) --> {p(a), -t(a), -r(a), -q(a), -p(X)|q(X)}        (instance A P), 

                                                                            (not
                                                                              (instance A T)),

                                                                            (not 
                                                                              (instance A R)),

                                                                            (not 
                                                                              (instance A Q)),

                                                                            (or 
                                                                              (not 
                                                                                (instance ?X P)) 
                                                                              (instance ?X Q))
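
The symbol index and the candidate lookups in the trace above might be built along the following lines. This is a sketch, not the authors' implementation: it assumes a literal is a tuple (is_positive, predicate, args), a clause is a frozenset of literals, and terms are flat (no function symbols). The helper names `symbols`, `build_index` and `candidates` are made up for the illustration.

```python
from collections import defaultdict


def is_var(term):
    # TPTP convention: a name starting with an upper-case letter is a variable.
    return term[0].isupper()


def symbols(literal):
    """Predicate name plus constant arguments; variables are not indexed."""
    _sign, pred, args = literal
    return {pred} | {a for a in args if not is_var(a)}


def build_index(kb):
    """Map every symbol to the set of clauses in which it occurs."""
    index = defaultdict(set)
    for clause in kb:
        for literal in clause:
            for sym in symbols(literal):
                index[sym].add(clause)
    return index


def candidates(clause, index):
    """Union of the index entries for every symbol in the clause."""
    found = set()
    for literal in clause:
        for sym in symbols(literal):
            found |= index.get(sym, set())
    return found


p_a = frozenset({(True, "p", ("a",))})                          # p(a)
p_q = frozenset({(False, "p", ("X",)), (True, "q", ("X",))})   # -p(X)|q(X)
q_t = frozenset({(False, "q", ("X",)), (True, "t", ("X",))})   # -q(X)|t(X)
r_t = frozenset({(False, "r", ("X",)), (True, "t", ("X",))})   # -r(X)|t(X)

index = build_index([p_a, p_q, q_t, r_t])
not_t_a = frozenset({(False, "t", ("a",))})                     # -t(a)
found = candidates(not_t_a, index)
```

Here `found` holds p(a), -q(X)|t(X) and -r(X)|t(X), the same candidate set the trace lists for -t(a). Note that p(a) is returned only because it shares the constant a; it cannot actually resolve with -t(a).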

Pseudo-Code

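    Clausify()
    BuildIndexes()
    NegateQuery() - then put the resulting clauses in TBU
    while not empty(TBU)
        remove statement S from TBU
        find candidate resolvers for S
        add S to knowledge base
        for each candidate C
            resolve C with S - unify a pair of complementary literals
            if the resolvent is the empty clause
                success!
            else
                add the resolvent(s) to TBU
    failure - TBU is empty and no contradiction was found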
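
The pseudo-code above can be turned into a small runnable sketch. Several simplifications are assumptions of this illustration and not claims about any real prover: terms are flat (constants and variables only), every clause in the knowledge base is treated as a candidate instead of consulting an index, there is no factoring (so the search is not complete in general), input clauses are assumed not to use variable names of the form V_1, V_2, ..., and a step limit guards against runaway searches. All function names are hypothetical.

```python
from itertools import count

_fresh = count(1)


def is_var(term):
    return term[0].isupper()


def walk(term, subst):
    """Follow variable bindings until an unbound term is reached."""
    while term in subst:
        term = subst[term]
    return term


def unify(args1, args2):
    """Unify two flat argument tuples; return a substitution or None."""
    if len(args1) != len(args2):
        return None
    subst = {}
    for s, t in zip(args1, args2):
        s, t = walk(s, subst), walk(t, subst)
        if s == t:
            continue
        if is_var(s):
            subst[s] = t
        elif is_var(t):
            subst[t] = s
        else:
            return None  # two different constants
    return subst


def rename(clause):
    """Give the clause fresh variables so it shares none with its partner."""
    mapping = {}

    def fresh(term):
        if not is_var(term):
            return term
        if term not in mapping:
            mapping[term] = f"V_{next(_fresh)}"
        return mapping[term]

    return frozenset((sign, pred, tuple(fresh(a) for a in args))
                     for sign, pred, args in clause)


def resolve(c1, c2):
    """All binary resolvents of c1 with a renamed copy of c2."""
    c2 = rename(c2)
    resolvents = []
    for l1 in c1:
        for l2 in c2:
            if l1[0] != l2[0] and l1[1] == l2[1]:
                subst = unify(l1[2], l2[2])
                if subst is not None:
                    rest = (c1 - {l1}) | (c2 - {l2})
                    resolvents.append(frozenset(
                        (sign, pred, tuple(walk(a, subst) for a in args))
                        for sign, pred, args in rest))
    return resolvents


def prove(kb, negated_query, max_steps=1000):
    kb = set(kb)
    tbu = list(negated_query)
    for _ in range(max_steps):
        if not tbu:
            return False          # TBU is empty: no refutation found
        s = tbu.pop()             # last in, first out: depth-first
        cands = list(kb)          # candidates are collected before S is added
        kb.add(s)
        for c in cands:
            for resolvent in resolve(s, c):
                if not resolvent:
                    return True   # empty clause: contradiction
                if resolvent not in kb and resolvent not in tbu:
                    tbu.append(resolvent)
    return False                  # step limit reached without a refutation


kb = [
    frozenset({(True, "p", ("a",))}),                        # p(a)
    frozenset({(False, "p", ("X",)), (True, "q", ("X",))}),  # -p(X)|q(X)
    frozenset({(False, "q", ("X",)), (True, "t", ("X",))}),  # -q(X)|t(X)
    frozenset({(False, "r", ("X",)), (True, "t", ("X",))}),  # -r(X)|t(X)
]
negated = [frozenset({(False, "t", ("a",))}), frozenset({(False, "r", ("a",))})]
proved = prove(kb, negated)
```

On the example theory the search derives -q(a), then -p(a), and finally the empty clause against p(a), so `proved` is True.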

The biggest complication is speed. On a knowledge base of any size, the combinatorial explosion inherent in this depth-first search makes heuristics and optimizations essential. One optimization is to check that any clause about to be added to TBU is not already subsumed by a clause in TBU or in the knowledge base. For example, if the knowledge base contains p(X), there is no point in adding p(a) or r(X)|p(X) to TBU.
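
A subsumption test of this kind could look roughly like the sketch below. It assumes flat terms and the literal representation (is_positive, predicate, args), and it uses one-way matching: only the variables of the more general clause may be bound, while the other clause's variables are treated as fixed names. The names `match` and `subsumes` are hypothetical.

```python
def is_var(term):
    return term[0].isupper()


def match(pattern_args, target_args, subst):
    """Bind pattern variables to target terms; return new bindings or None."""
    if len(pattern_args) != len(target_args):
        return None
    subst = dict(subst)
    for p, t in zip(pattern_args, target_args):
        if is_var(p):
            # setdefault keeps an existing binding, which must then agree
            if subst.setdefault(p, t) != t:
                return None
        elif p != t:
            return None
    return subst


def subsumes(general, specific, subst=None):
    """True if some instance of `general` is a subset of `specific`."""
    subst = {} if subst is None else subst
    if not general:
        return True
    first, rest = general[0], general[1:]
    for sign, pred, args in specific:
        if (sign, pred) == first[:2]:
            extended = match(first[2], args, subst)
            if extended is not None and subsumes(rest, specific, extended):
                return True
    return False


p_X = (True, "p", ("X",))
p_a = (True, "p", ("a",))
r_X = (True, "r", ("X",))

redundant_unit = subsumes([p_X], [p_a])        # p(X) subsumes p(a)
redundant_wide = subsumes([p_X], [r_X, p_X])   # p(X) subsumes r(X)|p(X)
not_redundant = subsumes([p_a], [p_X])         # p(a) does not subsume p(X)
```

The first two checks come out True and the third False, which matches the example in the text: with p(X) in the knowledge base, neither p(a) nor r(X)|p(X) is worth adding.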

Another small optimization is to do as little checking as possible at run time. For many applications, the more work that can be done once, up front, the better, as opposed to repeating the same operation several times at run time. We might canonicalize all variables, so that p(X)|r(Y) becomes p(V1)|r(V2) and p(M)|r(N) also becomes p(V1)|r(V2). Because all variable names are local to their clause, a simple string comparison then shows that the two are the same. p(X)|r(X) is also identical to r(X)|p(X), so we might sort the literals of each clause alphabetically, giving p(X)|r(X) in both cases (the second clause is rearranged into the canonical order).
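
One way to sketch this canonicalization, again assuming flat terms and literals of the form (is_positive, predicate, args): sort the literals on a key that ignores variable names, then rename variables in order of first appearance. The function name `canonical` is hypothetical, and the result is not a true canonical form when two literals share a predicate and sign, since their relative order then depends on the input.

```python
def is_var(term):
    return term[0].isupper()


def canonical(clause):
    """Return a string form that is stable under renaming and reordering."""
    def shape(literal):
        # Sort key that masks variable names so they cannot affect the order.
        sign, pred, args = literal
        return (pred, sign, tuple("_" if is_var(a) else a for a in args))

    names = {}
    parts = []
    for sign, pred, args in sorted(clause, key=shape):
        renamed = []
        for a in args:
            if is_var(a):
                # First occurrence gets the next free name V1, V2, ...
                a = names.setdefault(a, f"V{len(names) + 1}")
            renamed.append(a)
        parts.append(("" if sign else "-") + f"{pred}({','.join(renamed)})")
    return "|".join(parts)


first = canonical([(True, "p", ("X",)), (True, "r", ("Y",))])    # p(X)|r(Y)
second = canonical([(True, "r", ("B",)), (True, "p", ("A",))])   # r(B)|p(A)
shared = canonical([(True, "r", ("X",)), (True, "p", ("X",))])   # r(X)|p(X)
```

Both `first` and `second` become the string p(V1)|r(V2), so a plain string comparison treats them as the same clause, while `shared` becomes p(V1)|r(V1) because its two literals use the same variable.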

We could also use more elaborate indexing, for example keeping two sets of pointers for each term: one to the clauses in which it appears in a positive literal, and another to the clauses in which it appears in a negative literal. That gives a better chance of returning only those statements that can actually unify with the statement to be proven.
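
A polarity-aware index of this sort might be sketched as follows. For brevity this illustration indexes predicate names only, not constants, and assumes literals of the form (is_positive, predicate, args) with clauses as frozensets. The names `build_polarity_index` and `candidates` are hypothetical.

```python
from collections import defaultdict


def build_polarity_index(kb):
    """Two maps from predicate name to clauses: positive and negative uses."""
    positive = defaultdict(set)
    negative = defaultdict(set)
    for clause in kb:
        for is_positive, pred, _args in clause:
            (positive if is_positive else negative)[pred].add(clause)
    return positive, negative


def candidates(clause, positive, negative):
    """Clauses containing a literal of opposite sign on the same predicate."""
    found = set()
    for is_positive, pred, _args in clause:
        opposite = negative if is_positive else positive
        found |= opposite.get(pred, set())
    return found


p_a = frozenset({(True, "p", ("a",))})                          # p(a)
p_q = frozenset({(False, "p", ("X",)), (True, "q", ("X",))})   # -p(X)|q(X)
q_t = frozenset({(False, "q", ("X",)), (True, "t", ("X",))})   # -q(X)|t(X)
r_t = frozenset({(False, "r", ("X",)), (True, "t", ("X",))})   # -r(X)|t(X)

positive, negative = build_polarity_index([p_a, p_q, q_t, r_t])
not_t_a = frozenset({(False, "t", ("a",))})                     # -t(a)
found = candidates(not_t_a, positive, negative)
```

For -t(a) this returns only -q(X)|t(X) and -r(X)|t(X). The clause p(a), which a plain symbol index would also return because it mentions a, is no longer a candidate.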

Thanks to Geoff Sutcliffe for his help in correcting errors and making this clearer.
