[POPLmark] Adequacy for LF

Robert Harper rwh at cs.cmu.edu
Mon May 2 22:19:21 EDT 2005


I have been through this exercise with a full-scale language (Standard 
ML), two times now, with one being formalized as we speak (it is 90% 
finished, and we do not expect any problems in completing it).  There 
is no value in the exercise; evaluation contexts buy you nothing.

Bob


On May 2, 2005, at 8:09 PM, Stephan Zdancewic wrote:

> Karl Crary wrote:
>
>> There's been some confusion about the adequacy of our encoding of the 
>> operational semantics; in particular, why our encoding is adequate, 
>> given that the paper presents the semantics using evaluation contexts 
>> and our encoding does not.  We wanted to clarify why it is adequate, 
>> and moreover, why we thought its adequacy was obvious.
>
> Hi Karl,
>  I would also like to elaborate a bit more on why we originally gave 
> the Twelf implementation a grade of "-" with respect to the 
> operational semantics.  Part of the confusion (and this is probably 
> our fault entirely) is that the point of PoplMark is *not* to prove 
> the soundness of FSub.  Instead, the point is to allow us to compare 
> proofs of "the same thing" using different theorem proving and term 
> representation strategies.  Because different term representations 
> will likely need to use different proof techniques, there has to be 
> some wiggle room for what exactly "the same thing" is, but by trying 
> to keep as many features of the formalism the same across different 
> implementations, we had hoped to make comparison easier.  In addition, 
> the other point of the PoplMark Challenge is to see how the 
> technologies scale to handle different language constructs, including 
> things like exceptions, etc. that are more cleanly handled by 
> evaluation contexts.  (See Peter's mail.)
>  As you rightly point out, the operational semantics for FSub doesn't 
> require evaluation contexts and omitting them may make the proofs 
> cleaner.  In fact, never have we said that your solution was 
> inadequate (in the technical sense).  Our objection was solely to the 
> fact that it didn't follow the spirit of the challenge.  We should 
> have clarified this from the beginning.
>
>  Putting the adequacy of your original encoding aside for the moment, 
> let me ask you this: How hard would it be to formalize the operational 
> semantics, using evaluation contexts, following the paper version as 
> closely as possible?  My impression is that it wouldn't be that much 
> harder than the existing solution.  If it *would* be significantly 
> harder, then that in itself is interesting, and I would like to 
> understand why.
> Thanks,
>  Steve
> _______________________________________________
> Poplmark mailing list
> Poplmark at lists.seas.upenn.edu
> http://lists.seas.upenn.edu/mailman/listinfo/poplmark



More information about the Poplmark mailing list