Log on: Remember me
Powered by Elgg
  • Publish Comment:

  • David Schlangen's Pages:

    Pages
  • David Schlangen

  • Owned communities

David Schlangen : Home Page > 050508cont

  - semantics of utterances: for prototype, treat all referring
    expressions as *names*, i.e. with their semantics specified in the
    lexicon? 
    - for pieces: expressions like "the cross". A bit of a stretch to
      treat things like "the second on the left side in the
      second row" as names. This can be translated into some sort of
      coordinate, but would still need to be resolved -- that this
      expression refers to piece3 can't be a fact of the lexicon.
    - for locations: expressions like "the back of the elephant" refer
      to a location, which can initially be represented by a single
      coordinate. Then zooming-in process sets in (see below).
  - how much intelligence does the non-ling-action executor have? What
    does the Dialogue Manager communicate to it?
    Let's assume the question was "where shall I move it?" (i.e.,
    piece has been identified), and the answer "to the back of the
    elephant" has been correctly understood. This instruction is
    underspecified. How do we get to the right location? What does the
    DM tell the "arm"? And what does the arm answer back?
    - DM: `move (10,20)'
    - Arm: done. closest legal position is (11,20). [i.e., arm knows
      about legal positions, identifies these independently]
    - DM: to TTS `here?'
    - User: `yes'
    - DM to arm `drop'
  - for prototype: movements can be interrupted by user. `action
    barge-in' 
  - for prototype: nothing can be removed?
  - TO DOs parser:
    - brain-stormed a bit. (On Tuesday.)
      - keep structure of referring expressions around, even if for
        now we treat them as names.
      - HPSG-inspired? With HEAD, DTRS, SEM, RESTR features etc.
      - but basis (for semantics) is the frame:
       ACTION: move
AGENS: system (implicit)
PATIENS: piece
START_POS: 
TARGET_POS:
        features can be left unspecified, even when utterance is
        complete.
      - identified issues:
       1. complex descriptions (see above). Keep internal structure
           of descriptions, for latter (concurrent) processing by
     other modules
2. also need semantics for interaction management utterances
           like "yes", "no", "one up", "daneben", etc.
        3. need info about (potential) completeness. Parser features
     as used in Nigh experiments.
4. fragments / context ellipsis.
  "Welches Teil soll ich jetzt nehmen?" "Das Kreuz"
  should produce same semantics as "Nimm das Kreuz"? Or is
     this contextual merge done in later module?
  - TO DOs for ASR part:
    - get partial results w/ n-gram model?
    - get confidence scores out
    - learn more about configurations
    - get n-best lists / lattices
    ---> Friday, 1st International Sphinx Day



das, 05/08/08 10:21 (GMT)

Add a new page under this one