Log on: Remember me
Powered by Elgg
  • Publish Comment:

  • David Schlangen's Pages:

    Pages
  • David Schlangen

  • Owned communities

David Schlangen : Home Page > minutes190508

  - present: M, T, D
  1. akustisches Modell
     - momentan nicht besonders gut. Trainiert mit Daten von vox forge
       (?).
     - to be redone with Pento Naming Corpus data
     - also add Verbmobil data? Kiel Korpus?
     - when that is done, tackle other points:
       - LexTree instead of SimpleLM, so that tri-gram LMs can be used
       - n-best lists, or lattices
       - confidence scores
     --> by end of May, we will have model that is good enough to at
     least get an idea of what we will be working with.
Germany's next top (acoustic) model!
  2. literature day(s)
     - topics:
       - parsing / semantics for SDS
       - incremental parsing (in general)
       - parsing and prosody
       - incremental systems
       - EOT prediction / recognition
     ----> collect paper suggestion on Wiki. Michaela will organise
       the (first) day.
  X. brief interlude: do we need new collaboration management system?
     What do we use at the moment:
     - email .. anouncements, rarely for content, moving of files
       - pro:
       - archivable / searchable
- attachments
     - IM    .. quick questions (T and D; M doesn't use at all)
       - pro:
       - fast, instant (d'uh)
       - cons:
       - not easy to search. breaks unity of transmission (where did
           I read that?)
     - Elgg  .. mostly for meeting minutes (and there the wiki part is
           used only). T uses it for status reports. M doesn't
   use it at all. Not used for literature notes etc.
       - pro:
       - archival.. ?
       - cons:
       - active effort needed to put things there & to check for new
     entries (since integration of RSS in our workflows doesn't
     seem to work yet)
     What we'd need:
     - ideal would be a system that has more than one interface,
       including email. That is, new content can be contributed by
       email, web, whatever, is spread via email, can be searched in
       one central place.
       Bonus: has interface to svn, e.g. one can link to documents in
       the svn. also has IM client, and archives IMs. 
     .. is that trac? probably not. Does that exist? Probably not.
  3. WOz
     - controls mouse and prompts
     - data can be used for acoustic model and language model, also
       for learning about dialogue dynamics that can be expected. Main
       goal is to see how people behave if they assume that the
       capabilities of the instruction follower are limited.
  4. Parser, requirements
     - robustness. Can't assume that it always will be able to parse
       into sentences. Should be happy with intermediate
       constituents.
       Doesn't this requirement fall out of incrementality in any
       case? If partial results are passed on, they always will be
       sub-S constiuents.
       Yes, but the problem has a slightly different aspect as
       well. The question is what to do if what the parser can
       possibly recognise (because of ASR problems) are sequences of
       NPs. If there is no syntactic rule that could potentially
       integrate those, if not specially prepared, the parser would
       not even attempt to build the later NPs.
       So what it boils down to again is the question of whether the
       parser needs a notion of being "restarted" or not.
     - incrementality. d'uh.
     - non-commital, capable of making revisions
       related to topic above. What happens if the parser decides to
       give up integrating new material into the current structure?
     - mid-term: probabilistic, integrate prosody (as information on
       words or as pseudo tokens), parse lattices
     Discussed:
     - is top-down parsing a good idea for an incremental parser? can
       this work? Think through, what are the problems that can arise
       in either case?
       --> Michaela? Think up a few example cases (including ambiguous
         and garbage-full sentences) and see what either parsing
     strategy will do.
     - how is commital (when parser decides that what it currently is
       consuming is unrelated to structure it has previously built)
       realised technically, how is it triggered? 
       Cleaning datastructures. Empty chart, or un-link datastructure?



das, 05/20/08 10:01 (GMT)

Keyword: acoustical model, ASR, meetings, minutes, parser

Add a new page under this one