(Joint work with Jeffrey Mark Siskind
and Andrei Barbu)
Learning to play Tic Tac ToeA system that learns to play board games. Three independent and uncoupled agents, the protagonist, antagonist, and wannabe, timeshare the arm and cameras. The protagonist and antagonist play several games of Tic Tac Toe while the wannabe watches and learns the rules. The wannabe then plays against the protagonist with the learned rules.
Learned rules
|
Learning to play HexapawnThe unmodified system can learn other games, such as Hexapawn. Variants on hexapawn that generate arbitrarily long games, with different captures and moving backwards can also be learned.
Learned rules
|
Learning to play Hexapawn variant DThe system learning to play an extension of Hexapawn augmented with sideways and backwards vertical non-capturing moves.
Learned rules
|
Learning Tic Tac Toe from English rulesStarting with rules described in English, the antagonist and protagonist play several games of Tic Tac Toe while the wannabe watches. Each views the board from a different position. The wannabe learns a logical description of the game, and then plays a game against the protagonist. |
Disassembly of a Lincoln Log structureAfter the arm is calibrated, the pose and structure of the Lincoln Log structure are determined, then the structure, whose pose and structure have been determined solely from visual input, is disassembled. |
Disassembly of a Lincoln Log structure from multiple views and linguistic inputAfter the arm is calibrated, the pose and structure of the Lincoln Log structure are determined, one view is insufficient, a second view is used and resolves the ambiguity, then the second view is forgotten and a linguistic constraint is applied. The structure determined solely from visual input, is then disassembled. |
Lincoln Log structure estimation from a single imageOnce we have determined the pose of a Lincoln Log assembly (left) using techniques from Web Figure 10, we can correctly determine the types and positions of the logs (shown in green) that constitute the assembly (right). |
|
![]() |
![]() |
Structure estimation from spatially distinct viewsDue to occlusion, a single view (top) may provide insufficient information to support correct structure estimation (the false negative shown in orange).Integrating information from a second view (bottom) of the same structure, prior to disassembly, can correct the error. (Correctly determined absence of logs is shown in blue.) |
|
![]() |
![]() |
![]() |
![]() |
Web Figure 13: Structure estimation from temporally distinct viewsAnother way to recover occluded information is to begin the task of disassembly with partial information (top) and then reimage the structure from the same camera pose part-way through disassembly after the occlusion has been eliminated.The information from two temporally distinct views of distinct assembly states can be integrated to yield a correct model of the initial structure (bottom). |
|
![]() |
![]() |
![]() |
![]() |
Web Figure 14: Structure estimation given constraintsOccluded information can be recovered from a single image by constraining the space of possible structures (in this case specification of the piece inventory).Our goal is for multiple agents to communicate such constraints linguistically and infer such constraints through high-level reasoning. |
|
![]() |
![]() |