In my Master's Thesis for the TICMA Master at the UPF I reviewed the shape descriptors and implemented three that can be used for control in tabletop systems, such as the reactable.
Here you can watch two videos showing the result of these descriptors:
reacTIVision is a computer vision intended to be used for tabletop systems.
A tabletop or surface computer is a flat surface similar to a table, that in fact is a big screen with multitouch capabilites. Is easier to understand what it is by looking for example at Microsoft Surface which is a commercial tabletop system.
reacTIVision is an open source vision system that can be used for the development of tabletop systems. Typically, a vision-based tabletop system has a camera under the semitransparent surface of the table which allows the computer to understand what is on the table. The information captured with the camera is the input for the tabletop (just as the keyboard and mouse is the input for a normal computer), and then the tabletop projects images in its surface (just like the screen of a computer).
The most successful aplication that uses reacTIVision is the reactable, a novel musical instrument developed at the Music Technology Group of the Universitat Pompeu Fabra by the same team that develops reacTIVision.
reacTIVision is able to recognize fingers and objects tagged with a fiducial (something similar to a barcode). The purpose of my Master's Thesis, was to extract information from generic untagged objects.
The three selected shape descriptors are:
The skeleton is computed from the Distance Transform and the Voronoi Diagram
The Distance Transform computes the distance to the nearest contour pixel. If we interpret the DT as a height in a 3D image, the ridges would be the skeleton:
This skeleton is then simplified keeping the points of high curvature and the necks.
From this simplification the shape can be reconstructed easily.
You can watch my presentation or you can download the Master's Thesis from the MTG's web site or my local copy
Comments