<b style>yes I agree with you in case of classical predator/tracking algorithm.But </b><b style>components used in pertilarly tld apart from tracking,provide necessary information for recognition.</b><div style><b> </b></div>
<div style><b>To make sure of these facts I communicated with Author of this algorithm,who used</b> <b>combination of haar waveletts and local binary patterns, as key features to trackobjects,which we use for recognition.</b></div>
<div style><b><br></b></div><div style><b>I gone throuth most of key concepts from his paper.His key components of model are tracker and detector which agree or decides position of object of interest.</b></div><div style>
<b>here is complete project details of opentld</b></div><div style><a href="http://info.ee.surrey.ac.uk/Personal/Z.Kalal/tld.html" target="_blank" style="color:rgb(17,85,204)">http://info.ee.surrey.ac.uk/Personal/Z.Kalal/tld.html</a></div>
<div style><br></div><div style><b>this predator is very generic,in this site itself we can see his(autor) face is also recognised,from set of photos when he shows in front of cam.one more important fact is we have parameter to vary confidence of recognition.</b></div>
<div style><b><br></b></div><div style><b>regarding depth recognition:this algorithm also provides scaling information about object(in our case its face).once we know initial position from cam,then on we can use transformation concepts to determine distance from cam.even though its very hard to determine depth exactly because we need stereo information like we need two cameras and it needs lot of time may be reseach,we get crude aproximation about depth which definately improve acoustic model.</b></div>
<div style><b><br></b></div><div style><b>I am happy if anyone help me to improve or let me know if any flaws in this approach.</b></div>