[완료]마우스 제스쳐 인식과 내적에 대한 질문입니다.

글쓴이: duraduri / 작성시간: 화, 2008/01/08 - 3:18오후

안녕하세요 이글은

블랙 앤 화이트에서 사용했던 마우스를 이용하여 특정한 패턴을 그리면 반응하는

글에 대한 질문입니다. 원문은

http://www.gamedev.net/reference/articles/article2039.asp 에 있습니다.

간단하게 점이 32개가 모여서 선을 만들게 되어있습니다.

이 선이 미리 정의된 다른 선과 비교를 하여 비슷한지 여부를 판단하게 됩니다.

이를 위해 내적을 이용하게 됩니다. 사실 저는 수학적 능력이 부족하여 그냥 각각의

점들의 위치를 비교하면 되지 않을까 했지만 본문에서 이렇게 내적을 사용하여 급호기심에

질문을 하게됩니다.

신기한것은 제가 고등학교때 배웠던 내적의 경우 2차원 벡터였습니다.

aㆍb = x1x2 + y1y2 = |a||b|cosθ

정도의 공식으로 알수 있었습니다. 그런데 이 글에서

The only difference is that our vectors are not 2 or 3 dimensional, but 2*N dimensional, where N is the number of points in the gesture. Otherwise it's the same familiar x1*x2 + y1*y2 +… (Professional statisticians call this formula correlation; now you know that's the good old dot product in disguise)

2, 3차원 벡터가 아니면 N이 점의 갯수인, 2*N차원 벡터로 계산을

x1*x2 + y1*y2 +(32개 점중 첫번째 점)
x1*x2 + y1*y2 +(32개 점중 두번째 점)
.............
x1*x2 + y1*y2 (32개 점중 32번째 점)

하여 전부 더해진 값으로 계산해줍니다.

그리고 나서

cosθ = (uㆍv) / (sqrt(uㆍu) * sqrt(vㆍv))

cosθ를 구해주어 1(두 벡터가 0도 차이가 나면 1이 되며 이게 완전하게 매칭된것입니다.)

두 벡터 사이에서는 알겠습니다만 어떻게 이렇게 연속된 점들간에 내적을 통해서도 cosθ를

이용해 얼마나 유사한지를 판단할수 있는데 내적을 쓸수 있는지 궁금합니다.

이런 심화된 내적들은 수학은 선형대수에 있는 건가요?

막상 쓰려고 하니 어떻게 돌아가는지가 참 궁금하네요. 그럼 좋은 하루 되세요

이 아래 부분이 매칭 부분관련 글 입니다.
-----------------------------------------------------------------------
Shape Up!

How would we like our magic shape matcher to work? I'd be nice if it would compare the gesture, drawn by user to one of the stored gestures, then it'd return a score: say, 1 for a perfect match, a number, slightly less than 1 for a similar shape and a small or even negative number for a very different shape. We can then compare the input to each of the gestures in our database, pick the one with the highest ranking, and if the ranking is high enough, we say "That's what it is!"

OK, it's easy to say, but how do you actually do it? Well, look closer at the requirements for the shape matcher, does it remind you of something? No? How about good old dot product? If you give it two normalized vectors it'll return 1 if the vectors are exactly the same, it will return a value slightly less than 1 for vectors that point in more-or-less same direction and it'll return a low or even negative number for vectors that point in wildly different (or opposite) directions. Amazingly enough, it works just as well for shape matching! The only difference is that our vectors are not 2 or 3 dimensional, but 2*N dimensional, where N is the number of points in the gesture. Otherwise it's the same familiar x1*x2 + y1*y2 +… (Professional statisticians call this formula correlation; now you know that's the good old dot product in disguise)

So how exactly do you use dot-product to compare shapes? You simply unroll each gesture into an array of numbers and manipulate these arrays as if they were familiar geometrical vectors. (What if the arrays turn out to have different dimensions? Well, then you know it is not a good match, for differing dimensions mean different number of points, and a different number of points means different number of strokes in corresponding gestures, which allows us to reject the match right away. This is why we made each stroke precisely 32 points long.)

float GestureDotProduct(const CGesture& gesture1, const CGesture& gesture2)
{
   if ( gesture1.size() != gesture2.size() ) return -1;
 
   CGesture::const_iterator i1;
   CGesture::const_iterator i2;
   float dotProduct = 0.0f;
 
   for ( i1 = gesture1.begin(), i2 = gesture2.begin();
         i1 != gesture1.end() && i2 != gesture2.end();
         ++i1, ++i2 )
   {
      const CStroke& stroke1 = *i1;
      const CStroke& stroke2 = *i2;
      if ( stroke1.size() != stroke2.size() ) return -1;
 
      CStroke::const_iterator j1;
      CStroke::const_iterator j2;
      for ( j1 = stroke1.begin(), j2 = stroke2.begin();
            j1 != stroke1.end() && j2 != stroke2.end();
            ++j1, ++j2 )
      {
         const CPoint2D& pt1 = *j1;
         const CPoint2D& pt2 = *j2;
         dotProduct += pt1.x * pt2.x + pt1.y * pt2.y;
      }
   }
 
   return dotProduct;
}
 
float Match(const CGesture& gesture1, const CGesture& gesture2 )
{
   float score = GestureDotProduct(gesture1,gesture2);
   if ( score <= 0.0f ) return 0.0f; //No match for sure
   //at this point our gesture-vectors are not quite normalized
   yet - their dot product with themselves is not 1.
 
   //we normalize the score itself
 
   //this is basically a version of a famous formula for a cosine of the 
   //angle between 2 vectors:
   //cos a = (u.v) / (sqrt(u.u) * sqrt(v.v)) = (u.v) / sqrt((u.u) * (v.v))
   score /= sqrtf(GestureDotProduct(gesture1, gesture1) *
                  GestureDotProduct(gesture2, gesture2));
   return score;
}

Forums:

프로그래밍 QnA

댓글 달기

벡터의 내적

글쓴이: panda005 / 작성시간: 화, 2008/01/08 - 4:26오후

벡터의 내적은 일명 두 벡터의 스칼라 곱이라고도 불리는 데
그 정의는 다음과 같습니다.

AㆍB = a1*b1 + a2*b2 + ... + an*bn = |A||B|cosθ

따라서 cosθ = (a1*b1 + a2*b2 + ... + an*bn) / |A|*|B|
이며 θ를 구할 수 있습니다.

이는 일반적인 벡터 연산에서 코사인 유사도라고 불리기도 하며, 여기저기 쓰이고 있답니다.

참고: http://ko.wikipedia.org/wiki/%EB%82%B4%EC%A0%81

답글

답변 감사합니다!

글쓴이: duraduri / 작성시간: 화, 2008/01/08 - 5:44오후

답변 감사합니다. 혹시 좌표 32개가 왜 32차원 벡터처럼 계산 되는지좀 알수 있을까요?

이렇게 왜 연결이 되는지 모르겠습니다....

답글

n 차원 벡터라는 게 n개의 축에 해당하는 좌표값을 갖는 벡터입니다.

글쓴이: panda005 / 작성시간: 수, 2008/01/09 - 11:03오전

n 차원 벡터라는 게 n개의 축에 해당하는 좌표값을 갖는 벡터입니다.

3차원 공간에서 사는 우리로써는,
1차원/2차원/3차원 벡터까지는 그림으로 그려서 이해할 수 있지만,
4차원 이상부터는 그림으로 그릴 수가 없지요.
하지만 그림으로 그리지 않는다면 아주 쉽습니다.

1차원벡터의 예(4)
2차원벡터의 예(4,3)
3차원벡터의 예(4,3,6)
4차원벡터의 예(4,3,6,2)
...

뭐 이런 식으로 죽죽 늘어나겠죠.

제가 마우스 제스쳐 인식에 대해서는 잘 몰라서 섣불리 말씀드리긴 뭐하지만,
이 문서에서 벡터라고 하는 것이,
어떤 공간적인 정보를 담아내는 것이 아니고
하나의 마우스 제스쳐를 32가지 특성으로 표현하는 것이라고 이해하시면 될 것 같습니다.

답글

감사합니다.

글쓴이: duraduri / 작성시간: 목, 2008/01/10 - 12:34오전

덕분에 이해가 되었습니다. 정말 감사합니다!

답글

댓글 달기

이름

제목

댓글 *

텍스트 포맷에 대한 자세한 정보

텍스트 양식

Filtered HTML

텍스트에 BBCode 태그를 사용할 수 있습니다. URL은 자동으로 링크 됩니다.
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param><hr>
다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.

BBCode

텍스트에 BBCode 태그를 사용할 수 있습니다. URL은 자동으로 링크 됩니다.
다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param>
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.

Textile

다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
You can use Textile markup to format text.
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param><hr>

Markdown

다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
Quick Tips:
- Two or more spaces at a line's end = Line break
- Double returns = Paragraph
- *Single asterisks* or _single underscores_ = Emphasis
- **Double** or __double__ = Strong
- This is [a link](http://the.link.example.com "The optional title text")
For complete details on the Markdown syntax, see the Markdown documentation and Markdown Extra documentation for tables, footnotes, and more.
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param><hr>

Plain text

HTML 태그를 사용할 수 없습니다.
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.
줄과 단락은 자동으로 분리됩니다.

CAPTCHA

이것은 자동으로 스팸을 올리는 것을 막기 위해서 제공됩니다.

부 메뉴

[완료]마우스 제스쳐 인식과 내적에 대한 질문입니다.

벡터의 내적

답변 감사합니다!

n 차원 벡터라는 게 n개의 축에 해당하는 좌표값을 갖는 벡터입니다.

감사합니다.

댓글 달기

Filtered HTML

BBCode

Textile

Markdown

Plain text

주 메뉴

둘러보기

부 메뉴

현재 위치

[완료]마우스 제스쳐 인식과 내적에 대한 질문입니다.

벡터의 내적

답변 감사합니다!

n 차원 벡터라는 게 n개의 축에 해당하는 좌표값을 갖는 벡터입니다.

감사합니다.

댓글 달기

Filtered HTML

BBCode

Textile

Markdown

Plain text

주 메뉴

검색 폼

둘러보기

사용자 로그인

Oauth2 Login :