C언어 토큰 분리법에 대해서 질문이~

글쓴이: feelsocrazy / 작성시간: 일, 2008/04/06 - 2:23오후

[token] [token] [token [token] token]

이 것을

1. token
2. token
3. token token token

이렇게 구분하고 싶은대요...

strtok로 쓰니깐

1. token
2. token
3. token
4. token
5. token

이렇게 되어버리내요....
3개로 구분하는 방법이 없을까요??

Forums:

프로그래밍 QnA

댓글 달기

파서를

글쓴이: mithrandir / 작성시간: 일, 2008/04/06 - 2:30오후

파서를 공부해보세요.

언제나 삽질 - http://tisphie.net/typo/

언제나 삽질 - http://tisphie.net/typo/
프로그래밍 언어 개발 - http://langdev.net

답글

저런 걸 파싱하려면

글쓴이: gamdora / 작성시간: 수, 2008/04/09 - 5:58오후

저런 걸 파싱하려면 context free parser가 필요한가요?

답글

요구사항으로 봐서는

글쓴이: IsExist / 작성시간: 월, 2008/04/07 - 11:52오전

요구사항으로 봐서는 단순한 토큰 파싱이 아닌 '[', ']' 로 depth 구별을
해야 할 것 같네요.

간략하게 의미적으로 코딩해 본다면 아래처럼.

depth = 0;
brace_open = 0;
clear_token_buffer();
while (1) {
    ch = getc (stdin);
    if (ch == EOF) break;
 
    if (ch == '[') {
        depth++;
        if (brace_open > 0) {
            push_token_buffer(); // token_buffer에 있는 내용을 스택에 넣는다.
            clear_token_buffer();// token_buffer를 초기화
        }
        brace_open++;
    }
    else if (ch == ']' && brace_open > 0) {
        push_token_buffer();
        clear_token_buffer();
        depth--;
        brace_open--;
        if (depth == 0) pop_all_token(); // 스택에 있는 모든 token을 꺼낸다.
    }
    else if (brace_open > 0 && isprint(ch)) {
        addto_token_buffer(ch); // token_buffer에 ch 문자 연접
    }
    else {
        /* handling unexpected input */
    }
}
if (depth != 0 || brace_open != 0) {
    /* print warning of bad input format */
}

(표준입력 가정,
---------
간디가 말한 우리를 파괴시키는 7가지 요소

첫째, 노동 없는 부(富)/둘째, 양심 없는 쾌락
셋째, 인격 없는 지! 식/넷째, 윤리 없는 비지니스

이익추구를 위해서라면..

다섯째, 인성(人性)없는 과학
여섯째, 희생 없는 종교/일곱째, 신념 없는 정치

---------
간디가 말한 우리를 파괴시키는 7가지 요소

첫째, 노동 없는 부(富)/둘째, 양심 없는 쾌락
셋째, 인격 없는 지! 식/넷째, 윤리 없는 비지니스

이익추구를 위해서라면..

다섯째, 인성(人性)없는 과학
여섯째, 희생 없는 종교/일곱째, 신념 없는 정치

답글

regex 강추!!

글쓴이: 오호라 / 작성시간: 목, 2008/04/10 - 12:53오전

IsExist님이 말씀하신 것처럼 strtok, strtok_r 등을 이용해섯 코딩상에서 depth 에 대한 처리를 해줄수 있겠고요.

아니면 정규식을 쓰는거죠.

고정된 포맷이면 전자가 좋겠고, 포맷이 자주 바뀐다면 정규식을 이용하시는게 코드수정이 용이합니다.

> man regex

ps. strtork()는 COW ( copy on write )

Hello World.

답글

댓글 달기

이름

제목

댓글 *

텍스트 포맷에 대한 자세한 정보

텍스트 양식

Filtered HTML

텍스트에 BBCode 태그를 사용할 수 있습니다. URL은 자동으로 링크 됩니다.
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param><hr>
다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.

BBCode

텍스트에 BBCode 태그를 사용할 수 있습니다. URL은 자동으로 링크 됩니다.
다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param>
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.

Textile

다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
You can use Textile markup to format text.
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param><hr>

Markdown

다음 태그를 이용하여 소스 코드 구문 강조를 할 수 있습니다: <code>, <blockcode>, <apache>, <applescript>, <autoconf>, <awk>, <bash>, <c>, <cpp>, <css>, <diff>, <drupal5>, <drupal6>, <gdb>, <html>, <html5>, <java>, <javascript>, <ldif>, <lua>, <make>, <mysql>, <perl>, <perl6>, <php>, <pgsql>, <proftpd>, <python>, <reg>, <spec>, <ruby>. 지원하는 태그 형식: <foo>, [foo].
Quick Tips:
- Two or more spaces at a line's end = Line break
- Double returns = Paragraph
- *Single asterisks* or _single underscores_ = Emphasis
- **Double** or __double__ = Strong
- This is [a link](http://the.link.example.com "The optional title text")
For complete details on the Markdown syntax, see the Markdown documentation and Markdown Extra documentation for tables, footnotes, and more.
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.
사용할 수 있는 HTML 태그: <p><div><span><br><a><em><strong><del><ins><b><i><u><s><pre><code><cite><blockquote><ul><ol><li><dl><dt><dd><table><tr><td><th><thead><tbody><h1><h2><h3><h4><h5><h6><img><embed><object><param><hr>

Plain text

HTML 태그를 사용할 수 없습니다.
web 주소와/이메일 주소를 클릭할 수 있는 링크로 자동으로 바꿉니다.
줄과 단락은 자동으로 분리됩니다.

CAPTCHA

이것은 자동으로 스팸을 올리는 것을 막기 위해서 제공됩니다.

부 메뉴

C언어 토큰 분리법에 대해서 질문이~

파서를

저런 걸 파싱하려면

요구사항으로 봐서는

regex 강추!!

댓글 달기

Filtered HTML

BBCode

Textile

Markdown

Plain text

주 메뉴

둘러보기

부 메뉴

현재 위치

C언어 토큰 분리법에 대해서 질문이~

파서를

저런 걸 파싱하려면

요구사항으로 봐서는

regex 강추!!

댓글 달기

Filtered HTML

BBCode

Textile

Markdown

Plain text

주 메뉴

검색 폼

둘러보기

사용자 로그인

Oauth2 Login :