Introduction to Compiler Principles: What Is a Compiler?

Valonia · Updated: 2024-11-10

What is a compiler?

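A compiler translates the code we write into a form the computer can understand and execute. The high-level languages we write in fall roughly into two kinds:

Interpreted: languages like Python, where each statement can be executed as soon as it is written;

Compiled: languages like C, where the source must first be compiled into an executable file (for example, an .exe on Windows) before it can run.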
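To make the two models concrete, here is a toy sketch in Python. It is an illustration under stated assumptions, not how real tools work: tokens must be separated by spaces, the "machine" is a hypothetical stack machine rather than real machine code, and the names `interpret`, `compile_to_instructions` and `run` are invented for this example.

```python
# Toy illustration only: the target is a hypothetical stack machine,
# not real machine code. Tokens are assumed to be space-separated.

def interpret(source):
    """Interpreter style: compute the result directly while reading the source."""
    parts = source.split()
    result = int(parts[0])
    for op, operand in zip(parts[1::2], parts[2::2]):
        if op == '+':
            result += int(operand)
        elif op == '-':
            result -= int(operand)
        else:
            raise ValueError('Invalid syntax')
    return result


def compile_to_instructions(source):
    """Compiler style, step 1: translate the source into instructions."""
    parts = source.split()
    program = [('PUSH', int(parts[0]))]
    for op, operand in zip(parts[1::2], parts[2::2]):
        if op not in ('+', '-'):
            raise ValueError('Invalid syntax')
        program.append(('PUSH', int(operand)))
        program.append(('ADD',) if op == '+' else ('SUB',))
    return program


def run(program):
    """Compiler style, step 2: execute the previously translated instructions."""
    stack = []
    for instruction in program:
        if instruction[0] == 'PUSH':
            stack.append(instruction[1])
        else:
            # The right operand was pushed last, so it is popped first.
            right, left = stack.pop(), stack.pop()
            stack.append(left + right if instruction[0] == 'ADD' else left - right)
    return stack.pop()


source = '12 - 5 + 3'
print(interpret(source))                   # 10
program = compile_to_instructions(source)
print(program)
print(run(program))                        # 10
```

Both paths give the same answer. The difference is that the second one produces a translated program that can be stored and run again without looking at the source text.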

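Accordingly, the programs that process source code also come in two kinds:

Interpreter: does not translate the source code into machine language ahead of time; it processes and runs the source code directly.

Compiler: translates the source code into machine language, which is then run. Compiled programs generally run faster than the same programs under an interpreter.

Compiler example

Here is a minimal example, strictly speaking an interpreter: it accepts only non-negative integers and the operators "+" and "-", performs the addition and subtraction by interpreting the input directly, and prints the result.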

# Token types
#
# EOF (end-of-file) token is used to indicate that
# there is no more input left for lexical analysis
INTEGER, PLUS, MINUS, EOF = 'INTEGER', 'PLUS', 'MINUS', 'EOF'


class Token(object):
    def __init__(self, type, value):
        # token type: INTEGER, PLUS, MINUS, or EOF
        self.type = type
        # token value: non-negative integer value, '+', '-', or None
        self.value = value

    def __str__(self):
        """String representation of the class instance.

        Examples:
            Token(INTEGER, 3)
            Token(PLUS, '+')
        """
        return 'Token({type}, {value})'.format(
            type=self.type,
            value=repr(self.value)
        )

    def __repr__(self):
        return self.__str__()


class Interpreter(object):
    def __init__(self, text):
        # client string input, e.g. "3 + 5", "12 - 5 + 3", etc
        # (must be non-empty; main() skips empty lines)
        self.text = text
        # self.pos is an index into self.text
        self.pos = 0
        # current token instance
        self.current_token = None
        self.current_char = self.text[self.pos]

    ##########################################################
    # Lexer code                                             #
    ##########################################################
    def error(self):
        raise Exception('Invalid syntax')

    def advance(self):
        """Advance the `pos` pointer and set the `current_char` variable."""
        self.pos += 1
        if self.pos > len(self.text) - 1:
            self.current_char = None  # Indicates end of input
        else:
            self.current_char = self.text[self.pos]

    def skip_whitespace(self):
        while self.current_char is not None and self.current_char.isspace():
            self.advance()

    def integer(self):
        """Return a (multidigit) integer consumed from the input."""
        result = ''
        while self.current_char is not None and self.current_char.isdigit():
            result += self.current_char
            self.advance()
        return int(result)

    def get_next_token(self):
        """Lexical analyzer (also known as scanner or tokenizer)

        This method is responsible for breaking a sentence
        apart into tokens. One token at a time.
        """
        while self.current_char is not None:
            if self.current_char.isspace():
                self.skip_whitespace()
                continue
            if self.current_char.isdigit():
                return Token(INTEGER, self.integer())
            if self.current_char == '+':
                self.advance()
                return Token(PLUS, '+')
            if self.current_char == '-':
                self.advance()
                return Token(MINUS, '-')
            self.error()
        return Token(EOF, None)

    ##########################################################
    # Parser / Interpreter code                              #
    ##########################################################
    def eat(self, token_type):
        # compare the current token type with the passed token
        # type and if they match then "eat" the current token
        # and assign the next token to the self.current_token,
        # otherwise raise an exception.
        if self.current_token.type == token_type:
            self.current_token = self.get_next_token()
        else:
            self.error()

    def term(self):
        """Return an INTEGER token value."""
        token = self.current_token
        self.eat(INTEGER)
        return token.value

    def expr(self):
        """Arithmetic expression parser / interpreter."""
        # set current token to the first token taken from the input
        self.current_token = self.get_next_token()
        result = self.term()
        while self.current_token.type in (PLUS, MINUS):
            token = self.current_token
            if token.type == PLUS:
                self.eat(PLUS)
                result = result + self.term()
            elif token.type == MINUS:
                self.eat(MINUS)
                result = result - self.term()
        return result


def main():
    while True:
        try:
            # input() is the Python 3 call; under Python 2,
            # use raw_input() instead
            text = input('calc> ')
        except EOFError:
            break
        if not text:
            continue
        interpreter = Interpreter(text)
        result = interpreter.expr()
        print(result)


if __name__ == '__main__':
    main()

Sample run:
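The original screenshot of the session is not preserved. As a stand-in, the sketch below evaluates a few inputs non-interactively and prints them in the same `calc> ` style. It is a compact re-implementation of the same idea and not the listing above: it swaps the character-by-character lexer for a regular expression, it is stricter about malformed input, and the names `TOKEN_PATTERN`, `tokenize` and `evaluate` are hypothetical. The three inputs are chosen for illustration, not taken from the original article.

```python
import re

# Hypothetical helper: one token is optional whitespace followed by
# either a run of digits or a single '+' / '-'.
TOKEN_PATTERN = re.compile(r'\s*(\d+|[+-])')


def tokenize(text):
    """Split the input into a list of token strings, e.g. ['12', '-', '5']."""
    text = text.rstrip()
    tokens = []
    pos = 0
    while pos < len(text):
        match = TOKEN_PATTERN.match(text, pos)
        if match is None:
            raise ValueError('Invalid syntax')
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def evaluate(text):
    """Evaluate INTEGER ((PLUS | MINUS) INTEGER)* from left to right."""
    tokens = tokenize(text)
    # A valid expression has an odd number of tokens and starts with an integer.
    if len(tokens) % 2 == 0 or not tokens[0].isdigit():
        raise ValueError('Invalid syntax')
    result = int(tokens[0])
    for operator, operand in zip(tokens[1::2], tokens[2::2]):
        if operator not in ('+', '-') or not operand.isdigit():
            raise ValueError('Invalid syntax')
        if operator == '+':
            result += int(operand)
        else:
            result -= int(operand)
    return result


for line in ('3 + 5', '12 - 5 + 3', '7-10'):
    print('calc>', line)
    print(evaluate(line))
```

For these three inputs the sketch prints 8, 10 and -3, which are the same results the interpreter listed earlier returns for them.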
References:
https://ruslanspivak.com/lsbasi-part1/

University of Science and Technology of China, "Compiler Principles" course, Hua Baojian

Author: 青花磁盘
