先上一段代码示例：

; Hello World Program (Calculating string length)
; Compile with: nasm -f elf helloworld-len.asm
; Link with (64 bit systems require elf_i386 option): ld -m elf_i386 helloworld-len.o -o helloworld-len
; Run with: ./helloworld-len
SECTION .data
msg     db      'Hello, brave new world!', 0Ah ; we can modify this now without having to update anywhere else in the program
SECTION .text
global  _start
_start:
    mov     ebx, msg        ; move the address of our message string into EBX
    mov     eax, ebx        ; move the address in EBX into EAX as well (Both now point to the same segment in memory)
nextchar:
    cmp     byte [eax], 0   ; compare the byte pointed to by EAX at this address against zero (Zero is an end of string delimiter)
    jz      finished        ; jump (if the zero flagged has been set) to the point in the code labeled 'finished'
    inc     eax             ; increment the address in EAX by one byte (if the zero flagged has NOT been set)
    jmp     nextchar        ; jump to the point in the code labeled 'nextchar'
finished:
    sub     eax, ebx        ; subtract the address in EBX from the address in EAX
                            ; remember both registers started pointing to the same address (see line 15)
                            ; but EAX has been incremented one byte for each character in the message string
                            ; when you subtract one memory address from another of the same type
                            ; the result is number of segments between them - in this case the number of bytes
    mov     edx, eax        ; EAX now equals the number of bytes in our string
    mov     ecx, msg        ; the rest of the code should be familiar now
    mov     ebx, 1
    mov     eax, 4
    int     80h
    mov     ebx, 0
    mov     eax, 1
    int     80h

最基础的字符串输出

首先复习一下单位概念：
1个字节(byte) = 8位(bit)。
位是内存最小单位，只能由0或1表示。
字节是表示整数、地址、字符的单位。
c语言中的char就占用1个字节

以下是汇编”hello,world”的编写方式：

section    .data
msg        db        "hello,world",0d10
section .text
global  _start
_start:
        mov        rax,    0d1    ; 向rax传递syscall代码1，表示调用函数sys_write
        mov        rdi,    1    ; 第一个参数，1表示stdout
        mov        rsi,    msg    ; 第二个参数，字符串内存地址，表示指针
        mov        rdx,    0d12; 第三个参数，12表示字符串字节长度(0d10为1个字节长度)
        syscall                ; 执行sys_write函数
        mov        rax,    0d60; 向rax传递syscall代码60,表示调用函数sys_exit
        mov        rdi,    0   ; 第一个参数，0表示no error
        syscall                ; 执行sys_exit函数

只用mov指令及db和syscall两个伪指令，就能实现打印输出，是不是很酷？

改进1：跳转后打印

如同我们在C语言中定义了一个print()函数，然后在main()函数中调用该函数打印一样，在汇编中，也可以使用不同的label(标签)进行跳转，而不必从上到下按顺序执行。

需要用到得新指令为：

cmp：compare(比较)指令，用于操作数之间的减法，但不保存结果，只根据结果设置相关的条件标志位（SF、ZF、CF、OF）。cmp指令后往往跟着条件转移指令，实现根据比较的结果产生不同的程序分支的功能。另外有一个sub指令处理减法，结果将保存在寄存器。
jz：jump if zero指令，当标志寄存器中的ZF标志为1就跳转，一般与cmp连用，用以判断两数是否相等。

示例：

section .data
msg     db      "jzjump,ok",0d10
section .text
global _start
_start:
        mov     r8,     0d2         ; r8 = 2
        cmp     r8,     0d2         ; r8 - 2 = 0,此时设置ZF=1
        jz         otherlable             ; 零标志为1,跳转到otherlable标签，有些像执行代码快
otherlable:
        mov     rax,    0d1
        mov     rdi,    0d1
        mov     rsi,    msg
        mov     rdx,    0d10
        syscall
        mov     rax,    0d60        ; 向rax传递syscall代码60,表示调用函数sys_exit
        mov     rdi,    0           ; 第一个参数，0表示no error
        syscall                        ; 执行sys_exit函数

改进2：计算字符串长度
sys_write要求我们向它传递一个指向我们要在内存中输出的字符串的指针以及我们要打印的字节长度。但是一旦修改字符串，就必须修改传递给sys_write 的字节长度，否则会出现几种情况：

传递的字节长度少于实际字符串的长度，只打印字节长度的字符串，即只能打印一部分。
传递的字节长度多于实际字符串的长度，会有一些空字节，浪费内存空间。
如何不小心填成其他类型数据，会造成程序出错。

因此需要类似length()一样的方式计算字符串长度，内核没有提供这种函数，只能从指令上考虑，我们可以这样做：
可以知道字符串的指针首地址，知道字符串结束后的下一个地址，两个地址相减，得字符串长度。

需要用到的新指令：
inc：increase，自增指令，只有一个操作数，寄存器操作数中的数据内容加1，例如一个操作数为5，执行inc之后，操作数为6。不是地址加1，是内容加1。
jmp：jump，无条件跳转指令，可转到内存中任何程序段。转移地址可在指令中给出，也可以在寄存器中给出，或在储存器中指出。jz和jnz是有跳转跳转的指令，即只有满足某些条件时执行这些指令才会跳转。
sub：subtract减法指令，结果保存在寄存器操作数中

section .data
msg     dward      
section .text
global _start
_start:
        mov     r8,     0d2         ; r8 = 2
        inc     r8                    ; r8 = 3
        jmp     otherlable             ; 零标志为1,跳转到otherlable标签，有些像执行代码快
otherlable:
        mov     rax,    r8
        mov     rdi,    0d1
        mov     rsi,    msg
        mov     rdx,    0d10
        syscall
        mov     rax,    0d60        ; 向rax传递syscall代码60,表示调用函数sys_exit
        mov     rdi,    0           ; 第一个参数，0表示no error
        syscall                        ; 执行sys_exit函数