Rss订阅

首页 »算法 » crc32算法:CRC32算法实现原理 »正文

crc32算法:CRC32算法实现原理

来源: 发布时间:星期四, 2009年2月12日浏览:174次评论:0

简而言的

CRC是

个数值

该数值被用于校验数据

正确性

CRC数值简单地说就是通过让你需要做处理

数据除以

个常数而得到

余数

当你得到这个数值后你可以将这个数值附加到你

数据后

当数据被传送到其他地方后

取出原始数据(可能在传送过程中被破坏)和附加

CRC数值

然后将这里

原始数据除以的前那个常数(约定好

)然后得到新

CRC值

比较两个CRC值是否相等即可确认你

数据是否在传送过程中出现

那么

如何让你

数据除以

个常数？思路方法是对你

数据进行必要

编码处理

逐字节处理成数字

那么这个常数是什么？你不必关注它是什么

也不需要关注它是如何获得

当你真

要动手写

个CRC

实现算法时

我可以告诉你

CRC

理论学家会告诉你

区别长度

常数对应着区别

CRC实现算法

当这个常数为32位时

也就是这里所说

CRC32

以上内容你不必全部理解

你需要查阅其他资料来获取CRC完整

理论介绍

The mathematics behind CRC ?

很多教科书会把CRC和多项式关联起来

这里

多项式指

是系数为0或1

式子

例如:a0 + a1*x + a2*x^2 + ... + an*x^n

其中a0, a1, ..., an要么为0要么为1

我们并不关注x取什么值

(如果你要关注

你可以简单地认为x为2) 这里把a0, a1, ..., an

值取出来排列起来

就可以表示比特流

例如 1 + x + x^3所表示

比特流就为:1101

部分资料会将这个顺序颠倒

这个很正常

什么是生成多项式？

所谓

生成多项式

就是上面我所说

常数

注意

在这里

个多项式就表示了

个比特流

也就是

堆1、0

组合起来最终就是

个数值

例如CRC32算法中

这个生成多项式为:c(x) = 1 + x + x^2 + x^4 + x^5 + x^7 + x^8 + x^10 + x^11 + x^12 + x^16 + x^22 + x^23 + x^26 + x^32

其对应

数字就为:11101101101110001000001100100000(x^32在实际计算时隐含给出

因此这里没有包含它

系数)

也就是0xEDB88320(多项式对应

数字可能颠倒

颠倒后得到

是0x04C11DB7

其实也是正确

)

由此可以看出

CRC值也可以看成我们

数据除以

个生成多项式而得到

余数

如何做这个除法？

套用大部分教科书给出

计算思路方法

任何数据都可以被处理成纯数字

因此

在某种程度上说

我们可以直接开始这个除法

尽管事实上这并不是标准

除法

例如

我们

数据为1101011011(方便起见我直接给 2进制表示了

从这里也可以看出

CRC是按bit进行计算

)

给定

生成多项式(对应

值)为10011

通常

教科书会告诉我们在进行这个除法前

会把我们

数据左移几位(生成多项式位数-1位)

从而可以容纳将来计算得到

CRC值(我上面所说

将CRC值附加到原始数据后)

但是为什么要这样做？我也不知道

(不知道

东西不能含糊过)那么

除法就为:
1100001010
_______________
10011 ) 11010110110000 附加了几个零

新数据
10011......... 这里

减法(希望你不至于忘掉小学算术)是

个异或操作
-----.........
10011........
10011........
-----........
00001....... 逐bit计算
00000.......
-----.......
00010......
00000......
-----......
00101.....
00000.....
-----.....
01011....
00000....
-----....
10110...
10011...
-----...
01010..
00000..
-----..
10100.
10011.
-----.
01110
00000
-----
1110 = 这个余数也就是所谓

CRC值

通常又被称为校验值

希望进行到这里

你可以获取更多有关CRC

感性认识

而我们所要做

也就是实现

个CRC

计算算法

说白了

就是提供

个

给定

段数据

以及

个生成多项式(对于CRC32算法而言该值固定)

然后计算得出上面

1110余数

The simplest algorithm.

最简单

实现算法

是

种模拟算法

我们模拟上面

除法过程

遵从网上

份比较全面

资料

我们设定

个变量register

我们逐bit地将我们

数据放到register中

然后判断register最高位是否为1

如果是则和生成多项式异或操作

否则继续处理

这个过程简单地模拟了上述除法过程:

/**////
/// The simplest CRC implement algorithm.
///
/**//*
Load the register with zero bits.
Augment the message by appending W zero bits to the end of it.
While (more message bits)
Begin
Sh

t the register left by _disibledevent=>End
The register now contains the re

der.
*/

#

<stdio.h>

#

POLY 0x13

{
/**//// the data
unsigned

data = 0x035b;
/**//// load the register with zero bits
unsigned

regi = 0x0000;
/**//// augment the data by appending W(4) zero bits to the end of it.
data <<= 4;

/**//// we do it bit after bit
for(

cur_bit = 15; cur_bit >= 0; -- cur_bit )
{
/**//// test the highest bit which will be poped later.
/// in fact, the 5th bit from right is the hightest bit here

( ( ( regi >> 4 ) & 0x0001 )

0x1 )
{
regi = regi ^ POLY;
}
/**//// sh

t the register
regi <<= 1;
/**//// reading the next bit of the augmented data
unsigned

tmp = ( data >> cur_bit ) & 0x0001;
regi |= tmp;

}

/**//// and now, register contains the re

der which is also called CRC value.

0;
}

better algorithm ?

很多时候这种让人容易理解

算法都不会被实际用到

这种逐bit操作

算法实在很慢

你可能知道

般

CRC32算法都是

种基于表(table-driven)

算法

但是你可能不知道这个表是如何来

种改善这种bit after bit

思路方法就是将这个bit扩大

例如典型

做法就是换成

这里我要详细地叙述下上面那种算法

过程:

我们每次会先检查register

最高位是否为1

如果为1

则将生成多项式(所谓

Poly)和register进行异或操作

然后

将register左移

位

也就舍弃了最高位

然后将我们

数据拿

bit出来放到register

最低位

也就是说

register中

某

位

值会决定后面几位

值

如果将register最高字节每

bit编码为:t7 t6 t5 t4 t3 t2 t1 t0

那么

t7会决定t6-t0

值(如果为1)

t6会决定t5-t0

值

依次类推

但是

无论谁决定谁

值

当上面那个算法迭代

个字节后(8bits)

t7-t0都会被丢弃(whatever you do)

唯

留下来

东西

就是对这个字节以后字节

影响

那么

如果我们可以直接获取这个影响

我们就可以

after

地处理

而不是bit after bit

如何获取这个影响呢？这个影响又是什么呢？这个影响就对应着我们

table-driven CRC算法中

表元素！

但是

为什么我们逐bit进行计算

过程为什么可以简化为

步操作？事实上

我们没有简化这个操作

种用于教学

算法

是实时地计算这个影响值:

/**////
/// The table-driven CRC implement algorithm part 1.
///
/**//*
While (augmented message is not exhausted)
Begin
Examine the top

of the register
Calculate the control

from the top

of the register
Sum all the Polys at various off

s that are to be XORed

o
the register in accordance with the control

t the register left by _disibledevent=>/**//// load the register with the data
unsigned long regi = 0;
/**//// allocate memory to contain the AUGMENTED data (added some zeros)
unsigned char p[8];
/**//// copy data
mem

( p, 0, 8 );
memcpy( p, &data, 4 );

/**//// because data contains 4

s
for(

i = 0; i < 8;

i )
{
/**//// get the top

of the register
unsigned char top_

= (unsigned char)( ( regi >> 24 ) & 0xff );
/**//// sum all the polys at various off

s
unsigned long sum_poly = top_

<< 24;
for(

j = 0; j < 8;

j )
{
/**//// check the top bit

( ( sum_poly >> 31 ) != 0 )
{
/**//// TODO : understand why '<<' first
sum_poly = ( sum_poly << 1 ) ^ POLY;
}

{
sum_poly <<= 1;
}
}
/**//// sh

t the register left by _disibledevent=>/**//// xor the summed polys to the register
regi ^= sum_poly;
}

/**//// and now, register contains the re

der which is also called CRC value.

0;
}

其中:

/**//// sum all the polys at various off

s
unsigned long sum_poly = top_

<< 24;
for(

j = 0; j < 8;

j )
{
/**//// check the top bit

( ( sum_poly >> 31 ) != 0 )
{
/**//// TODO : understand why '<<' first
sum_poly = ( sum_poly << 1 ) ^ POLY;
}

{
sum_poly <<= 1;
}
}

就是用于计算这个影响值

事实上

table-driven CRC算法中

那个表就是通过这段代码生成

(排除其他

些细节)

你可能并不是很理解

这里我建议你忽略各种细节(更多

细节见参考资料)

你所需要知道

是

我们将8次逐bit

操作合并到了

次

操作中

而这个

操作

就是8次bit操作

合操作(上面提到

影响值)

这个

操作其实就是

个数值

也就是table-driven CRC算法中那个表

个元素

区别序列

bit操作其实对应着区别

unsigned char值

因此那个table有256个元素

show me where the table is :

如上所说

上面

算法很容易地就可以引进

个表:

进

步简化:

上述算法

个典型特征是会在我们

数据后面添加若干0

这样做其他做了很多没用

计算

种简化做法就是将这些没用

计算合并到其他计算中

其实这都是

些位操作

窍门技巧:

/**////
/// The table-driven CRC implement algorithm part 2.
///
/**//*
While (augmented message is not exhausted)
Begin
Examine the top

of the register
Calculate the control

from the top

of the register
Sum all the Polys at various off

s that are to be XORed

o
the register in accordance with the control

t the register left by _disibledevent=>for(

j = 0; j < 8;

j )
{
/**//// check the top bit

( ( sum_poly >> 31 ) != 0 )
{
/**//// TODO : understand why '<<' first
sum_poly = ( sum_poly << 1 ) ^ POLY;
}

{
sum_poly <<= 1;
}
}

sum_poly;
}

void create_table( unsigned long *table )
{
for(

i = 0; i < 256;

i )
{
table[i] = get_sum_poly( (unsigned char) i );
}
}

{
/**//// the data
unsigned long data = 0x1011035b;
/**//// load the register with the data
unsigned long regi = 0;
/**//// allocate memory to contain the AUGMENTED data (added some zeros)
unsigned char p[8];
/**//// copy data
mem

( p, 0, 8 );
memcpy( p, &data, 4 );

/**//// the table
unsigned long table[256];
/**//// create the table
create_table( table );

/**//// because data contains 4

s
for(

i = 0; i < 8;

i )
{
/**//// get the top

of the register
unsigned char top_

= (unsigned char)( ( regi >> 24 ) & 0xff );
/**//// sh

t the register left by _disibledevent=>/**//// xor the summed polys to the register
regi ^= table[top_

];
}

/**//// and now, register contains the re

der which is also called CRC value.

0;
}

讨厌

附加0

以上算法有个很大

特征就是要为我们

数据附加很多0

附加0后其实也附加了很多无用

操作

我们要将这些讨厌

0去掉:

{
/**//// the data
unsigned long data = 0x1011035b;
/**//// load the register with the data
unsigned long regi = 0;
/**//// allocate memory to contain the data
unsigned char p[4];
/**//// copy data
memcpy( p, &data, 4 );

/**//// the table
unsigned long table[256];
/**//// create the table
create_table( table );

/**//// because data contains 4

s
for(

i = 0; i < 4;

i )
{
regi = ( regi << 8 ) ^ table[ ( regi >> 24 ) ^ p[i] ];
}

/**//// and now, register contains the re

der which is also called CRC value.

0;
}

关键

句regi = ( regi << 8 ) ^ table[ ( regi >> 24 ) ^ p[i] ]; 简化了很多没用

操作

In practice :

似乎

切被我说

很简单

我想只是

我没说清楚

我尽量让你注意到事情

重点

我们进行到这里

似乎我们立马就可以写出自己

CRC32算法并用于实战

但是你很快就会发现

事情并不如你想像

那么简单

在实际处理时

很多数据

bit会进行

种颠倒操作

例如1010会被颠倒为0101

出现这样

情况是

某些硬件在实现CRC算法时

采用了这种(丑陋

)习惯

有些软件Software实现CRC算法时

也延用了这个习惯

另外

有关register

值问题

有些CRC算法会

化为0xffffffff

以下给出

个会进行bit颠倒

算法

该算法可以直接输出table-driven中

表:

/**////
/// The table-driven CRC implement algorithm part 4.
///
/// Donot need augment W/8 zero

s.
///
#

<stdio.h>
#

<stdlib.h>
#

<memory.h>

#

POLY 0x04C11DB7L

#

BITMASK(X) (1L << (X))

unsigned long refelect( unsigned long v,

b )
{

i;
unsigned long t = v;
for( i = 0; i < b;

i )
{

( t & 1L )
v |= BITMASK( (b-1)-i );

v &= ~BITMASK( (b-1)-i );
t >>= 1;
}

v;
}

/**//// i'll try to write a correct algorithm
unsigned long get_sum_poly( unsigned char

)
{

= (unsigned long) refelect(

, 8 );
unsigned long sum_poly =

<< 24;

for(

i = 0; i < 8;

i )
{
/**//// check the top bit

( ( sum_poly >> 31 ) != 0 )
{
/**//// TODO : understand why '<<' first
sum_poly = ( sum_poly << 1 ) ^ POLY;
}

{
sum_poly <<= 1;
}
}

sum_poly = refelect( sum_poly, 32 );

sum_poly;
}

void create_table( unsigned long *table )
{
for(

i = 0; i <= 255;

i )
{
table[i] = get_sum_poly( (unsigned char) i );
}
}

void output_table( const unsigned long *table )
{
FILE *fp = fopen( "table.txt", "w" );

for(

y = 0; y < 64;

y )
{
fpr

f( fp, "0x%08lXL,\t0x%08lXL,\t0x%08lXL,\t0x%08lXL, \n",
table[ y * 4 + 0],
table[ y * 4 + 1],
table[ y * 4 + 2],
table[ y * 4 + 3] );
}

fclose( fp );
}

{
/**//// the table
unsigned long table[256];
/**//// the data
unsigned long data = 0x1011035b;
/**//// load the register with the data
unsigned long regi = 0;
/**//// allocate memory to contain the data
unsigned char p[4];
/**//// copy data
memcpy( p, &data, 4 );
/**//// create the table
create_table( table );
/**//// output the table
output_table( table );

/**//// because data contains 4

s
for(

i = 0; i < 4;

i )
{
regi = ( regi << 8 ) ^ table[ ( regi >> 24 ) ^ p[i] ];
}

/**//// and now, register contains the re

der which is also called CRC value.

0;
}

Please FORGIVE me

我想我并没有将整个过程彻底地讲清楚

但是我希望你能明白大致

原理

有关table-driven中那个神奇

表

来历

有关CRC32算法

推导过程等等的类

代码下载:

/uploads/soft/1_080624030837.rar

标签：fft算法原理数据挖掘原理与算法 crc算法原理 crc32算法

下载文章的 PDF文档电子版离线看

我顶

专注于互联网--专注于架构

首页 »算法 » crc32算法:CRC32算法实现原理 »正文

crc32算法:CRC32算法实现原理

相关文章

读者评论

发表评论

热门标签

精华推荐

最新标签

Dig排行

阅读排行

最新文章