git.gag.com Git - debian/gnuradio/blob - gr-trellis/doc/gr-trellis.xml

   1 <?xml version="1.0" encoding="ISO-8859-1"?>
   2 <!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
   3           "docbookx.dtd" [
   4   <!ENTITY test_tcm_listing SYSTEM "test_tcm.py.xml">
   5 ]>
   6
   7 <article>
   8
   9 <articleinfo>
  10   <title>Trellis-based algorithms for GNU Radio</title>
  11   <author>
  12      <firstname>Achilleas</firstname>
  13      <surname>Anastasopoulos</surname>
  14      <affiliation>
  15         <address>
  16            <email>anastas@umich.edu</email>
  17         </address>
  18      </affiliation>
  19   </author>
  20
  21 <revhistory>
  22   <revision>
  23   <revnumber>v0.0</revnumber>
  24   <date>2006-08-03</date>
  25   <revremark>
  26     First cut.
  27   </revremark>
  28   </revision>
  29
  30 </revhistory>
  31
  32 <abstract><para>This document provides a description of the
  33 Finite State Machine (FSM) implementation and the related
  34 trellis-based encoding and decoding algorithms
  35 for GNU Radio.
  36 </para></abstract>
  37
  38 </articleinfo>
  39
  40
  41
  42
  43 <!--=====================================================-->
  44 <sect1 id="intro"><title>Introduction</title>
  45
  46 <para>....</para>
  47
  48 <para>
  49 The basic goal of the implementation is to have a generic way of
  50 describing an FSM that is decoupled from whether it describes a
  51 convolutional
  52 code (CC), a trellis code (TC), an inter-symbol interference (ISI)
  53 channel, or any
  54 other communication system that can be modeled with an FSM.
  55 To achieve this goal, we need to separate the pure FSM descrition from the
  56 rest of the model details. For instance, in the case of a rate 2/3 TC,
  57 the FSM should not involve details about the modulation used (it can
  58 be an 8-ary PAM, or 8-PSK, etc). Similarly, when attempting maximum likelihood
  59 sequence detection (MLSD)--using for instance the Viterbi algorithm (VA)--
  60 the VA implementation should not be concerned with the channel details
  61 (such as modulations, channel type, hard or soft inputs, etc).
  62 Clearly, having generality as the primary goal implies some penalty
  63 on the code efficiency, as compared to fully custom implementations.
  64 </para>
  65
  66 <para>
  67 We will now describe the implementation of the basic ingedient, the FSM.
  68 </para>
  69
  70 </sect1>
  71
  72
  73 <!--=====================================================-->
  74 <sect1 id="fsm"><title>The FSM class</title>
  75
  76 <para>An FSM describes the evolution of a system with inputs
  77 x<subscript>k</subscript>, states s<subscript>k</subscript> and outputs y<subscript>k</subscript>. At time k the FSM state is s<subscript>k</subscript>.
  78 Upon reception of a new input symbol x<subscript>k</subscript>, it outputs an output symbol
  79 y<subscript>k</subscript> which is a function of both x<subscript>k</subscript> and s<subscript>k</subscript>.
  80 It will then move to a next state s<subscript>k+1</subscript>.
  81 An FSM has a finite number of states, input and output symbols.
  82 All these are formally described as follows:
  83 </para>
  84
  85 <itemizedlist>
  86 <listitem><para>The input alphabet A<subscript>I</subscript>={0,1,2,...,I-1}, with cardinality I, so that x<subscript>k</subscript> takes values in A<subscript>I</subscript>.</para></listitem>
  87 <listitem><para>The state alphabet A<subscript>S</subscript>={0,1,2,...,S-1}, with cardinality S, so that s<subscript>k</subscript> takes values in A<subscript>S</subscript>.</para></listitem>
  88 <listitem><para>The output alphabet A<subscript>O</subscript>={0,1,2,...,O-1}, with cardinality O, so that y<subscript>k</subscript> takes values in A<subscript>O</subscript></para></listitem>
  89 <listitem><para>The "next-state" function NS: A<subscript>S</subscript> x A<subscript>I</subscript> --> A<subscript>S</subscript>,
  90 with the meaning
  91 s<subscript>k+1</subscript> = NS(s<subscript>k</subscript>, x<subscript>k</subscript>)</para></listitem>
  92 <listitem><para>The "output-symbol" function OS: A<subscript>S</subscript> x A<subscript>I</subscript> --> A<subscript>S</subscript>,
  93 with the meaning
  94 y<subscript>k</subscript> = OS(s<subscript>k</subscript>, x<subscript>k</subscript>)</para></listitem>
  95 </itemizedlist>
  96
  97 <para>
  98 Thus, a complete description of the FSM is given by the
  99 the five-tuple (I,S,O,NS,OS).
 100 Observe that implementation details are hidden
 101 in how the outside world interprets these input and output
 102 integer symbols.
 103 Here is an example of an FSM describing the (2,1) CC
 104 with constraint length 3 and generator polynomial matrix
 105 (1+D+D<superscript>2</superscript> ,  1+D<superscript>2</superscript>)
 106 from Proakis-Salehi pg. 779.
 107 </para>
 108
 109
 110 <example id="cc_ex"><title>(2,1) CC with generator polynomials (1+D+D<superscript>2</superscript> , 1+D<superscript>2</superscript>)</title>
 111
 112 <para>
 113 This CC accepts 1 bit at a time, and outputs 2 bits at a time.
 114 It has a shift register storing the last two input bits.
 115 In particular,
 116 b<subscript>k</subscript>(0)=x<subscript>k</subscript>+
 117 x<subscript>k-1</subscript>+x<subscript>k-2</subscript>, and
 118 b<subscript>k</subscript>(1)=x<subscript>k</subscript>+
 119 x<subscript>k-2</subscript>, where addition is mod-2.
 120 We can represent the state of this system
 121 as s<subscript>k</subscript> = (x<subscript>k-1</subscript> x<subscript>k-2</subscript>)<subscript>10</subscript>. In addition we can represent its
 122 output symbol as y<subscript>k</subscript> = (b<subscript>k</subscript>(1) b<subscript>k</subscript>(0))<subscript>10</subscript>.
 123 Based on the above assumptions, the input alphabet A<subscript>I</subscript>={0,1}, so I=2;
 124 the state alphabet A<subscript>S</subscript>={0,1,2,3}, so S=4; and
 125 the output alphabet A<subscript>O</subscript>={0,1,2,3}, so O=4.
 126 The "next-state" function NS(,) is given by
 127 <programlisting>
 128 s<subscript>k</subscript>       x<subscript>k</subscript>       s<subscript>k+1</subscript>
 129 0       0       0
 130 0       1       2
 131 1       0       0
 132 1       1       2
 133 2       0       1
 134 2       1       3
 135 3       0       1
 136 3       1       3
 137 </programlisting>
 138 The "output-symbol" function OS(,) can be given by
 139 <programlisting>
 140 s<subscript>k</subscript>       x<subscript>k</subscript>       y<subscript>k</subscript>
 141 0       0       0
 142 0       1       3
 143 1       0       3
 144 1       1       0
 145 2       0       1
 146 2       1       2
 147 3       0       2
 148 3       1       1
 149 </programlisting>
 150 </para>
 151
 152 <para>
 153 Note that although the CC outputs 2 bits per time period, following
 154 our approach, there is only one (quaternary) output symbol per
 155 time period (for instance, here we use the decimal representation
 156 of the 2-bits). Also note that the modulation used is not part of
 157 the FSM description: it can be BPSK, OOK, BFSK, QPSK with or without Gray mapping, etc;
 158 it is up to the rest of the program to interpret the meaning of
 159 the symbol y<subscript>k</subscript>.
 160 </para>
 161
 162 </example>
 163
 164
 165 <para>
 166 The C++ implementation of the FSM class keeps private information about
 167 I,S,O,NS,OS and public methods to read and write them. The NS
 168 and OS matrices are implemented as STL 1-dimensional vectors.
 169 </para>
 170
 171 <programlisting>
 172 class fsm {
 173 private:
 174   int d_I;
 175   int d_S;
 176   int d_O;
 177   std::vector&lt;int&gt; d_NS;
 178   std::vector&lt;int&gt; d_OS;
 179   std::vector&lt;int&gt; d_PS;
 180   std::vector&lt;int&gt; d_PI;
 181 public:
 182   fsm();
 183   fsm(const fsm &amp;FSM);
 184   fsm(const int I, const int S, const int O, const std::vector&lt;int&gt; &amp;NS, const std::vector&lt;int&gt; &amp;OS);
 185   fsm(const char *name);
 186   fsm(const int mod_size, const int ch_length);
 187   int I () const { return d_I; }
 188   int S () const { return d_S; }
 189   int O () const { return d_O; }
 190   const std::vector&lt;int&gt; &amp; NS () const { return d_NS; }
 191   const std::vector&lt;int&gt; &amp; OS () const { return d_OS; }
 192   const std::vector&lt;int&gt; &amp; PS () const { return d_PS; }
 193   const std::vector&lt;int&gt; &amp; PI () const { return d_PI; }
 194 };
 195 </programlisting>
 196
 197 <para>
 198 As can be seen, other than the trivial and the copy constructor,
 199 there are three additional
 200 ways to construct an FSM.
 201 </para>
 202
 203 <itemizedlist>
 204 <listitem>
 205 <para>Supplying the parameters I,S,O,NS,OS:</para>
 206 <programlisting>
 207   fsm(const int I, const int S, const int O, const std::vector&lt;int&gt; &amp;NS, const std::vector&lt;int&gt; &amp;OS);
 208 </programlisting>
 209 </listitem>
 210
 211 <listitem>
 212 <para>Giving a filename containing all the FSM information:</para>
 213 <programlisting>
 214   fsm(const char *name);
 215 </programlisting>
 216 <para>
 217 This information has to be in the following format:
 218 <programlisting>
 219 I S O
 220
 221 NS(0,0)   NS(0,1)   ...  NS(0,I-1)
 222 NS(1,0)   NS(1,1)   ...  NS(1,I-1)
 223 ...
 224 NS(S-1,0) NS(S-1,1) ...  NS(S-1,I-1)
 225
 226 OS(0,0)   OS(0,1)   ...  OS(0,I-1)
 227 OS(1,0)   OS(1,1)   ...  OS(1,I-1)
 228 ...
 229 OS(S-1,0) OS(S-1,1) ... OS(S-1,I-1)
 230 </programlisting>
 231 </para>
 232 <para>
 233 For instance, the file containing the information for the example mentioned above is of the form:
 234 <programlisting>
 235 2 4 4
 236
 237 0 2
 238 0 2
 239 1 3
 240 1 3
 241
 242 0 3
 243 3 0
 244 1 2
 245 2 1
 246 </programlisting>
 247 </para>
 248 </listitem>
 249
 250 <listitem>
 251 <para>The third way is specific to FSMs resulting from shift registers, and the output symbol being the entire transition (ie, current_state and current_input). These FSMs are usefull when describibg ISI channels. In particular the state is comprised of the.....
 252 </para>
 253 <programlisting>
 254   fsm(const int mod_size, const int ch_length);
 255 </programlisting>
 256 </listitem>
 257
 258 </itemizedlist>
 259
 260
 261 <para>
 262 Finally, as can be seen from the above description, there are
 263 two more variables included in the FSM class implementation,
 264 the PS and the PI matrices. These are computed internally
 265 when an FSM is instantiated and their meaning is as follows.
 266 Sometimes (eg, in the traceback operation of the VA) we need
 267 to trace the history of the state or the input sequence.
 268 To do this we would like to know for a given state s<subscript>k</subscript>, what are the possible previous states s<subscript>k-1</subscript>
 269 and what input symbols x<subscript>k-1</subscript> will get us from
 270 s<subscript>k-1</subscript> to s<subscript>k</subscript>. This information can be derived from NS; however we want to have it ready in a
 271 convenient format.
 272 In the following we assume that for any state,
 273 the number of incoming transitions is the same as the number of
 274 outgoing transitions, ie, equal to I. All applications of interest
 275 have FSMs satisfying this requirement.
 276
 277 If we arbitrarily index the incoming transitions to the current state
 278 by "i", then  as i goes from 0 to I-1, PS(s<subscript>k</subscript>,i)
 279 gives all previous states s<subscript>k-1</subscript>,
 280 and PI(s<subscript>k</subscript>,i) gives all previous inputs x<subscript>k-1</subscript>.
 281 In other words, for any given s<subscript>k</subscript> and any index i=0,1,...I-1, starting from
 282 s<subscript>k-1</subscript>=PS(s<subscript>k</subscript>,i)
 283 with input
 284 x<subscript>k-1</subscript>=PI(s<subscript>k</subscript>,i)
 285 will get us to the state s<subscript>k</subscript>.
 286 More formally, for any i=0,1,...I-1 we have
 287 s<subscript>k</subscript> = NS(PS(s<subscript>k</subscript>,i),PI(s<subscript>k</subscript>,i)).
 288
 289 </para>
 290
 291
 292 </sect1>
 293
 294
 295
 296
 297
 298 <!--=====================================================-->
 299 <sect1 id="tcm"><title>TCM: A Complete Example</title>
 300
 301 <para>
 302 We now discuss through a concrete example how
 303 the above FSM model can be used in GNU Radio.
 304
 305 The communication system that we want to simulate
 306 consists of a source generating the
 307 input information in packets, a CC encoding each packet separately,
 308 a memoryless modulator,
 309 an additive white Gaussian noise (AWGN) channel, and
 310 the VA performing MLSD.
 311 The program source is as follows.
 312 </para>
 313
 314 &test_tcm_listing;
 315
 316 <para>
 317 The program is called by
 318 <literallayout>
 319 ./test_tcm.py fsm_fname Es/No_db repetitions
 320 </literallayout>
 321 where "fsm_fname" is the file containing the FSM specification of the
 322 tested TCM code, "Es/No_db" is the SNR in dB, and "repetitions"
 323 are the number of packets to be transmitted and received in order to
 324 collect sufficient number of errors for an accurate estimate of the
 325 error rate.
 326 </para>
 327
 328 <para>
 329 The FSM is first intantiated in "main" by
 330 </para>
 331 <programlisting>
 332  62      f=trellis.fsm(fname) # get the FSM specification from a file (will hopefully be automated in the future...)
 333 </programlisting>
 334
 335
 336
 337
 338
 339
 340
 341 <para>
 342 Each packet has size Kb bits
 343 (we choose Kb to be a multiple of 16 so that all bits fit nicely into shorts and can be generated by the lfsr GNU Radio).
 344 Assuming that the FSM input has cardinality I, each input symbol consists
 345 of bitspersymbol=log<subscript>2</subscript>( I ). The Kb/16 shorts are now
 346 unpacked to K=Kb/bitspersymbol input
 347 symbols that will drive the FSM encoder.
 348 </para>
 349 <programlisting>
 350  63      Kb=1024*16  # packet size in bits (make it multiple of 16 so it can be packed in a short)
 351  64      bitspersymbol = int(round(math.log(f.I())/math.log(2))) # bits per FSM input symbol
 352  65      K=Kb/bitspersymbol # packet size in trellis steps
 353 </programlisting>
 354
 355
 356
 357 <para>
 358 The FSM will produce K output symbols (remeber the FSM produces always one output symbol for each input symbol). Each of these symbols needs to be modulated. Since we are simulating the communication system, we need not simulate the actual waveforms. An M-ary, N-dimensional
 359 modulation is completely specified by a set of M, N-dimensional real vectors. In "fsm_utils.py" file we give a number of useful modulations with the following format: modulation = (N,constellation), where
 360 constellation=[c11,c12,...,c1N,c21,c22,...,c2N,...,cM1,cM2,...cMN].
 361 The meaning of the above is that every constellation point c_i
 362 is an N-dimnsional vector c_i=(ci1,ci2,...,ciN)
 363 For instance, 4-ary PAM is represented as
 364 (1,[-3, -1, 1, 3]), while QPSK is represented as
 365 (2,[1, 0, 0, 1, 0, -1, -1, 0]). In our example we choose QPSK modulation.
 366 Clearly, M should be equal to the cardinality of the FSM output, O.
 367 Finally the average symbol energy and noise variance are calculated.
 368 </para>
 369 <programlisting>
 370  66      modulation = fsm_utils.psk4 # see fsm_utlis.py for available predefined modulations
 371  67      dimensionality = modulation[0]
 372  68      constellation = modulation[1]
 373  69      if len(constellation)/dimensionality != f.O():
 374  70          sys.stderr.write (&apos;Incompatible FSM output cardinality and modulation size.\n&apos;)
 375  71          sys.exit (1)
 376  72      # calculate average symbol energy
 377  73      Es = 0
 378  74      for i in range(len(constellation)):
 379  75          Es = Es + constellation[i]**2
 380  76      Es = Es / (len(constellation)/dimensionality)
 381  77      N0=Es/pow(10.0,esn0_db/10.0); # noise variance
 382 </programlisting>
 383
 384
 385
 386 <para>
 387 Then, "run_test" is called with a different "seed" so that we get
 388 different noise realizations.
 389 </para>
 390 <programlisting>
 391  82          (s,e)=run_test(f,Kb,bitspersymbol,K,dimensionality,constellation,N0,-long(666+i)) # run experiment with different seed to get different noise realizations
 392 </programlisting>
 393
 394
 395
 396 <para>
 397 Let us examine now the "run_test" function.
 398 First we set up the transmitter blocks.
 399 The Kb/16 shorts are first unpacked to
 400 symbols consistent with the FSM input alphabet.
 401 The FSm encoder requires the FSM specification,
 402 and an initial state (which is set to 0 in this example).
 403 </para>
 404 <programlisting>
 405  15      # TX
 406  16      src = gr.lfsr_32k_source_s()
 407  17      src_head = gr.head (gr.sizeof_short,Kb/16) # packet size in shorts
 408  18      s2fsmi = gr.packed_to_unpacked_ss(bitspersymbol,gr.GR_MSB_FIRST) # unpack shorts to symbols compatible with the FSM input cardinality
 409  19      enc = trellis.encoder_ss(f,0) # initial state = 0
 410 </programlisting>
 411
 412
 413
 414
 415 <para>
 416 We now need to modulate the FSM output symbols.
 417 The "chunks_to_symbols_sf" is essentially a memoryless mapper which
 418 for each input symbol y_k
 419 outputs a sequence of N numbers ci1,ci2,...,ciN  representing the
 420 coordianates of the constellation symbol c_i with i=y_k.
 421 </para>
 422 <programlisting>
 423  20      mod = gr.chunks_to_symbols_sf(constellation,dimensionality)
 424 </programlisting>
 425
 426 <para>
 427 The channel is AWGN with appropriate noise variance.
 428 For each transmitted symbol c_k=(ck1,ck2,...,ckN) we receive a noisy version
 429 r_k=(rk1,rk2,...,rkN).
 430 </para>
 431 <programlisting>
 432  22      # CHANNEL
 433  23      add = gr.add_ff()
 434  24      noise = gr.noise_source_f(gr.GR_GAUSSIAN,math.sqrt(N0/2),seed)
 435 </programlisting>
 436
 437
 438
 439 <para>
 440 Part of the design methodology was to decouple the FSM and VA from
 441 the details of the modulation, channel, receiver front-end etc.
 442 In order for the VA to run, we only need to provide it with
 443 a number representing a cost associated with each transition
 444 in the trellis. Then the VA will find the sequence with
 445 the smallest total cost through the trellis.
 446 The cost associated with a transition (s_k,x_k) is only a function
 447 of the output y_k = OS(s_k,x_k), and the observation
 448 vector r_k. Thus, for each time period, k,
 449 we need to label each of the SxI transitions with such a cost.
 450 This means that for each time period we need to evaluate
 451 O such numbers (one for each possible output symbol y_k).
 452 This is done
 453 in "metrics_f". In particular, metrics_f is a memoryless device
 454 taking N inputs at a time and producing O outputs. The N inputs are
 455 rk1,rk2,...,rkN.
 456 The O outputs
 457 are the costs associated with observations rk1,rk2,...,rkN and
 458 hypothesized output symbols c_1,c_2,...,c_M. For instance,
 459 if we choose to perform soft-input VA, we need to evaluate
 460 the Euclidean distance between r_k and each of c_1,c_2,...,c_M,
 461 for each of the K transmitted symbols.
 462 Other options are available as well; for instance, we can
 463 do hard decision demodulation and feed the VA with
 464 symbol Hamming distances, or even bit Hamming distances, etc.
 465 These are all implemented in "metrics_f".
 466 </para>
 467 <programlisting>
 468  26      # RX
 469  27      metrics = trellis.metrics_f(f.O(),dimensionality,constellation,trellis.TRELLIS_EUCLIDEAN) # data preprocessing to generate metrics for Viterbi
 470 </programlisting>
 471
 472 <para>
 473 Now the VA can run once it is supplied by the initial and final states.
 474 The initial state is known to be 0; the final state is usually
 475 forced to some value by padding the information sequence appropriately.
 476 In this example, we always send the the same info sequence (we only randomize noise) so we can evaluate off line the final state and then provide it to the VA (a value of -1 signifies that there is no fixed initial
 477 or final state). The VA outputs the estimates of the symbols x_k which
 478 are then packed to shorts and compared with the transmitted sequence.
 479 </para>
 480 <programlisting>
 481  28      va = trellis.viterbi_s(f,K,0,-1) # Put -1 if the Initial/Final states are not set.
 482  29      fsmi2s = gr.unpacked_to_packed_ss(bitspersymbol,gr.GR_MSB_FIRST) # pack FSM input symbols to shorts
 483  30      dst = gr.check_lfsr_32k_s();
 484 </programlisting>
 485
 486
 487
 488
 489 <para>
 490 The function returns the number of shorts and the number of shorts in error. Observe that this way the estimated error rate refers to
 491 16-bit-symbol error rate.
 492 </para>
 493 <programlisting>
 494  48      return (ntotal,ntotal-nright)
 495 </programlisting>
 496
 497
 498
 499 </sect1>
 500
 501
 502 <!--====================n================================-->
 503 <sect1 id="future"><title>Future Work</title>
 504
 505
 506
 507 <itemizedlist>
 508
 509 <listitem>
 510 <para>
 511 Improve the documentation :-)
 512 </para>
 513 </listitem>
 514
 515 <listitem>
 516 <para>
 517 automate fsm generation from generator polynomials
 518 (feedforward or feedback form).
 519 </para>
 520 </listitem>
 521
 522 <listitem>
 523 <para>
 524 Optimize the VA code.
 525 </para>
 526 </listitem>
 527
 528 <listitem>
 529 <para>
 530 Provide implementation of soft-input soft-output (SISO) decoders for
 531 potential use in concatenated systems. Also a host of suboptimal
 532 decoders, eg, sphere decoding, M- and T- algorithms, sequential decoding, etc.
 533 can be implemented.
 534 </para>
 535 </listitem>
 536
 537
 538 <listitem>
 539 <para>
 540 Although turbo decoding is rediculously slow in software,
 541 we can design it in principle. One question is, whether we should
 542 use the encoder, and SISO blocks and connect them
 543 through GNU radio or we should implement turbo-decoding
 544 as a single block (issues with buffering between blocks).
 545 </para>
 546 </listitem>
 547
 548 </itemizedlist>
 549
 550 </sect1>
 551
 552
 553 </article>