digram

Byte pair encoding, or digram coding, is a simple form of data compression in which the most common pair of consecutive bytes in the data is replaced with a byte that does not occur within that data. A table of the replacements is required to rebuild the original data. The algorithm was first described publicly by Philip Gage in a February 1994 article, "A New Algorithm for Data Compression", in the C Users Journal. A variant of the technique has been shown to be useful in several natural language processing applications.
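
The replacement step described above can be illustrated with a rough Python sketch. This is not Gage's original code, just one possible reading of the description: the function names `bpe_compress` and `bpe_decompress`, the choice of the lowest unused byte value as the replacement, and the rule of stopping once no pair occurs at least twice are all assumptions made for illustration.

```python
from collections import Counter


def bpe_compress(data: bytes):
    """Repeatedly replace the most common adjacent byte pair with an unused byte.

    Returns the packed bytes plus the replacement table needed to undo it.
    """
    seq = list(data)
    table = []  # (replacement_byte, (first, second)), in the order applied
    while True:
        unused = set(range(256)) - set(seq)
        pairs = Counter(zip(seq, seq[1:]))
        if not unused or not pairs:
            break  # no spare byte value, or fewer than two bytes left
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # assumption: a pair seen only once is not worth replacing
        new_byte = min(unused)  # assumption: take the lowest unused value
        table.append((new_byte, pair))
        out, i = [], 0
        while i < len(seq):
            # Scan left to right, swapping each non-overlapping occurrence.
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(new_byte)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return bytes(seq), table


def bpe_decompress(data: bytes, table) -> bytes:
    """Undo the replacements by expanding them in reverse order."""
    seq = list(data)
    for new_byte, (first, second) in reversed(table):
        out = []
        for value in seq:
            if value == new_byte:
                out.extend((first, second))
            else:
                out.append(value)
        seq = out
    return bytes(seq)


packed, table = bpe_compress(b"aaabdaaabac")
restored = bpe_decompress(packed, table)
```

Expanding the table in reverse order matters because a later replacement may stand for a pair that itself contains an earlier replacement byte.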
