Text Compression
Redundancy Removal: Text often contains repeated patterns or characters. Compression algorithms identify these patterns and replace them with shorter representations.
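As an illustration of replacing repeated characters with a shorter representation, here is a minimal Python sketch of character-level run-length encoding. The function names `rle_encode` and `rle_decode` are hypothetical, chosen for this example, and the (count, character) pair format is just one possible representation.

```python
from itertools import groupby


def rle_encode(text: str) -> list[tuple[int, str]]:
    """Collapse each run of identical characters into a (count, char) pair."""
    return [(len(list(group)), char) for char, group in groupby(text)]


def rle_decode(pairs: list[tuple[int, str]]) -> str:
    """Expand (count, char) pairs back into the original text."""
    return "".join(char * count for count, char in pairs)


pairs = rle_encode("aaaabbbcc")
print(pairs)              # [(4, 'a'), (3, 'b'), (2, 'c')]
print(rle_decode(pairs))  # aaaabbbcc
```

This only pays off when the text contains long runs. For ordinary prose, where most runs have length 1, storing a count per character makes the output larger than the input.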
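Encoding Schemes for Redundancy Removal: Compression encodes the data with a specific algorithm. One common method, used in the example that follows, is run-length encoding (RLE), which stores a repeat count together with the repeated unit.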
Identifying Patterns: In a string such as "hello hello hello", the word "hello" appears three times.
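Compressed Representation: Using an RLE-style scheme (a repeat count followed by the repeated unit), the string could be represented as "3hello". Strictly speaking, RLE collapses runs of identical characters; counting a repeated word is a generalization of the same idea, and the decoder must know that the copies are separated by spaces.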
Stored Size: Original: 17 bytes for "hello hello hello" (15 letters plus 2 spaces, one byte per character).
Compressed: 6 bytes ("3hello").
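The byte counts in this example can be checked with a short Python sketch. It assumes the input is "hello hello hello" and that the copies are separated by single spaces; `encode_repeated` and `decode_repeated` are hypothetical helper names, not part of any library, and the sketch assumes the input text does not itself begin with a digit.

```python
def encode_repeated(text: str, sep: str = " ") -> str:
    """Return "<count><unit>" if text is one unit repeated with sep between copies.

    Any other text is returned unchanged.
    """
    units = text.split(sep)
    unit = units[0]
    if len(units) > 1 and len(set(units)) == 1 and unit and not unit[0].isdecimal():
        return f"{len(units)}{unit}"
    return text


def decode_repeated(encoded: str, sep: str = " ") -> str:
    """Reverse encode_repeated: read the leading count, then repeat the unit."""
    i = 0
    while i < len(encoded) and encoded[i].isdecimal():
        i += 1
    if i == 0:
        return encoded  # no leading count, so the text was stored as-is
    return sep.join([encoded[i:]] * int(encoded[:i]))


original = "hello hello hello"
encoded = encode_repeated(original)

print(encoded)                       # 3hello
print(len(original.encode("utf-8")))  # 17
print(len(encoded.encode("utf-8")))   # 6
print(decode_repeated(encoded) == original)  # True
```

The saving here depends on the whole string being a single repeated unit. Text with mixed content needs a more general scheme that can reference repeated substrings wherever they occur.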