Lesson 10: Compact Strings & Memory Optimization in Java 9+

In Java 9, a major memory optimization was introduced called Compact Strings. This feature reduces memory usage for String objects by using a more efficient internal representation.


1. What Are Compact Strings?

Before Java 9, all String objects used UTF-16 encoding, where each character was stored using 2 bytes, even if it only required 1 byte (e.g., ASCII characters).

🚀 Java 9 introduced Compact Strings, which:
✅ Uses 1 byte per character for ASCII strings (Latin characters).
✅ Uses 2 bytes per character only when necessary (e.g., for non-Latin characters).
✅ Reduces heap memory usage significantly.
No code changes needed! Java optimizes this automatically.


2. How Compact Strings Work

  • Java 9 introduced a new byte[]-based internal representation in String.
  • Before Java 9, String used a char[] array, which always stored 2 bytes per character.
  • In Java 9+, ASCII strings use a byte[] array instead of char[], saving memory.

🔍 Java 8 String Representation (Always UTF-16, 2 bytes per char)

public final class String {
    private final char[] value; // Uses 2 bytes per character (UTF-16)
}

🚀 Java 9+ String Representation (Uses byte[] for ASCII, char[] for others)

public final class String {
    private final byte[] value; // Uses 1 byte per character if possible
    private final byte coder;   // Indicates if ASCII (LATIN1) or UTF-16
}

If all characters fit in ASCII, Java uses LATIN1 (1 byte per char).
If non-ASCII characters are present, Java falls back to UTF-16 (2 bytes per char).


3. Memory Savings with Compact Strings

Compact Strings reduce memory usage by 50% for ASCII-heavy applications.

📌 Example: Measuring String Memory Usage

The following program creates Strings in Java 8 and Java 9+ and compares their memory usage.

import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;
import java.lang.management.MemoryUsage;
import java.util.ArrayList;
import java.util.List;

public class Main {
    public static void main(String[] args) {
        List<String> strings = new ArrayList<>();
        
        for (int i = 0; i < 1_000_000; i++) {
            strings.add("Hello"); // ASCII String
        }

        MemoryMXBean memoryBean = ManagementFactory.getMemoryMXBean();
        MemoryUsage heapMemoryUsage = memoryBean.getHeapMemoryUsage();
        System.out.println("Used Memory: " + heapMemoryUsage.getUsed() / 1024 / 1024 + " MB");
    }
}

Expected Output in Java 8 (Higher Memory Usage)

Used Memory: 100 MB

Expected Output in Java 9+ (Lower Memory Usage)

Used Memory: 50 MB

Java 9+ saves ~50% memory for ASCII strings!


4. Performance Benefits of Compact Strings

Reduced Heap Usage – Less memory footprint for String-heavy applications.
Better CPU Cache Utilization – Smaller Strings fit into CPU cache more efficiently.
Less Garbage Collection (GC) Overhead – Fewer objects created, reducing GC load.


5. When Does Java Use Compact Strings?

String TypeEncoding in Java 9+Memory Used per Character
"Hello" (ASCII)LATIN1 (byte[])1 byte per character
"你好" (Chinese)UTF-16 (char[])2 bytes per character
"مرحبا" (Arabic)UTF-16 (char[])2 bytes per character
"12345" (Numbers)LATIN1 (byte[])1 byte per character

6. How to Check if Compact Strings Are Enabled?

To verify that Compact Strings are active, use:

java -XX:+PrintFlagsFinal | grep CompactStrings

Expected Output:

 1   636 CompactStrings                              true

✔ If CompactStrings is true, then Java 9+ is optimizing memory usage automatically.

🚀 Disabling Compact Strings (Not Recommended)

java -XX:-CompactStrings

✔ This forces Java to always use UTF-16, increasing memory usage.


7. When Compact Strings Won’t Help?

🚨 Compact Strings only optimize ASCII characters (Latin1).
🚨 If a String contains even one non-ASCII character, it falls back to UTF-16.

Example: Mixed ASCII and Non-ASCII Strings

String asciiOnly = "Hello";    // Uses LATIN1 (1 byte per char)
String mixed = "Hello 你好";    // Uses UTF-16 (2 bytes per char)

Key Takeaway:
✔ If most of your data is English or numbers, Compact Strings will save memory.
✔ If your application processes multi-language data, the benefits may be lower.


Why Does the Used Memory Always Show same memory usage in the Example?

When running the memory measurement program, the reported “Used Memory” does not change significantly between different string values like "Hello" or "HelloWorld". Here’s why:


1️⃣ Java String Deduplication (JVM Optimization)

Java’s Garbage Collector (GC) deduplicates Strings to save memory, meaning:

  • If a string already exists in memory, Java reuses it instead of creating a new copy.
  • Even if you add "Hello" a million times, Java may store it only once internally.

How It Works:

String str1 = "Hello";
String str2 = "Hello"; // ✅ Uses the same memory as str1 (deduplicated)

✔ Java does not create a new "Hello" instance; it reuses the existing one.

📌 Enable String Deduplication Check:
If you’re using G1 Garbage Collector (Java’s default since Java 9), deduplication is enabled by default.
To check:

java -XX:+PrintStringDeduplicationStatistics -jar YourApp.jar

Output Example:

Before Deduplication: 1,000,000 strings
After Deduplication: 1, unique instances: 1

🚀 Even though we created 1,000,000 "Hello" strings, Java optimized memory usage to store only 1!


2️⃣ String Interning – Java Saves Memory Automatically

When using string literals ("Hello"), Java automatically interns the string, meaning:

  • It stores the value once in the String Pool.
  • All identical string literals point to the same memory location.

Example:

String s1 = "Hello"; 
String s2 = "Hello"; 
System.out.println(s1 == s2); // ✅ true (same reference)

Memory-efficient: Both s1 and s2 point to the same location in memory.

🚀 Effect on Your Program:
Even though you added "Hello" a million times to the list, the JVM interned it and stored only one copy, leading to minimal memory changes.


3️⃣ Compact Strings Save Memory Efficiently

In Java 9+, Compact Strings further optimize memory by:

  • Using 1 byte per character for "Hello" (ASCII characters).
  • Using 2 bytes per character only for non-ASCII text (e.g., "你好").
  • Since "Hello" is ASCII-only, Java optimizes storage further, keeping memory usage low.

📌 Example (UTF-16 vs. Compact String Optimization):

String asciiString = "Hello";    // ✅ Uses 5 bytes (LATIN1)
String unicodeString = "你好";    // ❌ Uses 4 bytes (UTF-16)

In Java 9+, "Hello" takes 5 bytes instead of 10 bytes (Java 8).
No matter how many "Hello" strings you create, Java efficiently manages memory.


4️⃣ How to Force Java to Consume More Memory?

If you want to see higher memory usage, try modifying the program so Java cannot deduplicate strings:

🔴 Original Code (Optimized Memory Usage)

for (int i = 0; i < 1_000_000; i++) {
    strings.add("Hello"); // ✅ Java reuses same "Hello" (deduplication)
}

Memory stays low (~9 MB) because Java stores only one “Hello”.


🟢 Force Java to Create New Strings

🚀 Solution: Use new String("Hello") to bypass interning.

for (int i = 0; i < 1_000_000; i++) {
    strings.add(new String("Hello" + i)); // 🔥 Different value each time!
}

Expected Memory Usage:

Used Memory: ~50 MB or more

Why? Java now creates unique String objects, increasing memory usage.


🔍 Summary: Why Memory Usage Stays the Same

ReasonEffect on Memory
String DeduplicationJava reuses the same "Hello" string.
String InterningJava stores "Hello" once in memory.
Compact Strings (Java 9+)"Hello" uses only 1 byte per char, reducing memory footprint.
Lack of New Object CreationJava does not create multiple copies of "Hello".

🚀 Want to see higher memory usage?
✔ Use new String("Hello" + i) instead of "Hello".
✔ Add non-ASCII characters (e.g., "你好" forces UTF-16 encoding).


Final Thoughts

Java 9+ optimizes memory for Strings using Compact Strings, Deduplication, and Interning.
Even if you add 1 million "Hello" strings, Java stores it only once, keeping memory low.
Use new String("Hello") to bypass optimization and see real memory impact.

The next feature: JShell (Java’s REPL for testing code interactively)? 😊🚀

Tags:

Java Sleep