Summary: Arrays, Lists & Autoboxing¶

Combined Knowledge from: Tim Buchalka's Course + Effective Java
Mastery Level:

Topic Overview¶

A deep understanding of Java's core data structures — from fixed-size arrays to resizable lists, the iteration patterns that traverse them, and the autoboxing mechanism that bridges primitive and object worlds. This summary covers both the how (practical usage) and the why (memory layout, performance, JVM internals).

The Four Data Structures¶

1. Arrays¶

Definition: A fixed-size, contiguous block of memory holding elements of the same type. The most fundamental data structure in Java.

flowchart LR
    subgraph memory[" ARRAY MEMORY LAYOUT "]
        direction LR
        I0["[0] = 10"]
        I1["[1] = 20"]
        I2["[2] = 30"]
        I3["[3] = 40"]
        I4["[4] = 50"]
    end

    REF["int[] arr"] --> I0

Key Properties¶

Property	Detail
Size	Fixed at creation — cannot grow or shrink
Memory	Contiguous — elements stored side-by-side
Access	O(1) random access via index
Type	Can hold primitives (`int[]`) or objects (`String[]`)
Covariant	`Sub[]` is a subtype of `Super[]` — a design flaw (Item 28)

Essential Operations & Big O¶

Operation	Big O	Notes
Access by index	O(1)	Direct memory offset calculation
Search (unsorted)	O(n)	Linear scan required
Search (sorted)	O(log n)	`Arrays.binarySearch()`
Insert/Delete	O(n)	Must shift elements
Sort	O(n log n)	`Arrays.sort()` — Dual-Pivot Quicksort for primitives, TimSort for objects

java.util.Arrays Utilities¶

int[] arr = {5, 3, 1, 4, 2};

Arrays.sort(arr);                    // [1, 2, 3, 4, 5]
Arrays.fill(arr, 0);                 // [0, 0, 0, 0, 0]
int[] copy = Arrays.copyOf(arr, 10); // Copy with new length
String s = Arrays.toString(arr);     // "[0, 0, 0, 0, 0]"
boolean eq = Arrays.equals(a, b);    // Deep value comparison

2. ArrayList¶

Definition: A resizable array implementation of the List interface. Internally backed by an Object[] that grows automatically.

flowchart TD
    subgraph al[" ArrayList INTERNALS "]
        direction TB
        ARR["Object[] elementData"]
        SIZE["size = 3"]
        CAP["capacity = 10"]

        subgraph data[" Backing Array "]
            D0["[0] A"]
            D1["[1] B"]
            D2["[2] C"]
            D3["[3] null"]
            D4["[4] null"]
            D5["..."]
        end
    end

    ARR --> data

Key Properties¶

Property	Detail
Size	Dynamic — grows automatically (by ~50%)
Memory	Contiguous backing array + overhead for bookkeeping
Access	O(1) random access (like arrays)
Insertion	O(1) amortized at end, O(n) at arbitrary position
Deletion	O(n) — must shift elements to fill gap

Growth Strategy¶

When the backing array is full: 1. Allocate new array with newCapacity = oldCapacity + (oldCapacity >> 1) (~50% growth) 2. Copy all elements to the new array — Arrays.copyOf() 3. Old array becomes eligible for garbage collection

List<String> list = new ArrayList<>();  // Default capacity: 10
// After adding 11th element: capacity grows to 15
// After adding 16th element: capacity grows to 22

CRUD Quick Reference¶

List<String> items = new ArrayList<>(List.of("A", "B", "C"));

// CREATE
items.add("D");              // Append
items.add(1, "X");           // Insert at index 1

// READ
String first = items.get(0); // By index O(1)
int idx = items.indexOf("B");// By value O(n)

// UPDATE
items.set(0, "Z");           // Replace at index

// DELETE
items.remove(0);             // By index
items.remove("B");           // By value (first occurrence)

3. LinkedList¶

Definition: A doubly-linked list where each node holds a value and pointers to the previous and next nodes. Also implements Queue, Deque, and can be used as a Stack.

flowchart LR
    HEAD["head"] --> N1
    subgraph N1["Node 1"]
        V1["data: A"]
    end
    N1 <--> N2
    subgraph N2["Node 2"]
        V2["data: B"]
    end
    N2 <--> N3
    subgraph N3["Node 3"]
        V3["data: C"]
    end
    N3 --> TAIL["tail"]

Key Properties¶

Property	Detail
Size	Dynamic — grows node-by-node
Memory	Non-contiguous — each node is a separate heap object
Access	O(n) — must traverse from head or tail
Insert/Delete at ends	O(1) — just rewire pointers
Insert/Delete at middle	O(n) — must traverse to position first

ArrayList vs LinkedList¶

Operation	ArrayList	LinkedList	Winner
`get(i)`	O(1)	O(n)	ArrayList
`add(end)`	O(1) amortized	O(1)	Tie
`add(0, e)`	O(n) shift	O(1)	LinkedList
`remove(0)`	O(n) shift	O(1)	LinkedList
Memory per element	~4 bytes (ref)	~24 bytes (node)	ArrayList
Cache friendliness	Excellent	Poor	ArrayList
Iterator remove	O(n) shift	O(1)	LinkedList

Rule of thumb: Use ArrayList by default. Use LinkedList only when you need frequent insertions/removals at both ends (Queue/Deque pattern) or safe removal during iteration.

Multi-Interface Usage¶

LinkedList<String> ll = new LinkedList<>();

// As List
ll.add("A"); ll.get(0);

// As Queue (FIFO)
ll.offer("B"); ll.poll();     // Add to tail, remove from head

// As Deque (double-ended)
ll.offerFirst("C"); ll.offerLast("D");
ll.pollFirst(); ll.pollLast();

// As Stack (LIFO)
ll.push("E"); ll.pop();       // Add/remove from head

4. Enums¶

Definition: A special class type that defines a fixed set of named constants. Each constant is a public static final instance of the enum class.

flowchart TD
    subgraph enum[" DayOfTheWeek.class "]
        SUN["SUN (ordinal: 0)"]
        MON["MON (ordinal: 1)"]
        TUE["TUE (ordinal: 2)"]
        WED["WED (ordinal: 3)"]
        THU["THU (ordinal: 4)"]
        FRI["FRI (ordinal: 5)"]
        SAT["SAT (ordinal: 6)"]
    end

    VALUES["values()"] --> enum

What Enums Really Are¶

Under the hood, enum DayOfTheWeek { SUN, MON, ... } compiles to:

public final class DayOfTheWeek extends Enum<DayOfTheWeek> {
    public static final DayOfTheWeek SUN = new DayOfTheWeek("SUN", 0);
    public static final DayOfTheWeek MON = new DayOfTheWeek("MON", 1);
    // ...

    private DayOfTheWeek(String name, int ordinal) {
        super(name, ordinal);
    }

    public static DayOfTheWeek[] values() { ... }
    public static DayOfTheWeek valueOf(String name) { ... }
}

Enum Capabilities¶

Feature	Example
Built-in methods	`name()`, `ordinal()`, `values()`, `valueOf()`
Switch support	`switch(day) { case MON -> ... }`
Custom fields	`BACON(1.50)` — constructor with args
Custom methods	`getPrice()` — behavior per constant
Implement interfaces	`enum Op implements Calculable { ... }`

Golden Rules (from Effective Java)¶

Use enums instead of int constants (Item 34) — type safety, readability, iteration
Never derive data from ordinal() (Item 35) — use instance fields instead
Use EnumSet for sets of enums (Item 36) — replaces bit fields, same performance
Use EnumMap for enum-keyed maps (Item 37) — type-safe, array-backed internally

Key Internals to Understand¶

1. Array Memory Layout & Cache Performance¶

Arrays store elements in a contiguous block of memory. This is critical for performance because of CPU cache behavior.

flowchart LR
    subgraph cache[" CPU CACHE LINE (64 bytes) "]
        direction LR
        A0["arr[0]"]
        A1["arr[1]"]
        A2["arr[2]"]
        A3["arr[3]"]
        A4["arr[4]"]
        A5["arr[5]"]
        A6["arr[6]"]
        A7["arr[7]"]
    end

    CPU["CPU"] -->|"one fetch"| cache

When you access arr[0], the CPU fetches an entire cache line (~64 bytes). For int[], that's ~16 consecutive elements loaded at once. Sequential reads become almost free.

LinkedList nodes, by contrast, are scattered across the heap. Each node access may cause a cache miss, requiring a separate memory fetch.

Data Structure	Cache Behavior	Sequential Read Speed
`int[]`	Excellent — contiguous	~1 ns/element
`ArrayList`	Good — contiguous backing array	~2–3 ns/element
`LinkedList`	Poor — scattered nodes	~5–10 ns/element

2. ArrayList Resizing: Amortized O(1)¶

When ArrayList's backing array is full, it creates a new one ~50% larger and copies everything. This copy is O(n), but happens infrequently enough to be amortized O(1).

flowchart TD
    subgraph adds[" ADDING 17 ELEMENTS "]
        A1["Add 1–10: fits in initial capacity"]
        A2["Add 11: RESIZE to 15, copy 10 elements"]
        A3["Add 12–15: fits"]
        A4["Add 16: RESIZE to 22, copy 15 elements"]
        A5["Add 17: fits"]
    end

    A1 --> A2 --> A3 --> A4 --> A5

Amortized analysis: Over n insertions, total copy cost ≈ n + n/2 + n/4 + ... ≈ 2n. Divided by n insertions = O(1) per insertion on average.

Performance Tip

If you know the final size, use new ArrayList<>(expectedSize) to avoid all resizing overhead.

3. ConcurrentModificationException & Iterator Safety¶

The modCount mechanism is Java's fast-fail protection against structural modification during iteration.

flowchart TD
    subgraph safe[" ✅ SAFE: Iterator.remove() "]
        S1["Iterator tracks modCount"]
        S2["remove() updates BOTH list and iterator's expectedModCount"]
        S3["Next call: modCount == expectedModCount ✓"]
    end

    subgraph unsafe[" ❌ UNSAFE: List.remove() during for-each "]
        U1["For-each uses hidden Iterator"]
        U2["list.remove() increments modCount"]
        U3["Iterator.next() checks: modCount ≠ expectedModCount"]
        U4["Throws ConcurrentModificationException!"]
    end

// ❌ THROWS ConcurrentModificationException
for (String item : list) {
    if (item.equals("remove me")) {
        list.remove(item);  // Modifies list behind iterator's back!
    }
}

// ✅ SAFE — Iterator.remove() keeps state in sync
Iterator<String> it = list.iterator();
while (it.hasNext()) {
    if (it.next().equals("remove me")) {
        it.remove();  // Updates both modCount and expectedModCount
    }
}

4. Autoboxing: What the JVM Actually Does¶

When you write Integer x = 42;, the compiler inserts Integer x = Integer.valueOf(42);. This is autoboxing.

flowchart LR
    subgraph boxing[" AUTOBOXING "]
        P1["int 42"] -->|"Integer.valueOf(42)"| W1["Integer object"]
    end

    subgraph unboxing[" UNBOXING "]
        W2["Integer object"] -->|"intValue()"| P2["int 42"]
    end

The Integer Cache Trap¶

Integer.valueOf() caches values between -128 and 127. This creates a subtle identity trap:

Integer a = 127;  // Cached: Integer.valueOf(127)
Integer b = 127;  // Same cached object!
System.out.println(a == b);      // true — same object!

Integer c = 128;  // NOT cached: new Integer(128)
Integer d = 128;  // Different new Integer(128)
System.out.println(c == d);      // false — different objects!
System.out.println(c.equals(d)); // true — same VALUE

Value Range	`==` Behavior	Why
-128 to 127	Returns `true`	Same cached `Integer` object
< -128 or > 127	Returns `false`	Different heap objects
Any range with `.equals()`	Correct	Compares values, not references

Always Use .equals() for Boxed Types

Never use == on Integer, Long, Double, etc. The caching behavior makes == unreliable and unpredictable.

Performance Cost of Autoboxing¶

// ❌ SLOW: Autoboxing in a loop — creates ~millions of Integer objects!
Long sum = 0L;
for (long i = 0; i < Integer.MAX_VALUE; i++) {
    sum += i;  // Unbox sum, add, re-box — every iteration!
}

// ✅ FAST: Use primitives
long sum = 0L;
for (long i = 0; i < Integer.MAX_VALUE; i++) {
    sum += i;  // No boxing at all
}

5. Enum Internals: How the JVM Handles Enums¶

Enums look like simple constants, but the JVM treats them as full classes:

flowchart TD
    subgraph loading[" CLASS LOADING "]
        L1["JVM loads enum class"]
        L2["Creates static final instances"]
        L3["Populates $VALUES array"]
        L4["Enum constants ready"]
        L1 --> L2 --> L3 --> L4
    end

    subgraph guarantees[" JVM GUARANTEES "]
        G1["Singleton per constant"]
        G2["Serialization-safe"]
        G3["Reflection-proof"]
        G4["Thread-safe initialization"]
    end

    loading --> guarantees

Key guarantees: - Each enum constant is instantiated exactly once by the JVM - Enum constructors are called during class loading (static initialization) - values() returns a clone of the internal $VALUES array each time - Enums are inherently thread-safe — no synchronization needed

Design Patterns & Best Practices¶

Choosing the Right Data Structure¶

flowchart TD
    START[Need to store multiple elements?] --> Q1{Fixed size known?}
    Q1 -->|Yes, primitives| ARR["Array: int[], double[]"]
    Q1 -->|Yes, objects| Q2{Will size change later?}
    Q2 -->|No| ARR2["Array or List.of()"]
    Q2 -->|Yes| AL["ArrayList"]
    Q1 -->|No| Q3{Primary operation?}
    Q3 -->|"Random access"| AL
    Q3 -->|"Queue/Deque"| LL["LinkedList or ArrayDeque"]
    Q3 -->|"Insert/remove at ends"| LL
    Q3 -->|"General purpose"| AL

Array ↔ ArrayList Conversions¶

// Array → ArrayList (mutable)
String[] arr = {"A", "B", "C"};
List<String> list = new ArrayList<>(Arrays.asList(arr));

// ArrayList → Array
String[] back = list.toArray(new String[0]);

// Immutable list from values
List<String> immutable = List.of("A", "B", "C");

Effective Java Best Practices Applied¶

Practice	Item	Rationale
Prefer lists to arrays	Item 28	Compile-time type safety vs runtime errors
Use enums, not int constants	Item 34	Type safety, namespace, iteration, printing
Use instance fields, not ordinals	Item 35	Order-independent, allows duplicates and gaps
Use `EnumSet` for flag sets	Item 36	Same speed as bit fields, far more readable
Use `EnumMap` for enum keys	Item 37	Type-safe, internally array-backed

Common Pitfalls¶

1. Array Index Errors¶

int[] arr = new int[5];
arr[5] = 10;  // ❌ ArrayIndexOutOfBoundsException! (valid: 0-4)

2. Arrays.asList() Returns Fixed-Size List¶

List<String> list = Arrays.asList("A", "B", "C");
list.add("D");    // ❌ UnsupportedOperationException!
list.set(0, "Z"); // ✅ Modification of existing elements is OK

3. Autoboxing == Trap¶

Integer a = 200;
Integer b = 200;
System.out.println(a == b);      // ❌ false! (outside cache range)
System.out.println(a.equals(b)); // ✅ true

4. ConcurrentModificationException¶

for (String s : list) {
    list.remove(s);  // ❌ ConcurrentModificationException!
}
// ✅ Use Iterator.remove() or removeIf()
list.removeIf(s -> s.equals("target"));

5. NullPointerException from Unboxing¶

Integer boxed = null;
int value = boxed;  // ❌ NullPointerException! (unboxing null)

6. Relying on ordinal()¶

// ❌ BAD: Breaks if constants are reordered
int index = myEnum.ordinal();  // Don't use this for logic!

// ✅ GOOD: Use explicit fields
double price = myTopping.getPrice();  // Instance method

Best Practices Checklist¶

Learning Resources¶

Array Memory & Performance¶

ArrayList Internals¶

LinkedList vs ArrayList¶

Autoboxing & Integer Cache¶

Enums Deep Dive¶

Effective Java¶

References¶

Course: Tim Buchalka - Java Programming Masterclass (Sections 9–10)
Book: Effective Java - Joshua Bloch (Items 28, 34–39)
API: java.util.Arrays
API: java.util.ArrayList
API: java.util.LinkedList
API: java.util.Iterator
API: java.lang.Enum
API: java.lang.Integer

Completed: 2026-02-11 | Confidence: 9/10

Summary: Arrays, Lists & Autoboxing¶

Topic Overview¶

The Four Data Structures¶

1. Arrays¶

Key Properties¶

Essential Operations & Big O¶

java.util.Arrays Utilities¶

2. ArrayList¶

Key Properties¶

Growth Strategy¶

CRUD Quick Reference¶

3. LinkedList¶

Key Properties¶

ArrayList vs LinkedList¶

Multi-Interface Usage¶

4. Enums¶

What Enums Really Are¶

Enum Capabilities¶

Golden Rules (from Effective Java)¶

Key Internals to Understand¶

1. Array Memory Layout & Cache Performance¶

2. ArrayList Resizing: Amortized O(1)¶

3. ConcurrentModificationException & Iterator Safety¶

4. Autoboxing: What the JVM Actually Does¶

The Integer Cache Trap¶

Performance Cost of Autoboxing¶

5. Enum Internals: How the JVM Handles Enums¶

Design Patterns & Best Practices¶

Choosing the Right Data Structure¶

Array ↔ ArrayList Conversions¶

Effective Java Best Practices Applied¶

Common Pitfalls¶

1. Array Index Errors¶

2. Arrays.asList() Returns Fixed-Size List¶

3. Autoboxing == Trap¶

4. ConcurrentModificationException¶

5. NullPointerException from Unboxing¶

6. Relying on ordinal()¶

Best Practices Checklist¶

Learning Resources¶

Array Memory & Performance¶

ArrayList Internals¶

LinkedList vs ArrayList¶

Autoboxing & Integer Cache¶

Enums Deep Dive¶

Effective Java¶

Related Topics¶

References¶