What is Fork/Join Framework in Java?

Tuanh.netTuanh.net
4 min read

1. What is the Fork/Join Framework?

The Fork/Join Framework is part of the java.util.concurrent package, introduced in Java 7. It is designed for tasks that can be recursively divided into smaller chunks, where each chunk can be processed independently. The framework works on the principle of divide and conquer, making it ideal for tasks like sorting, searching, and other recursive algorithms.

2. Implementing the Fork/Join Framework

2.1 ForkJoinTask

ForkJoinTask is the base class for tasks that can run within the Fork/Join Framework. It provides the core operations that allow tasks to fork new subtasks and join them once they are complete.

Example:

import java.util.concurrent.RecursiveTask;

public class SumTask extends RecursiveTask<Integer> {
private final int[] arr;
private final int start, end;

public SumTask(int[] arr, int start, int end) {
this.arr = arr;
this.start = start;
this.end = end;
}

@Override
protected Integer compute() {
if (end - start <= 10) {
int sum = 0;
for (int i = start; i < end; i++) {
sum += arr[i];
}
return sum;
} else {
int mid = (start + end) / 2;
SumTask leftTask = new SumTask(arr, start, mid);
SumTask rightTask = new SumTask(arr, mid, end);

leftTask.fork();
int rightResult = rightTask.compute();
int leftResult = leftTask.join();

return leftResult + rightResult;
}
}
}

2.2 ForkJoinPool

ForkJoinPool is the central class that manages a pool of worker threads to execute ForkJoinTask instances. It uses a work-stealing algorithm to keep all threads busy by redistributing tasks from busy threads to idle ones.

Example:

import java.util.concurrent.ForkJoinPool;

public class ForkJoinExample {
public static void main(String[] args) {
int[] arr = new int[100];
for (int i = 0; i < arr.length; i++) {
arr[i] = i + 1;
}

ForkJoinPool pool = new ForkJoinPool();
SumTask task = new SumTask(arr, 0, arr.length);

int result = pool.invoke(task);
System.out.println("Sum: " + result);
}
}

2.3 RecursiveTask vs RecursiveAction

RecursiveTask is used when your task returns a result, while RecursiveAction is used when it does not return any result.

Example using RecursiveAction:

import java.util.concurrent.RecursiveAction;

public class PrintTask extends RecursiveAction {
private final int[] arr;
private final int start, end;

public PrintTask(int[] arr, int start, int end) {
this.arr = arr;
this.start = start;
this.end = end;
}

@Override
protected void compute() {
if (end - start <= 10) {
for (int i = start; i < end; i++) {
System.out.print(arr[i] + " ");
}
System.out.println();
} else {
int mid = (start + end) / 2;
PrintTask leftTask = new PrintTask(arr, start, mid);
PrintTask rightTask = new PrintTask(arr, mid, end);

invokeAll(leftTask, rightTask);
}
}
}

2.4 Demo and Results

Running the ForkJoinExample will output the sum of the array elements. The Fork/Join Framework divides the task into smaller chunks and processes them in parallel, showing significant performance improvements, especially with large datasets.

For instance, summing up the numbers from 1 to 100:

Sum: 5050

In the case of the PrintTask, the framework divides the array printing task, executing it in parallel and outputting the segments simultaneously:

1 2 3 4 5 6 7 8 9 10 
11 12 13 14 15 16 17 18 19 20
...

3. Various Dimensions of the Fork/Join Framework

3.1 Advantages of Fork/Join

  • Efficiency: Utilizes all available CPU cores, leading to faster task execution.
  • Scalability: Can handle large datasets by breaking them down into smaller, manageable tasks.
  • Work-stealing: Keeps all threads busy by redistributing tasks from overloaded threads to idle ones.

3.2 Disadvantages of Fork/Join

  • Complexity: Requires careful design and understanding of parallelism, which can increase code complexity.
  • Overhead: Forking and joining tasks have inherent overhead, which might not be beneficial for smaller tasks.
  • Debugging: Parallel tasks can be challenging to debug due to the non-deterministic nature of thread execution.

3.3 When to Use Fork/Join

  • Large Recursive Problems: When you have tasks that naturally divide into smaller sub-tasks, such as sorting, searching, and matrix multiplication.
  • 1
  • CPU-bound Operations: Tasks that require intensive CPU computations and can benefit from parallel execution.

4. Conclusion

The Fork/Join Framework is a powerful tool in Java for optimizing parallel processing tasks. It excels in scenarios where tasks can be broken down into smaller subtasks, executed independently, and then combined to produce the final result. While it introduces complexity, the performance benefits in multi-core environments often outweigh the downsides, making it an excellent choice for CPU-bound and large recursive problems.

Read more at : What is Fork/Join Framework in Java?

0
Subscribe to my newsletter

Read articles from Tuanh.net directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Tuanh.net
Tuanh.net