Logo del repository
  1. Home
 
Opzioni

Parallelization of the pre-stack depth migration code

Enayati, Mohammad
2026-03-27
  • other

Abstract
Pre-stack depth migration (PSDM) is a computationally intensive algorithm widely used in seismic imaging to accurately position subsurface reflectors. Its high computational cost, largely dominated by deeply nested loops and irregular memory access patterns, makes performance optimization essential for large-scale seismic processing. This thesis investigates performance improvements of a PSDM kernel through enhanced vectorization and pure OpenMP-based parallelization. Initially, the code was analyzed to identify opportunities for improving compiler auto-vectorization within the most computationally demanding loops. Loop restructuring, reduction of data dependencies, and improved memory access patterns were applied to increase vectorization efficiency and better utilize modern CPU vector units. However, due to varying loop bounds across iterations, relying solely on SIMD optimization limits both workload balance and overall scalability. In such cases, combining outer-loop parallelism with inner-loop compiler auto-vectorization typically provides better control over workload distribution and vectorization efficiency. Building on this observation, the optimized OpenMP implementation parallelizes higher-level loops while preserving the previously improved vectorized kernels. Special attention was given to workload distribution, scheduling strategies, and memory access patterns in order to minimize thread contention and maximize processor utilization. To evaluate the effectiveness of the proposed approach, both strong scaling and weak scaling studies were conducted to analyze performance behavior as the number of processing cores and problem size vary. In addition to performance analysis, energy consumption considerations were incorporated by monitoring the energy usage of the OpenMP-parallelized implementation, allowing an assessment of the trade-offs between performance gains and energy efficiency on multi-core systems. Experimental results demonstrate that the optimized implementation achieves significant performance improvements over the baseline version while maintaining favorable energy characteristics. Overall, the findings highlight that combining improved vectorization with scalable thread-level parallelism and careful performance analysis can substantially accelerate PSDM workloads on modern high-performance computing architectures.
Archivio
https://hdl.handle.net/20.500.11767/151811
https://ricerca.unityfvg.it/handle/20.500.11767/151811
Diritti
open access
license:non specificato
license uri:na
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback