University calendar

Data Science MS Thesis Defense by Bhavana Gowda

Wednesday, January 21, 2026 at 1:00pm to 2:00pm

College of Engineering - Data Science Master's Thesis Defense

“Cn² Prediction Using Machine Learning Models for Marine Wave Boundary Layer”

By Bhavana Gowda

Thesis Advisor: Professor Miles A. Sundermeyer

Committee Members: Professor Donghui Yan and Professor Ashokkumar Patel

Abstract: The refractive index structure parameter, Cn², quantifies optical turbulence and strongly influences the performance of electromagnetic and free-space optical systems operating within the marine wave boundary layer. Predicting Cn² remains challenging due to nonlinear interactions among atmospheric variables and shows strong changes between daytime and nighttime, especially in coastal environments. This thesis examines these challenges using high-frequency meteorological data from ATI sonic anemometers and co-located Scintillometer observations collected at the SMAST Pier in Buzzards Bay, Massachusetts, during the summer and fall of 2025.

A Random Forest (RF) modeling framework is developed to evaluate not only predictive accuracy, but also the conditions under which machine-learning-based estimates of Cn² agree or disagree with observations. Atmospheric data are strongly time-dependent, meaning that common random train–test splits can introduce information leakage and overestimate model skill. To avoid this problem, the analysis uses a blocked, leakage-free train-gap-test strategy based on decorrelation time scales. Low-frequency and seasonal background trends are removed using variance-preserving spectral filtering, and time-lagged features are included based on correlation analyses to better represent the physical memory of the system.

Separate RF models developed for summer and fall are evaluated using within-season validation. Results show moderate overall predictive skill, with clear differences between daytime and nighttime performance. Nighttime predictions consistently explain more of the observed variability in log₁₀(Cn²) than daytime predictions, particularly during summer. This improved nighttime performance may be linked to more stable and less varying turbulence conditions after sunset. In contrast, daytime predictions show larger errors, likely due to stronger convection and rapidly changing surface-layer processes.

Overall, this study shows that the reliability of machine-learning-based Cn² predictions depends strongly on temporal structure, diurnal regime, and how autocorrelation is handled. The findings highlight the importance of using regime-aware validation and variance-based diagnostics rather than relying only on single accuracy metrics. This work provides practical insight into when ML-based turbulence predictions can be trusted in coastal marine environments, and offers guidance for developing more robust and physically consistent forecasting approaches.

For additional information please contact Miles Sundermeyer at msundermeyer@umassd.edu.

Zoom Meeting
https://umassd.zoom.us/j/94776945291?pwd=KakMJx8EXYcb1g7OCiQReebZZxVzf0.1

Back to top of screen