Efficient Cardiac Interval Estimation from Seismocardiography Using Wavelet-Based Neural Networks

Before continuing please visit: Master Thesis

TL;DR

Design, train, and embed a wavelet-front-end neural pipeline (DWT / WPT / scattering, with a CNN, LSTM, or TCN backbone) for cardiac-interval regression from SCG, and benchmark it against a raw-signal baseline on the nRF52840 to quantify the accuracy–energy trade-off under near-sensor constraints.

Motivation

Seismocardiography (SCG) measures heart-induced chest vibrations and enables estimation of key cardiac intervals such as the Pre-Ejection Period (PEP), Left-Ventricular Ejection Time (LVET), Isovolumic Contraction/Relaxation Times (ICT/IRT), and the Myocardial Performance Index (MPI). However, reliable extraction of these intervals is challenging due to noise, inter-subject variability, and subtle waveform features.Recent deep learning approaches operate directly on raw SCG signals. While effective, they require large models and datasets, making them inefficient for embedded systems such as wearable devices. SCG signals exhibit structured time–frequency patterns that can be captured using wavelet-based methods. A prior study from the Smart Sensors Group demonstrated that the Discrete Wavelet Transform (DWT) provides an efficient, zero-parameter representation on embedded hardware (nRF52840), but did not integrate this into a full learning pipeline.

This thesis addresses this gap by combining wavelet-based representations with neural networks and evaluating their efficiency under embedded constraints.

Candidate architectures (Arch A: DWT front-end, Arch B: WPT front-end, Arch C: scattering front-end) sketched in the precursor study

Figure: Candidate architectural directions sketched in the precursor study (Arch A — DWT front-end, Arch B — WPT front-end, Arch C — scattering front-end). For illustration only — the concrete decomposition depths, retained subbands / leaves, fusion strategies, and downstream backbone (CNN, LSTM, TCN, or hybrid) are open design choices to be explored within this thesis.

Research Question

To what extent does a fixed multi-resolution wavelet front-end (DWT, WPT, scattering) reduce the parameter, sample, and energy budgets of neural cardiac-interval regression from SCG, relative to raw-signal baselines, under embedded constraints?

Objectives

The thesis will deliver a reproducible, end-to-end study covering the following objectives. Concrete decomposition depths, retained subband subsets, and front-end hyperparameters are outcomes of the investigation, not preconditions: they are to be determined empirically through principled sensitivity studies, not fixed a priori.

The downstream neural regressor backbone is also a design degree of freedom. The student is free to choose between CNN, LSTM/GRU, Temporal Convolutional Network (TCN), or hybrid variants for each architecture, and is encouraged to motivate the choice in light of SCG's local morphology versus its longer-range temporal dependencies. The same backbone family must, however, be used consistently for fair comparison within a given experiment, and the choice must be reported alongside the matched-budget constraint.

Work Plan and Timeline

The thesis is structured into six phases (M1–M6) over ~26 weeks, each ending in a concrete milestone.

Phase Weeks Scope Key Activities Milestone (finishing criteria)
M1 — Onboarding & reproduction 1 – 4 Alignment with the precursor study; project setup. Repo / dataset / lab onboarding; reproduce the wavelet-family benchmark and embedded Pareto from the precursor study; literature review on wavelet–NN hybrids for biosignals; freeze the evaluation protocol (cross-validation scheme, metrics, seeds, splits, statistical-comparison procedure); choose and motivate the downstream backbone family (CNN / LSTM / GRU / TCN / hybrid). Reproduced precursor figures + frozen protocol document committed to GitLab. (mandatory introductory presentation)
M2 — Arch A: DWT front-end + regressor 5 – 9 First structured-input pipeline; first head-to-head against the raw baseline. Build the multi-branch / multi-channel regressor over the chosen DWT subbands; subband-selection and decomposition-depth sensitivity sweep; cross-validated training and hyperparameter search; train a matched-budget raw-signal baseline in the same backbone family; first comparison vs. baseline on PEP / LVET / ICT / IRT / MPI. Trained Arch A model, comparison table vs. raw baseline, sensitivity-sweep results.
M3 — Arch B & Arch C: WPT and scattering front-ends 10 – 14 Wavelet-packet and analytic-scattering variants; cross-architecture comparison closes here. Wavelet-packet decomposition with relevance-driven leaf pruning (energy / mutual information / learned attention) and fusion with coarser subbands; analytic scattering front-end (e.g. Kymatio) with depth and Q-factor sweep; consistent backbone across runs; cross-architecture sensitivity studies (filter family, decomposition depth, pruning policy); paired-bootstrap statistical comparison and CCC matrix across all four pipelines. Complete architectural comparison report; mid-term presentation (optional).
M4 — Embedded realisation 15 – 19 Embedded port and on-silicon characterisation. Quantisation (int8 / q15) of the favoured architecture; CMSIS-NN integration; RIOT-OS firmware integration with the existing q15 wavelet cascade; PPK2 energy capture; latency, RAM, and flash-footprint measurement; quantisation-induced accuracy delta. On-device inference with measured accuracy–energy–footprint Pareto.
M5 — Exploratory / robustness study 20 – 22 One deeper investigation, chosen at end of M4. Either subband-driven distributed inference: which subbands, transmitted at what rate, preserve interval accuracy under a BLE-class uplink budget — or robustness analysis: motion artefacts, posture changes, sensor placement, additive noise, and inter-subject / inter-session generalisation under cohort or recording-condition shift. The choice is motivated by progress and findings up to M4. One additional analysis chapter.
M6 — Writing & defence 23 – 26 Thesis, defence, and reproducible release. Thesis writing; defence preparation; tagged release of the reproducible code repository (training, evaluation, embedded firmware, measurement notebooks); optional draft of a short workshop / conference paper. Submitted thesis, tagged GitLab repository, final defence.

Deliverables

  1. Written thesis in LaTeX (Smart Sensor Group template).
  2. Reproducible code repository (training, evaluation, embedded firmware) with tagged release.
  3. Trained model checkpoints and evaluation scripts.
  4. nRF52840 firmware image plus build/flash documentation.
  5. PPK2 measurement logs and post-processing notebooks.
  6. Final defence presentation.
  7. (Optional) draft of a short conference / workshop paper.

Working Mode and Expectations

Evaluation Criteria for the Thesis

The Work Packages and the Deliverables define the finishing criteria, i.e., what must be in place for the thesis to be considered complete. They do not define the grade. The final grade is determined holistically by the examiner across the full body of work, weighing in particular:

Required Skills

Required

Desirable

Application Process

Contact: please send a single PDF to the superviosor mentioned above; reference "MA—DWT-NN-SCG" in the email subject line.

Further Reading