We report a statistical analysis of the soundscape of human agglomerations. Specifically, we investigate the normalized sound amplitudes and intensities that emerge from collective human meetings. Our findings support the existence of non-trivial dynamics characterized by heavy-tailed distributions of the sound amplitudes, long-range correlations in the sound intensity, and non-exponential distributions of the return intervals. Additionally, motivated by the time-dependent behavior of the volatility (variance) series, we compare the observational data with data generated by a minimalist autoregressive stochastic model, namely the generalized autoregressive conditional heteroskedastic (GARCH) process, and find good agreement.
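
For readers unfamiliar with the GARCH process named above, the following is a minimal sketch of its simplest variant, GARCH(1,1), in which the conditional variance at each step depends on the previous squared value and the previous variance. The order (1,1), the function name `simulate_garch11`, the Gaussian innovations, and the parameter values `omega`, `alpha`, `beta` are all illustrative assumptions; they are not taken from the analysis reported here.

```python
import random


def simulate_garch11(n, omega=0.1, alpha=0.1, beta=0.8, seed=0):
    """Simulate n steps of a GARCH(1,1) process.

    All parameter values are illustrative assumptions, not fitted values.
    Returns (series, variances), two lists of length n.
    """
    if omega <= 0 or alpha < 0 or beta < 0:
        raise ValueError("need omega > 0 and alpha, beta >= 0")
    if alpha + beta >= 1:
        raise ValueError("alpha + beta must be < 1 for a finite unconditional variance")

    rng = random.Random(seed)
    # Start from the unconditional (stationary) variance.
    var = omega / (1.0 - alpha - beta)

    series, variances = [], []
    for _ in range(n):
        # Draw the observation with the current conditional variance.
        x = (var ** 0.5) * rng.gauss(0.0, 1.0)
        series.append(x)
        variances.append(var)
        # Update the conditional variance for the next step.
        var = omega + alpha * x * x + beta * var
    return series, variances


if __name__ == "__main__":
    series, variances = simulate_garch11(1000)
    print(len(series), min(variances) > 0)
```

Because the variance update feeds on past squared values, large fluctuations tend to cluster in time, which is the kind of time-dependent volatility that motivates comparing such a model with the observational data.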