Descriptive analysis of circular data with outliers using Python programming language

Authors

  • N.S. Zulkipli Centre for Mathematical Sciences, College of Computing and Applied Sciences, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang, Malaysia
  • S.Z. Satari Centre for Mathematical Sciences, College of Computing and Applied Sciences, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang, Malaysia
  • W.N.S. Wan Yusoff Centre for Mathematical Sciences, College of Computing and Applied Sciences, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang, Malaysia

DOI:

https://doi.org/10.15282/daam.v1i01.5085

Keywords:

Circular data; descriptive analysis; python; programming language; outlier

Abstract

Descriptive statistics are commonly used in data analysis to describe the basic features of raw data. Descriptive summaries enable us to present the data in a more simple and meaningful way so that the interpretation will be easier to understand. The descriptive analysis of circular data with outliers is discussed in this study. Circular data is different from linear data in many aspects such as statistical modeling, descriptive statistics and etc. Hence, unlike linear data, the availability of statistical software specialises in analysing circular data is very limited. Python is a programming language which frequently used by data analysts nowadays. However, the package for circular statistics is not fully developed and it is not ready to use like in Splus or R programming language. In this study, the descriptive analysis of circular data is performed using the in-demand programming language, Python. Descriptive statistics of the circular data especially with the existence of outliers are discussed and the proposed Python code is available to use.

References

Hasan Jammalamadaka SR, Sengupta A. Topics in circular statistics. Singapore: World Scientific Publishing; 2001.

Fisher NI, Lewis T, Embleton BJ. Statistical analysis of spherical data. New York: Cambridge university press; 1993 Aug 19.

Mardia KV. Statistics of directional data. Journal of the Royal Statistical Society Series B: Statistical Methodology. 1975 Jul;37(3):349-71.

Best DJ, Fisher NI. The BIAS of the maximum likelihood estimators of the von Mises-Fisher concentration parameters: the BIAS of the maximum likelihood estimators. Communications in Statistics-Simulation and Computation. 1981 Jan 1;10(5):493-502.

Satari SZ, Khalif KM. Review on outliers identification methods for univariate circular biological data. Advances in Science, Technology and Engineering Systems. 2020;5(2):95-103.

Hassan SF, Hussin AG, Zubairi YZ. Analysis of Malaysian wind direction data using ORIANA. Modern Applied Science. 2009 Mar;3(3):115-9.

Mardia KV, Jupp PE. Directional statistics. London: John Wiley & Sons; 2009 Sep 25.

Rousseeuw PJ, Hampel FR, Ronchetti EM, Stahel WA. Robust statistics: the approach based on influence functions. New York: Wiley; 1986.

Chen CP, Zhang CY. Data-intensive applications, challenges, techniques and technologies: A Survey on Big Data. Information Sciences. 2014 Aug 10;275:314-47.

Cass S. The top programming languages: Our latest rankings put Python on top-again-[Careers]. IEEE Spectrum. 2020 Jul 28;57(8):22-22.

Python. The Python Standard Library [Online]. Retrieved from https://docs.python.org/3/library/index.html: 18 August 2020.

Ferguson DE, Landreth HF, Mckeown JP. Sun compass orientation of the northern cricket frog, Acris crepitans. Animal Behaviour. 1967 Jan 1;15(1):45-53.

Collett D. Outliers in circular data. Journal of the Royal Statistical Society: Series C (Applied Statistics). 1980 Mar;29(1):50-7.

Abuzaid AH, Hussin AG, Rambli A, Mohamed I. Statistics for a new test of discordance in circular data. Communications in Statistics-Simulation and Computation. 2012 Nov 1;41(10):1882-90.

Badarisam F, Rambli A, Sidik M. A comparison on two discordancy tests to detect outlier in von mises (VM) sample. Indonesian Journal of Electrical Engineering and Computer Science. 2020 Jul;19(1):156.

Downloads

Published

2020-12-01 — Updated on 2020-12-31

Versions

How to Cite

Zulkipli, N., Satari, S., & Wan Yusoff, W. (2020). Descriptive analysis of circular data with outliers using Python programming language. Data Analytics and Applied Mathematics (DAAM), 1(1), 31–36. https://doi.org/10.15282/daam.v1i01.5085 (Original work published December 1, 2020)

Issue

Section

Research Articles