References

Abadi, Martín, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems.” https://www.tensorflow.org/.
Adler, Daniel, Christian Gläser, Oleg Nenadic, Jens Oehlschlägel, Martijn Schuemie, and Walter Zucchini. 2022. Ff: Memory-Efficient Storage of Large Data on Disk and Fast Access Functions. https://CRAN.R-project.org/package=ff.
Aldrich, Eric M. 2014. Chapter 10 - GPU Computing in Economics.” In Handbook of Computational Economics Vol. 3, edited by Karl Schmedders and Kenneth L. Judd, 3:557–98. Handbook of Computational Economics. Elsevier. https://doi.org/https://doi.org/10.1016/B978-0-444-52980-0.00010-4.
Aldrich, Eric M., Jesus Fernandez-Villaverde, A. Ronald Gallant, and Juan F. Rubio-Ramirez. 2011. Tapping the supercomputer under your desk: Solving dynamic equilibrium models with graphics processors.” Journal of Economic Dynamics and Control 35 (3): 386–93. https://doi.org/https://doi.org/10.1016/j.jedc.2010.10.001.
Allaire, JJ, and François Chollet. 2022. Keras: R Interface to ’Keras’. https://CRAN.R-project.org/package=keras.
Angrist, Joshua D, and Joern-Steffen Pischke. 2008. Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton University Press.
Bates, Douglas, Martin Maechler, and Mikael Jagan. 2022. Matrix: Sparse and Dense Matrix Classes and Methods. https://CRAN.R-project.org/package=Matrix.
Bengtsson, Henrik. 2021. A Unifying Framework for Parallel and Distributed Processing in R using Futures.” The R Journal 13 (2): 273–91. https://doi.org/10.32614/RJ-2021-048.
Burns, Patrick. 2011. The r Inferno. Lulu Press, Inc. https://www.burns-stat.com/pages/Tutor/R_inferno.pdf.
Chang, Winston, Javier Luraschi, and Timothy Mastny. 2020. Profvis: Interactive Visualizations for Profiling r Code. https://CRAN.R-project.org/package=profvis.
Cheng, Heng-Tze, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, et al. 2017. TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks.” In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1763–71. KDD ’17. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/3097983.3098171.
Chollet, François et al. 2015. “Keras.” https://keras.io [last visited: 16/05/2023].
Cohen, Lauren, and Christopher J. Malloy. 2014. “Friends in High Places.” American Economic Journal: Economic Policy 6 (3): 63–91. https://doi.org/10.1257/pol.6.3.63.
de Jonge, Edwin, Jan Wijffels, and Jan van der Laan. 2023. Ffbase: Basic Statistical Functions for Package ’Ff’. https://github.com/edwindj/ffbase.
Determan, Charles. 2019. gpuR: GPU Functions for r Objects. http://github.com/cdeterman/gpuR.
Dhillon, Paramveer, Yichao Lu, Dean P. Foster, and Lyle Ungar. 2013. “New Subsampling Algorithms for Fast Least Squares Regression.” In Advances in Neural Information Processing Systems 26, 360–68.
Donoho, David. 2017. 50 years of data science.” Journal of Computational and Graphical Statistics 26 (4): 745–66. https://doi.org/10.1080/10618600.2017.1384734.
Dowle, Matt, and Arun Srinivasan. 2022. Data.table: Extension of ‘Data.frame‘. https://CRAN.R-project.org/package=data.table.
Fatahalian, K., J. Sugerman, and P. Hanrahan. 2004. Understanding the Efficiency of GPU Algorithms for Matrix-Matrix Multiplication.” In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, 133–37. HWWS ’04. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/1058129.1058148.
Frick, Hannah, Fanny Chow, Max Kuhn, Michael Mahoney, Julia Silge, and Hadley Wickham. 2022. Rsample: General Resampling Infrastructure. https://CRAN.R-project.org/package=rsample.
Fultz, Neal, and Gergely Daróczi. 2019. AWR.athena: ’AWS’ Athena ’DBI’ Wrapper. https://CRAN.R-project.org/package=AWR.Athena.
Gaure, Simen. 2013a. lfe: Linear Group Fixed Effects.” The R Journal 5 (2): 104–16. https://doi.org/10.32614/RJ-2013-031.
———. 2013b. OLS with Multiple High Dimensional Category Variables.” Computational Statistics & Data Analysis 66: 8–18. https://doi.org/https://doi.org/10.1016/j.csda.2013.03.024.
Gentzkow, Matthew, and Jesse M. Shapiro. 2010. What drives media slant? Evidence from U.S. daily newspapers.” Econometrica 78 (1): 35–71. https://doi.org/https://doi.org/10.3982/ECTA7195.
Gentzkow, Matthew, Jesse M. Shapiro, and Matt Taddy. 2019. Measuring group differences in high-dimensional choices: Method and application to congressional speech.” Econometrica 87 (4): 1307–40. https://doi.org/https://doi.org/10.3982/ECTA16566.
Hester, Jim, and Davis Vaughan. 2021. Bench: High Precision Timing of r Expressions. https://CRAN.R-project.org/package=bench.
Hester, Jim, Hadley Wickham, and Gábor Csárdi. 2023. Fs: Cross-Platform File System Operations Based on ’Libuv’. https://CRAN.R-project.org/package=fs.
Højsgaard, Søren, and Ulrich Halekoh. 2023. doBy: Groupwise Statistics, LSmeans, Linear Estimates, Utilities. https://CRAN.R-project.org/package=doBy.
Johnston, Myfanwy, and David Robinson. 2022. Gutenbergr: Download and Process Public Domain Works from Project Gutenberg. https://CRAN.R-project.org/package=gutenbergr.
Kane, Michael J., John Emerson, and Stephen Weston. 2013. “Scalable Strategies for Computing with Massive Data.” Journal of Statistical Software 55 (14): 1–19. https://www.jstatsoft.org/article/view/v055i14.
Karami, Amir, Aryya Gangopadhyay, Bin Zhou, and Hadi Kharrazi. 2017. “Fuzzy Approach Topic Discovery in Health and Medical Corpora.” International Journal of Fuzzy Systems 20: 1334–45.
Kincaid, Dave, and Kevin Kuo. 2023. Sparknlp: R Interface to John Snow Labs Spark NLP.
Kratochvil, Mirek. 2022. Scattermore: Scatterplots with More Points. https://CRAN.R-project.org/package=scattermore.
Kratochvíl, Miroslav, David Bednárek, Tomáš Sieger, Karel Fišer, and Jiří Vondrášek. 2020. ShinySOM: graphical SOM-based analysis of single-cell cytometry data.” Bioinformatics 36 (10): 3288–89. https://doi.org/10.1093/bioinformatics/btaa091.
Kuhn, Max, and Davis Vaughan. 2022. Parsnip: A Common API to Modeling and Analysis Functions. https://CRAN.R-project.org/package=parsnip.
Kuhn, Max, Davis Vaughan, and Emil Hvitfeldt. 2022. Yardstick: Tidy Characterizations of Model Performance. https://CRAN.R-project.org/package=yardstick.
Kuhn, Max, and Hadley Wickham. 2020. “Tidymodels: A Collection of Packages for Modeling and Machine Learning Using Tidyverse Principles.” https://www.tidymodels.org [last visited: 16/05/2023].
Leeper, Thomas J. 2020. Aws.s3: AWS S3 Client Package.
Luraschi, Javier, Kevin Kuo, Kevin Ushey, JJ Allaire, Hossein Falaki, Lu Wang, Andy Zhang, Yitao Li, Edgar Ruiz, and The Apache Software Foundation. 2022. sparklyr: R Interface to Apache Spark. https://spark.rstudio.com/.
Matloff, Norman. 2015. Parallel Computing for Data Science. Boca Raton, FL: CRC Press.
Mersmann, Olaf. 2021. Microbenchmark: Accurate Timing Functions. https://CRAN.R-project.org/package=microbenchmark.
Metamarkets Group Inc. 2023. RDruid: Druid Connector for R.
Meyer, David, Achim Zeileis, and Kurt Hornik. 2006. “The Strucplot Framework: Visualizing Multi-Way Contingency Tables with Vcd.” Journal of Statistical Software 17 (3): 1–48. https://doi.org/10.18637/jss.v017.i03.
———. 2023. Vcd: Visualizing Categorical Data. https://CRAN.R-project.org/package=vcd.
Microsoft Corporation, and Stephen Weston. 2022. doSNOW: Foreach Parallel Adaptor for the ’Snow’ Package. https://CRAN.R-project.org/package=doSNOW.
Microsoft, and Steve Weston. 2022. Foreach: Provides Foreach Looping Construct. https://CRAN.R-project.org/package=foreach.
Müller, Kirill, Hadley Wickham, David A. James, and Seth Falcon. 2022. RSQLite: SQLite Interface for r. https://CRAN.R-project.org/package=RSQLite.
Pollock, Matt. 2023. Sparklyr.nested: A ’Sparklyr’ Extension for Nested Data.
R Core Team. 2021. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
———. 2022. Foreign: Read Data Stored by ’Minitab’, ’s’, ’SAS’, ’SPSS’, ’Stata’, ’Systat’, ’Weka’, ’dBase’, ... https://CRAN.R-project.org/package=foreign.
R Special Interest Group on Databases (R-SIG-DB), Hadley Wickham, and Kirill Müller. 2022. DBI: R Database Interface. https://CRAN.R-project.org/package=DBI.
Richardson, Neal, Ian Cook, Nic Crane, Dewey Dunnington, Romain François, Jonathan Keane, Dragoș Moldovan-Grünfeld, Jeroen Ooms, and Apache Arrow. 2022. Arrow: Integration to ’Apache’ ’Arrow’.
Robinson, David, Alex Hayes, and Simon Couch. 2022. Broom: Convert Statistical Objects into Tidy Tibbles. https://CRAN.R-project.org/package=broom.
Shakespeare, W. 1599/2020. Julius Caesar. New York: Open Road Media.
Stock, James H, and Mark W Watson. 2003. Introduction to Econometrics. Pearson Education.
Taddy, Matt. 2017. “One-Step Estimator Paths for Concave Regularization.” Journal of Computational and Graphical Statistics 26 (3): 525–36. https://doi.org/10.1080/10618600.2016.1211532.
———. 2019. Business Data Science. New York: McGraw-Hill.
Urbanek, Simon. 2022. RJDBC: Provides Access to Databases Through the JDBC Interface. https://CRAN.R-project.org/package=RJDBC.
Varian, Hal R. 2014. “Big Data: New Tricks for Econometrics.” Journal of Economic Perspectives 28 (2): 3–28. https://doi.org/10.1257/jep.28.2.3.
Venkataraman, Shivaram, Xiangrui Meng, Felix Cheung, and The Apache Software Foundation. 2021. SparkR: R Front End for Apache Spark. https://CRAN.R-project.org/package=SparkR.
Walkowiak, Simkon. 2016. Big Data Analytics with r. Birmingham, UK: PACKT Publishing.
Wickham, Hadley. 2011. “The Split-Apply-Combine Strategy for Data Analysis.” Journal of Statistical Software 40.
———. 2016. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. https://ggplot2.tidyverse.org.
———. 2021. Pryr: Tools for Computing on the Language. https://CRAN.R-project.org/package=pryr.
———. 2022a. Lobstr: Visualize r Data Structures with Trees. https://CRAN.R-project.org/package=lobstr.
———. 2022b. Stringr: Simple, Consistent Wrappers for Common String Operations. https://CRAN.R-project.org/package=stringr.
Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019. “Welcome to the tidyverse.” Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.
Wickham, Hadley, and Jennifer Bryan. 2022. Bigrquery: An Interface to Google’s ’BigQuery’ ’API’. https://CRAN.R-project.org/package=bigrquery.
Wickham, Hadley, Romain François, Lionel Henry, Kirill Müller, and Davis Vaughan. 2023. Dplyr: A Grammar of Data Manipulation. https://CRAN.R-project.org/package=dplyr.
Wickham, Hadley, and Garrett Grolemund. 2016. R for Data Science. O’Reilly Media, Inc.
Wickham, Hadley, Jim Hester, Winston Chang, and Jennifer Bryan. 2022. Devtools: Tools to Make Developing r Packages Easier. https://CRAN.R-project.org/package=devtools.
Wilkinson, L., D. Wills, D. Rope, A. Norton, and R. Dubbs. 2005. The Grammar of Graphics. Statistics and Computing. Springer New York. https://books.google.ch/books?id=\_kRX4LoFfGQC.
Yang, Fangjin, Eric Tschetter, Xavier Léauté, Nelson Ray, Gian Merlino, and Deep Ganguli. 2014. Druid: A Real-Time Analytical Data Store.” In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, 157–68. SIGMOD ’14. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/2588555.2595631.
Zaharia, Matei, Reynold S. Xin, Patrick Wendell, Tathagata Das, Michael Armbrust, Ankur Dave, Xiangrui Meng, et al. 2016. Apache Spark: A unified engine for big data processing.” Commun. ACM 59 (11): 56–65. https://doi.org/10.1145/2934664.
Zeileis, Achim, and Torsten Hothorn. 2002. “Diagnostic Checking in Regression Relationships.” R News 2 (3): 7–10. https://CRAN.R-project.org/doc/Rnews/.
Zeileis, Achim, David Meyer, and Kurt Hornik. 2007. “Residual-Based Shadings for Visualizing (Conditional) Independence.” Journal of Computational and Graphical Statistics 16 (3): 507–25. https://doi.org/10.1198/106186007X237856.