Mapping the physics research space: a machine learning approach

Published on Dec 1, 2019 in EPJ Data Science (3.262)
· DOI: 10.1140/epjds/s13688-019-0210-z
Matteo Chinazzi (NU: Northeastern University), Estimated H-index: 6
Bruno Gonçalves, Estimated H-index: 2
+ 1 Authors
Alessandro Vespignani (NU: Northeastern University), Estimated H-index: 88
Scientific discoveries do not occur in a vacuum but rather by connecting existing pieces of knowledge in new and creative ways. Mapping the relation and structure of scientific knowledge is therefore central to our understanding of the dynamics of scientific production. Here we introduce a new machine learning approach to scientific knowledge mapping that, starting from the observed publication patterns of authors, generates an N-dimensional space in which it is possible to measure the similarity or distance between different research topics and knowledge domains. We provide an implementation of the proposed approach that considers the American Physical Society publications database and generates a map of the research space in Physics that characterizes the relation among research topics over time. We use this map to measure two indicators, the research capacity fingerprint and the knowledge density, to profile the research activity in physical sciences of more than 400 urban areas across the world. We show that these indicators can be used to analyze and predict the evolution over time of the research capacity and specialization of specific geographical areas. Furthermore, we provide an extensive analysis of the relation between socio-economic development indicators and the ability to produce new knowledge for 67 countries, as measured by our approach, highlighting some key correlates of scientific production capacity. The proposed approach is scalable to very large datasets and can be extended to study other disciplines and research areas without having to rely on ad-hoc science classification schemes.
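The idea of turning author publication patterns into a space where topic similarity can be measured can be illustrated with a small sketch. It does not reproduce the paper's embedding model: a truncated SVD of an author-by-topic matrix is swapped in as a stand-in technique, and the topic names, counts, and function names are invented for illustration.

```python
import numpy as np

# Hypothetical toy data (not from the APS dataset): rows are authors,
# columns are topics, entries count papers by that author on that topic.
topics = ["superconductivity", "magnetism", "cosmology", "dark_matter"]
author_topic = np.array([
    [5, 3, 0, 0],
    [4, 6, 0, 0],
    [0, 0, 7, 2],
    [0, 1, 3, 5],
    [2, 2, 0, 0],
], dtype=float)


def embed_topics(counts, n_dims):
    """Embed topics in n_dims dimensions via truncated SVD (stand-in technique)."""
    # Normalize each author's row so prolific authors do not dominate.
    row_sums = counts.sum(axis=1, keepdims=True)
    profiles = counts / np.where(row_sums == 0, 1.0, row_sums)
    # Each column of `profiles` describes one topic; the SVD compresses
    # those columns into a few latent dimensions.
    _, s, vt = np.linalg.svd(profiles, full_matrices=False)
    return (vt[:n_dims] * s[:n_dims, None]).T  # shape: (n_topics, n_dims)


def cosine_similarity(u, v):
    """Cosine of the angle between two topic vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


vectors = embed_topics(author_topic, n_dims=2)
sim_close = cosine_similarity(vectors[0], vectors[1])  # topics sharing authors
sim_far = cosine_similarity(vectors[0], vectors[2])    # topics with no shared authors
print(sim_close, sim_far)
```

In this toy setting, topics pursued by the same authors end up closer together than topics with disjoint author sets, which is the property a research-space map relies on.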
References (89)
#1 Giovanni Abramo, H-Index: 28
#2 Ciriaco Andrea D’Angelo (University of Rome Tor Vergata), H-Index: 28
Last. Flavia Di Costa, H-Index: 12
The intention of this work is to analyze top scientists’ collaboration behavior at the “international”, “domestic extramural” and “intramural” levels, and compare it to that of their lesser performing colleagues. The field of observation consists of the entire faculty of the Italian academic system, and so the coauthorship of scientific publications by over 12,000 professors. The broader aim is to improve understanding of the causal nexus between research collaboration and performance. The analy...
1 Citation
#1 Reinhilde Veugelers (Bruegel), H-Index: 5
#2 Jian Wang (Katholieke Universiteit Leuven), H-Index: 13
This paper explores the complex relationship between scientific novelty and technological impact. We measure novel science as publications which make new combinations of prior knowledge, as reflected in new combinations of journals in their references, and trace links between science and technology by scientific references in patent applications. We draw on all the Web of Science SCIE journal articles published in 2001 and all the patents in PATSTAT (October 2013 edition). We find that ...
1 Citation
#1 Caroline S. Wagner (OSU: Ohio State University), H-Index: 23
#2 Travis A. Whetsell (FIU: Florida International University), H-Index: 6
Last. Satyam Mukherjee (Indian Institute of Management Udaipur), H-Index: 1
Research articles produced through international collaboration are more highly cited than other work, but are they also more novel? Using measures developed by Uzzi et al. (2013), and replicated by Boyack and Klavans (2014), this article tests for novelty and conventionality in international research collaboration. Scholars have found that coauthored articles are more novel and have suggested that diverse groups have a greater chance of producing creative work. As such, we expected to...
5 Citations
#1 Shuo Yu (DUT: Dalian University of Technology), H-Index: 6
#2 Hayat Dino Bedru (DUT: Dalian University of Technology), H-Index: 1
Last. Feng Xia (DUT: Dalian University of Technology), H-Index: 35
(4 authors in total)
Scientific teamwork collaboration is an integral element of the scientific process that often leads to significant findings. Systematic analysis of scientific teamwork collaboration continues to influence both the advance in science and knowledge production. This paper presents an overview of Science of Scientific Team Science (SSTS). SSTS explores the behaviors and attributes of teamwork and team-based collaboration specific to scientific teams from the perspective of quantitative anal...
1 Citation
#1 Loet Leydesdorff (UvA: University of Amsterdam), H-Index: 83
#2 Caroline S. Wagner (OSU: Ohio State University), H-Index: 23
Last. Lutz Bornmann (MPG: Max Planck Society), H-Index: 48
Questions of definition and measurement continue to constrain a consensus on the measurement of interdisciplinarity. Using Rao-Stirling (RS) Diversity sometimes produces anomalous results. We argue that these unexpected outcomes can be related to the use of “dual-concept diversity” which combines “variety” and “balance” in the definitions (ex ante). We propose to modify RS Diversity into a new indicator (DIV) which operationalizes “variety,” “balance,” and “disparity” independently and ...
2 Citations
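For readers unfamiliar with the Rao-Stirling measure mentioned in the entry above, the following is a minimal sketch of its commonly used form, the sum of p_i * p_j * d_ij over all ordered pairs of distinct categories. It assumes both exponents of the general formula are set to 1; it does not implement the DIV indicator the entry proposes, and the category shares and distances are invented.

```python
import numpy as np


def rao_stirling(proportions, distances):
    """Rao-Stirling diversity: sum of p_i * p_j * d_ij over ordered pairs i != j."""
    p = np.asarray(proportions, dtype=float)
    d = np.asarray(distances, dtype=float)
    # The diagonal of d is zero, so the i == j terms contribute nothing.
    return float(p @ d @ p)


# Hypothetical pairwise distances between three categories (symmetric, zero diagonal).
distances = np.array([
    [0.0, 0.2, 0.9],
    [0.2, 0.0, 0.8],
    [0.9, 0.8, 0.0],
])

balanced = rao_stirling([1 / 3, 1 / 3, 1 / 3], distances)
concentrated = rao_stirling([0.9, 0.1, 0.0], distances)
single = rao_stirling([1.0, 0.0, 0.0], distances)
print(balanced, concentrated, single)
```

Because the shares and the distances are multiplied together, a single number mixes how evenly work is spread with how far apart the categories are, which is the kind of entanglement the entry argues against.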
#1 Federico Battiston (CEU: Central European University), H-Index: 10
#2 Federico Musciotto (CEU: Central European University), H-Index: 1
Last. Roberta Sinatra, H-Index: 16
(6 authors in total)
Over the past decades, the diversity of areas explored by physicists has exploded, encompassing new topics from biophysics and chemical physics to network science. However, it is unclear how these new subfields emerged from the traditional subject areas and how physicists explore them. To map out the evolution of physics subfields, here, we take an intellectual census of physics by studying physicists’ careers. We use a large-scale publication data set, identify the subfields of 135,877 physicis...
1 Citation
#1 Raj Kumar Pan (Aalto University), H-Index: 23
#2 Alexander Michael Petersen (UCM: University of California, Merced), H-Index: 21
Last. Santo Fortunato (IU: Indiana University Bloomington), H-Index: 44
(4 authors in total)
Scientific production is steadily growing, exhibiting 4% annual growth in publications and 1.8% annual growth in the number of references per publication, together producing a 12-year doubling period in the total supply of references, i.e. links in the science citation network. This growth has far-reaching implications for how academic knowledge is connected, accessed and evaluated. Against this background, we analyzed a citation network comprised of 837 million references produced by 3...
10 Citations
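The 12-year doubling period quoted in the entry above can be checked with a short calculation. The sketch assumes that the total supply of references is the number of publications times the references per publication, and that both grow at constant annual rates; the function name is illustrative.

```python
import math


def doubling_time(*annual_rates):
    """Years needed to double when several growth rates compound multiplicatively."""
    # log1p(r) = ln(1 + r); the logs add because the growth factors multiply.
    combined_log_growth = sum(math.log1p(rate) for rate in annual_rates)
    return math.log(2) / combined_log_growth


# 4% growth in publications and 1.8% growth in references per publication.
years = doubling_time(0.04, 0.018)
print(round(years, 1))  # 12.1
```

The combined annual growth factor is 1.04 × 1.018 ≈ 1.0587, so the supply of references doubles in roughly 12 years, consistent with the figure stated in the entry.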
#1 César A. Hidalgo (MIT: Massachusetts Institute of Technology), H-Index: 22
#2 Pierre-Alexandre Balland (UU: Utrecht University), H-Index: 15
Last. Shengjun Zhu (PKU: Peking University), H-Index: 1
(15 authors in total)
The idea that skills, technology, and knowledge are spatially concentrated has a long academic tradition. Yet only recently has this hypothesis been empirically formalized and corroborated at multiple spatial scales, for different economic activities, and for a diversity of institutional regimes. The new synthesis is an empirical principle describing the probability that a region enters (or exits) an economic activity as a function of the number of related activities present in that location. I...
15 Citations
#1 Mathias Czaika (Danube University Krems), H-Index: 13
#2 Sultan Orazbayev (Harvard University), H-Index: 3
This article provides an empirical assessment of global scientific mobility over the past four decades. Based on bibliometric data, we find (i) an increasing diversity of origin and destination countries integrated in global scientific mobility, with (ii) the centre of gravity of scientific knowledge production and migration destinations moving continuously eastwards by about 1300 km per decade, (iii) an increase in average migration distances of scientists reflecting integration of global periph...
5 Citations
#1 Kara L. Hall, H-Index: 19
#2 Amanda L. Vogel (Leidos), H-Index: 8
Last. Stephen M. Fiore (UCF: University of Central Florida), H-Index: 32
(7 authors in total)
25 Citations
Cited By (2)
The exchange of knowledge across different areas and disciplines plays a key role in the process of knowledge creation, and can stimulate innovation and the emergence of new fields. We develop here a quantitative framework to extract significant dependencies among scientific disciplines and turn them into a time-varying network whose nodes are the different fields, while the weighted links represent the flow of knowledge from one field to another at a given period of time. Drawing on a comprehen...
We propose an original approach to describe the scientific progress in a quantitative way. Using innovative Machine Learning techniques, we create vector representations of the PACS codes and use them to represent the relative movements of the various domains of Physics in a multi-dimensional space. This methodology unveils about 25 years of scientific trends, enables us to predict innovative couplings of fields, and illustrates how Nobel Prize papers and APS milestones drive the future conv...