È necessario avere un abbonamento a JoVE per visualizzare questo. Accedi o inizia la tua prova gratuita.
* Questi autori hanno contribuito in egual misura
This bibliometric analysis of single-cell sequencing in cancer research indicates that China and the USA produce significantly more scholarly articles than other nations. Burst detection identifies emerging terms such as 'intra-tumor heterogeneity,' 'clonal evolution,' and 'drug delivery systems,' which are expected to influence future research.
Cancer poses a significant challenge to human health due to its complex biological systems, necessitating in-depth analysis. Single-cell sequencing has become an essential tool for investigating these systems, enabling the detection of gene expression and epigenetic modifications at the single-cell level. To elucidate research trends, collaboration networks, and knowledge dissemination in this field, a bibliometric analysis was conducted using the Web of Science Core Collection database, covering publications from January 1, 2010, to December 31, 2023. The Bibliometrix package in R was used to extract and analyze key publication data, including document types, countries, institutions, authors, and keywords. Additionally, CiteSpace, VOSviewer, and the Online Analysis Platform of Literature Metrology were employed for data compilation and visualization. The analysis identified 34,074 authors from 3,129 institutions across 75 countries and regions, contributing to 5,680 publications on single-cell sequencing in cancer, published in 788 academic journals. China and the United States emerged as the leading nations in publication volume. Harvard University produced the highest number of publications (320), with Aviv Regev, affiliated with Harvard, recognized as a key contributor. Leading journals, such as Frontiers in Immunology and Nature Communications, highlight both established and emerging research areas, including the immune microenvironment and immunotherapy. Key trends and potential areas for future research include intra-tumor heterogeneity, clonal evolution, and drug delivery systems. This study provides a comprehensive overview of single-cell sequencing research in oncology, emphasizing its rapid progress, driven by technological advancements and international collaborations. Strengthening global partnerships, developing integrative analytical tools, and addressing data complexities will be crucial for advancing personalized cancer therapies and deepening insights into cancer biology.
Cancer represents one of the most detrimental diseases, ranking as the second leading cause of mortality worldwide1. It is estimated that by 2035, approximately a quarter of the global population will be directly impacted by cancer2,3. The pathogenesis of cancer is primarily linked to dysregulation in cellular growth, which is influenced by a variety of tumorigenic factors4,5. The "Hallmarks of Cancer" were conceptualized as a set of functional capabilities that facilitate the transition from normal cellular states to neoplastic growth, specifically those capabilities essential for the formation of malignant tumors6. Sequencing technology plays a pivotal role in advancing our understanding of disease pathogenesis. However, due to the inherent heterogeneity of tumors, identifying the genomic characteristics of low-abundance stem cells through high-throughput sequencing analysis of tumor tissues presents significant challenges7,8.
Single-cell sequencing, which includes genomics, transcriptomics, epigenomics, proteomics, and metabolomics, represents a powerful methodological approach for elucidating cellular and molecular landscapes at the single-cell level9,10. Its application in cancer research has significantly enhanced the understanding of the biological characteristics and dynamics present within neoplastic lesions, thereby facilitating a more comprehensive comprehension of cancer development and metastasis.
The bibliometric analysis examines the structural characteristics and attributes of scholarly publications and has been widely employed in both qualitative and quantitative assessments of scientific literature11,12. By comparing contributions from various countries, institutions, researchers, and publications, it is possible to elucidate and anticipate potential advancements within a particular research domain. Although there has been a substantial increase in systematic and narrative reviews focusing on single-cell sequencing research in cancer, there remains a notable deficiency in comprehensive analyses within the realm of quantitative assessment13,14,15. This study aims to conduct a comprehensive analysis of the developmental trends and prominent research topics in single-cell sequencing within the domain of cancer, utilizing bibliometric methods. The findings will offer researchers, clinicians, and policymakers a detailed overview of the current state of knowledge and understanding in this area.
The data used in this study was obtained from the Web of Science Core Collection (2010-2023).
1. Data collection
2. Data preprocessing
Annual growth trend of publications and citations
From 2010 to 2023, a total of 6,767 publications related to single-cell sequencing in cancer were identified in the WoSCC database. A total of 602 studies published between 2010 and 2023 were excluded from the analysis, followed by the exclusion of five studies not published in English. Additionally, 480 articles were excluded based on predefined exclusion criteria, comprising 361 meeting abstracts, 83 editorial materials, and 36 articles classified...
Bibliometric analysis serves as a quantitative approach to evaluating the characteristics and scholarly impact of significant publications26. This study conducted an extensive bibliometric analysis of 5,680 articles related to single-cell sequencing in cancer research, extracted from the WoSCC database and published between 2010 and 2023. This analysis aimed to assess the current state of research, identify key research hotspots, and elucidate emerging trends to provide actionable insights for res...
The authors have nothing to disclose.
None.
Name | Company | Catalog Number | Comments |
bibliometrix package | Comprehensive R Archive Network (CRAN) | bibliometrix 4.3.0 | A forest plot that allows for multiple confidence intervals per row, custom fonts for each text element, custom confidence intervals, text mixed with expressions, and more. |
CiteSpace | Chaomei Chen, Drexel University | CiteSpace 6.2.R4 (64-bit) beta Basic | CiteSpace is a scientific literature analysis tool. Its main function is to analyze the underlying knowledge in scientific literature through visual means, showing the structure, rules and distribution of scientific knowledge. The main functions of CiteSpace include: research collaboration analysis , important journal judgment , core topic mining and so on. |
dplyr | Comprehensive R Archive Network (CRAN) | dplyr 1.1.4 | dbplyr is the database backend for dplyr. It allows you to use remote database tables as if they are in-memory data frames by automatically converting dplyr code into SQL. |
esquisse | Comprehensive R Archive Network (CRAN) | esquisse 2.0.1 | This addin allows you to interactively explore your data by visualizing it with the ggplot2 package. It allows you to draw bar plots, curves, scatter plots, histograms, boxplot and sf objects, then export the graph or retrieve the code to reproduce the graph. |
forcats | Comprehensive R Archive Network (CRAN) | forcats 1.0.0 | R uses factors to handle categorical variables, variables that have a fixed and known set of possible values. Factors are also helpful for reordering character vectors to improve display. The goal of the forcats package is to provide a suite of tools that solve common problems with factors, including changing the order of levels or the values. |
ggplot2 | Comprehensive R Archive Network (CRAN) | ggplot2 3.5.1 | ggplot2 is a system for declaratively creating graphics, based on The Grammar of Graphics. You provide the data, tell ggplot2 how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details. |
ggpmisc | Comprehensive R Archive Network (CRAN) | ggpmisc 0.6.1 | Package ‘ggpmisc’ (Miscellaneous Extensions to ‘ggplot2’) is a set of extensions to R package ‘ggplot2’ (>= 3.0.0) with emphasis on annotations and plotting related to fitted models. Estimates from model fit objects can be displayed in ggplots as text, tables or equations. Predicted values, residuals, deviations and weights can be plotted for various model fit functions. |
ggsci | Comprehensive R Archive Network (CRAN) | ggsci 3.2.0 | ggsci offers a collection of ggplot2 color palettes inspired by scientific journals, data visualization libraries, science fiction movies, and TV shows. |
openxlsx | Comprehensive R Archive Network (CRAN) | openxlsx 4.2.7.1 | This R package simplifies the creation of .xlsx files by providing a high level interface to writing, styling and editing worksheets. Through the use of Rcpp, read/write times are comparable to the xlsx and XLConnect packages with the added benefit of removing the dependency on Java. |
readxl | Comprehensive R Archive Network (CRAN) | readxl 1.4.3 | The readxl package makes it easy to get data out of Excel and into R. Compared to many of the existing packages (e.g. gdata, xlsx, xlsReadWrite) readxl has no external dependencies, so it’s easy to install and use on all operating systems. It is designed to work with tabular data. |
reshape2 | Comprehensive R Archive Network (CRAN) | reshape2 1.4.4 | Reshape2 is a reboot of the reshape package. It's been over five years since the first release of reshape, and in that time I've learned a tremendous amount about R programming, and how to work with data in R. Reshape2 uses that knowledge to make a new package for reshaping data that is much more focused and much much faster. |
stringr | Comprehensive R Archive Network (CRAN) | stringr 1.5.1 | Strings are not glamorous, high-profile components of R, but they do play a big role in many data cleaning and preparation tasks. The stringr package provides a cohesive set of functions designed to make working with strings as easy as possible. |
tidytext | Comprehensive R Archive Network (CRAN) | tidytext 0.4.2 | Using tidy data principles can make many text mining tasks easier, more effective, and consistent with tools already in wide use. Much of the infrastructure needed for text mining with tidy data frames already exists in packages like dplyr, broom, tidyr, and ggplot2. In this package, we provide functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages. Check out our book to learn more about text mining using tidy data principles |
tidyverse | Comprehensive R Archive Network (CRAN) | tidyverse 2.0.0 | The tidyverse is an opinionated collection of R packages designed for data science. All packages share an underlying design philosophy, grammar, and data structures. |
VennDiagram | Comprehensive R Archive Network (CRAN) | VennDiagram 1.7.3 | VennDiagram is a R package for generating high-resolution, customizable Venn diagrams with up to four sets and Euler diagrams with up to three sets. Includes handling for several special cases including two-case scaling, and extensive customization of plot shape and structure. |
VOSviewer | Centre for Science and Technology Studies, Leiden University, The Netherlands | VOSviewer version 1.6.19 | VOSviewer is a software tool for constructing and visualizing bibliometric networks. These networks may for instance include journals, researchers, or individual publications, and they can be constructed based on citation, bibliographic coupling, co-citation, or co-authorship relations. VOSviewer also offers text mining functionality that can be used to construct and visualize co-occurrence networks of important terms extracted from a body of scientific literature. |
Richiedi autorizzazione per utilizzare il testo o le figure di questo articolo JoVE
Richiedi AutorizzazioneEsplora altri articoli
This article has been published
Video Coming Soon