SNAP: Network datasets: Wikipedia Article Networks
Dataset information

The data was collected from the English Wikipedia (December 2018). These datasets represent page-page networks on specific topics (chameleons, crocodiles and squirrels). Nodes represent articles and edges are mutual links between them. The edges CSV files contain the edges; nodes are indexed from 0. The features JSON files contain the features of articles: each key is a page id, and node features are given as lists. The presence of a feature in the feature list means that an informative noun appeared in the text of the Wikipedia article. The target CSV contains the node identifiers and the average monthly traffic between October 2017 and November 2018 for each page. For each page-page network we list the number of nodes and edges along with some other descriptive statistics.

MUSAE paper: arxiv.org
MUSAE project: GitHub

Dataset information

Directed: No.
Node features: Yes.
Edge features: No.
Node labels: Yes. Continuous target.
Temporal: No.

Dataset statistics

              Chameleon  Crocodile  Squirrel
Nodes             2,277     11,631     5,201
Edges            31,421    170,918   198,493
Density           0.012      0.003     0.015
Transitivity      0.314      0.026     0.348

Possible tasks

Regression
Link prediction
Community detection
Network visualization

Source (citation)

B. Rozemberczki, C. Allen and R. Sarkar. Multi-scale Attributed Node Embedding. 2019.
@misc{rozemberczki2019multiscale,
  title={Multi-scale Attributed Node Embedding},
  author={Benedek Rozemberczki and Carl Allen and Rik Sarkar},
  year={2019},
  eprint={1909.13021},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}

Files

File           Description
wikipedia.zip  Wikipedia Article Networks
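
The file layout described above (an edges CSV, a features JSON keyed by page id, and a target CSV) could be read along the following lines. This is a minimal sketch using only the Python standard library, not an official loader. The function names are hypothetical, and the code assumes that each CSV starts with a header row, that the edge file's first two columns are the endpoint ids, and that the target file's first two columns are the node id and the traffic value. The file names inside wikipedia.zip are not given here, so paths are left to the caller.

```python
import csv
import json


def load_edges(path):
    """Return the undirected edges as a set of (u, v) pairs with u < v."""
    edges = set()
    with open(path, newline="") as f:
        reader = csv.reader(f)
        next(reader, None)  # assumption: the first row is a header
        for row in reader:
            if len(row) < 2:
                continue  # skip blank or malformed rows
            u, v = int(row[0]), int(row[1])
            if u != v:  # self-loops are left out of this simple-graph view
                edges.add((min(u, v), max(u, v)))
    return edges


def load_features(path):
    """Return {page id: list of feature ids} from a features JSON file."""
    with open(path) as f:
        raw = json.load(f)
    # JSON object keys are always strings, so convert them back to integers.
    return {int(page_id): list(feats) for page_id, feats in raw.items()}


def load_target(path):
    """Return {node id: average monthly traffic} from a target CSV file."""
    target = {}
    with open(path, newline="") as f:
        reader = csv.reader(f)
        next(reader, None)  # assumption: the first row is a header
        for row in reader:
            if len(row) < 2:
                continue
            # assumption: column 0 is the node id, column 1 the traffic value
            target[int(row[0])] = float(row[1])
    return target


def density(num_nodes, num_edges):
    """Density of a simple undirected graph: 2E / (N * (N - 1))."""
    return 2 * num_edges / (num_nodes * (num_nodes - 1))


print(round(density(2277, 31421), 3))  # Chameleon row of the table: 0.012
```

Applying the same formula to the Crocodile and Squirrel rows gives 0.003 and 0.015, which matches the Density row of the statistics table and is consistent with the networks being undirected.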