{"id":5087,"date":"2016-12-13T09:15:50","date_gmt":"2016-12-13T09:15:50","guid":{"rendered":"https:\/\/clarivate.com\/?p=5087"},"modified":"2025-08-07T15:29:38","modified_gmt":"2025-08-07T15:29:38","slug":"web-of-science-big-data-blooms-in-bloomington","status":"publish","type":"post","link":"https:\/\/clarivate.com\/academia-government\/blog\/web-of-science-big-data-blooms-in-bloomington\/","title":{"rendered":"Web of Science Big Data Blooms in Bloomington"},"content":{"rendered":"<p>This November, we had the opportunity to partner with the <a href=\"http:\/\/www.knowledgelab.org\/\" target=\"_blank\" rel=\"noopener\"><strong>KnowledgeLab at the University of Chicago<\/strong><\/a>\u00a0and the\u00a0<a href=\"http:\/\/ella.slis.indiana.edu\/~katy\/\" target=\"_blank\" rel=\"noopener\"><strong>I-School of Indiana University<\/strong><\/a> to host a workshop with over 40 of our colleagues and customers to discuss research approaches to using our data. More than\u00a08,500 scholarly\u00a0articles based on Web of Science data have been published in the\u00a0past 15 years by researchers in myriad fields; and with the recent explosion in big data technologies, this trend is sure to continue. Working with these researchers on\u00a0ways to improve both our data services and their research was the inspiration for the\u00a0event.<\/p>\n<p>Attendees joined us from all over North America and as far away as Switzerland and Poland for two days of discussion, presentations, networking, and hacking with Web of Science data. The attendees included representatives from top university big data research labs, Key Opinion Leaders, development partners, and our expert colleagues. 
Check out our <a href=\"http:\/\/cns.iu.edu\/workshops\/event\/161114.html\" target=\"_blank\" rel=\"noopener\">&#8216;lightning talks&#8217;<\/a> offering an in-depth look at the history of, and the technology powering, Web of Science.<\/p>\n<p>The second day was really the highlight with the<strong>\u00a0Hackathon<\/strong>. We provided everyone with a custom <a href=\"https:\/\/clarivate.com\/academia-government\/scientific-and-academic-research\/research-discovery-and-referencing\/web-of-science\/\">Web of Science dataset<\/a> for the event and split up into small teams to focus on digging into the data. The teams dove in and brought all kinds of tools to bear on the dataset, including data analytics and compute power using both the KnowledgeLab Data Enclave and the IU secure data desktop, as well as visualizations in commercial tools like\u00a0<a href=\"http:\/\/www.tableau.com\/\" target=\"_blank\" rel=\"noopener\">Tableau<\/a>. In the collaborative spirit of the event, folks were sharing their own custom code from\u00a0<a href=\"https:\/\/github.com\/\" target=\"_blank\" rel=\"noopener\">GitHub<\/a>\u00a0and immediately realized how much time they could save each other by sharing tools and approaches to things like disambiguation, recommendations,\u00a0similarity metrics,\u00a0SQL schemas,\u00a0XML parsers,\u00a0matching IDs,\u00a0gender identification, and\u00a0data format conversions.<\/p>\n<p>We had wide-ranging discussions on many topics throughout the workshop, but the following stand out as perhaps the top takeaways and most interesting food for thought:<\/p>\n<ul>\n<li><strong>Our customers use multiple datasets<\/strong>\u2014full-text publisher data, usage data, PubMed and other freely available metadata\u2014but the Web of Science dataset is the standard they rely on for the broadest, deepest, and most authoritative record of scholarly research output. 
It is the scale, scope, and thoroughness of Web of Science that makes it an essential big data research tool.<\/li>\n<li>Attendees of the event represented a\u00a0<strong>Data Scientist Researcher user persona<\/strong> with a focus on technical software tools and data needs quite different from those of our more typical end-user researcher or librarian. The Web of Science product team is continually working to accommodate the needs of a broad range of customers and use cases.<\/li>\n<li>Despite many of the attendees working for \u201ccompeting\u201d labs and universities, there was a highly\u00a0<strong>open and collaborative spirit<\/strong> to the interactions, with several teams starting new projects that they will continue after the event.<\/li>\n<li>Many of\u00a0<strong>our users are duplicating effort on core data cleanup tasks<\/strong>\u2014disambiguation, ID matching, parsing,\u00a0data format conversion, etc. Disambiguation of author names is perhaps the most substantial core data science challenge. We are working closely with this community to find ways we can better support them to help save time and money and enable them to focus on their core research\u2014a value Clarivate Analytics always strives to deliver to our customers.<\/li>\n<\/ul>\n<p>This was a great two days, and many people walked away with new collaborations started and plans to follow up and continue to learn from one another\u2014connections they would very likely not have made without the chance to meet at this workshop.<\/p>\n<p>This event is just <strong>one example of the way the Clarivate Analytics Web of Science team is engaging with our customers to build a vibrant user community<\/strong>. I would love to hear from other Web of Science users on ways that we can work with and support them in their citation-based big data analytical research projects. 
Feel free to reach out to me via <a href=\"mailto:jason.rollins@Clarivate.com\">email<\/a> for further information.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Working with researchers on ways to improve both our data services and their research at the Web of Science as a Research Dataset Workshop.<\/p>\n","protected":false},"author":10,"featured_media":5088,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[16],"tags":[34],"class_list":["post-5087","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-academia-government","tag-web-of-science"],"acf":[],"lang":"en","translations":{"en":5087},"publishpress_future_workflow_manual_trigger":{"enabledWorkflows":[]},"pll_sync_post":[],"_links":{"self":[{"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/posts\/5087","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/comments?post=5087"}],"version-history":[{"count":1,"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/posts\/5087\/revisions"}],"predecessor-version":[{"id":286508,"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/posts\/5087\/revisions\/286508"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/media?parent=5087"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/clarivate.com\/academia-government\/wp-json
\/wp\/v2\/categories?post=5087"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/clarivate.com\/academia-government\/wp-json\/wp\/v2\/tags?post=5087"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}