Revision as of 19:49, 26 July 2017
Full article title | Electronic lab notebooks: Can they replace paper? |
---|---|
Journal | Journal of Cheminformatics |
Author(s) | Kanza, Samantha; Willoughby, Cerys; Gibbins, Nicholas; Whitby, Richard; Frey, Jeremy Graham; Erjavec, Jana; Zupančič, Klemen; Kovač, Katarina |
Author affiliation(s) | University of Southampton, BioSistemika |
Primary contact | Email: sk11g08 at soton dot ac dot uk |
Year published | 2017 |
Volume and issue | 9 |
Page(s) | 31 |
DOI | 10.1186/s13321-017-0221-3 |
ISSN | 1758-2946 |
Distribution license | Creative Commons Attribution 4.0 International |
Website | https://jcheminf.springeropen.com/articles/10.1186/s13321-017-0221-3 |
Download | https://jcheminf.springeropen.com/track/pdf/10.1186/s13321-017-0221-3 (PDF) |
Abstract
Despite the increasingly digital nature of society, there are some areas of research that remain firmly rooted in the past; in this case the laboratory notebook, the last remaining paper component of an experiment. Countless electronic laboratory notebooks (ELNs) have been created in an attempt to digitize record keeping processes in the lab, but none of them have become a "key player" in the ELN market, due to the many adoption barriers that have been identified in previous research and further explored in the user studies presented here. The main issues identified are the cost of the current available ELNs, their ease of use (or lack of it), and their accessibility issues across different devices and operating systems. Evidence suggests that whilst scientists willingly make use of generic notebooking software, spreadsheets, and other general office and scientific tools to aid their work, current ELNs are lacking in the required functionality to meet the needs of researchers. In this paper we present our extensive research and user study results, proposing an ELN built upon a pre-existing cloud notebook platform that makes use of accessible popular scientific software and semantic web technologies to help overcome the identified barriers to adoption.
Keywords: electronic laboratory notebooks (ELNs), notebooking software, cloud, semantic web, scientific software
Background
In scientific research, communication is essential between researchers, funding bodies, industry, and members of the public. Ideas need to be shared, evidence disseminated, plans discussed, findings recorded, and errors corrected. Researchers may work alone, but their research is of little value to the scientific community if it isn't disseminated. The scientific record can act as a legally binding record that protects intellectual property (IP).[1] Historically the paper laboratory notebook and the scientific paper have been at the center of this scientific communication[2]; however, they are slowly being displaced by digital technologies and the internet, the web in particular.[3]
Digital technologies are shaping the way experiments are performed, results are captured, and findings are disseminated. Computers enable a myriad of functions that benefit researchers and scientists: they can be searched, shared, easily backed up, and readily accessed.[4] They facilitate interactive computation, electronic communication, multimedia distribution, and digital information management.[5] Within the lab, instruments are mostly computer controlled; computers are the main tools for capturing, analyzing, and annotating data. Electronic laboratory notebooks (ELNs) are also transforming the way that the scientific record is captured, with a revolutionary transformation from paper notebooks to the digital capture of experiments.[6]
ELNs offer significant benefits to researchers by facilitating long-term storage, reproducibility, and enhanced availability of experiment records across multiple devices, ensuring standard operating procedure compliance and providing interfaces to instrumentation, supporting IP protection, collaboration, and open science.[7][8][9][10][11] ELNs eliminate the need for manual transcription and can be used by distributed groups[12], facilitate managing notes, and simplify the inclusion and curation of digital resources (e.g., instrument data, analysis results).[13] While some systems are restricted to repositories of raw data and results, others have the potential to support researchers through the whole experiment lifecycle.[14][15]
More recently, semantic lab notebooks (SLNs) have utilized semantic web technologies to expose research data as formalized metadata[16], and to link between the different data sets collected throughout the experimental process.[17] Incorporating semantic web technologies within ELNs, using RDF and ontologies to enrich the data with meaning and context, provides new functionality such as making inferences about experiment types and creates valuable links between experiment outcomes and their final reports.[12][13][15][16][18] Making ELN data machine readable increases interoperability, facilitates integration with third-party tools, and enables automatic generation of materials for deposition in an archive or publication[16], increasing the usefulness of the tools for researchers.
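To make the idea of "enriching data with meaning and context" concrete, the following is a minimal sketch in plain Python rather than a real RDF library. It stores experiment metadata as subject-predicate-object triples, infers broader experiment types by following subclass links, and links an experiment to its final report through its outcomes. All `ex:` identifiers and the helper function names are hypothetical and chosen only for illustration; they are not taken from any ELN or ontology discussed in this paper.

```python
# Illustrative triples; every "ex:" name is a made-up example term.
triples = {
    ("ex:exp42", "rdf:type", "ex:Experiment"),
    ("ex:exp42", "ex:usesTechnique", "ex:NMRSpectroscopy"),
    ("ex:exp42", "ex:hasOutcome", "ex:spectrum7"),
    ("ex:report3", "ex:reportsOn", "ex:spectrum7"),
    ("ex:NMRSpectroscopy", "rdfs:subClassOf", "ex:Spectroscopy"),
}


def objects(subject, predicate):
    """Return all objects o such that (subject, predicate, o) is stored."""
    return {o for s, p, o in triples if s == subject and p == predicate}


def subjects(predicate, obj):
    """Return all subjects s such that (s, predicate, obj) is stored."""
    return {s for s, p, o in triples if p == predicate and o == obj}


def technique_classes(experiment):
    """Infer the technique and every broader class it belongs to."""
    found = set()
    frontier = objects(experiment, "ex:usesTechnique")
    while frontier:
        current = frontier.pop()
        if current not in found:
            found.add(current)
            # Follow subclass links upwards to broader categories.
            frontier |= objects(current, "rdfs:subClassOf")
    return found


def reports_for(experiment):
    """Find reports linked to an experiment through its outcomes."""
    return {
        report
        for outcome in objects(experiment, "ex:hasOutcome")
        for report in subjects("ex:reportsOn", outcome)
    }


print(technique_classes("ex:exp42"))
print(reports_for("ex:exp42"))
```

Here the experiment is never labeled as spectroscopy directly; that fact is derived from the subclass link, which is the kind of inference the cited SLN work describes. A production system would more plausibly use an RDF store and a published ontology than hand-written sets.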
Although ELNs are being increasingly used for industrial research, uptake in academia is limited.[19][20] This paper explores the current offerings of ELNs and electronic notebook software. We conducted studies to investigate the attitudes of academics towards ELNs and their desired functionality. It presents an overview of the barriers to adoption within academic environments, researcher behavior, and key features for ELNs. Following these findings, we discuss priorities for future ELN development and propose our semantic platform-based ELN solution. Table 1 introduces the user studies that will be discussed in this paper.
The market today
The current ELN market is supersaturated with choice; however, despite the wide range of products available, there is no obvious "leader." Additionally, the scientific community is still resistant to using ELNs, despite the popularity of electronic notebooks. Electronic notebooks that have been subverted to ELN usage, current ELN offerings, and the attitudes to ELNs and their current usage are examined and detailed in this section.
Electronic notebooks
Today’s market has multiple offerings for electronic notebooks: Microsoft Word, Office 365, Google Docs, Evernote[23], and OneNote[24], which have been evaluated for use as ELNs.[25][26][27] Oleksik et al.’s[25] study reported that the collaborative features of OneNote facilitated faster and easier sharing, and they enabled simultaneous communication between researchers, irrespective of location. Users trialing Evernote as an ELN[27] said they appreciated the electronic affordances such as "accessible from any online computer" and "ability to search" but found that it was lacking in domain knowledge, stating that it was "simple and practical for some laboratories, but for others it does not offer features specialized for fields such as biology, chemistry, or quality assurance/quality control." A balance may need to be struck between making an ELN usable across multiple disciplines, whilst still providing enough domain-specific knowledge.
Whilst there are many attractive affordances of storing your notes electronically, it is of concern to researchers whether their data is kept truly private or not, once it has been put into these services. Service providers differ in their privacy policies. For example, Google Docs states that not only do the users maintain intellectual property of any content they create, content will not be shared with any third parties, and the user can take their data with them if they choose to leave Google Docs.[28] However, other services such as Microsoft Office 365 may give contracted third parties access to their customer data (which includes both personal data such as names and email addresses, but also data uploaded into their systems such as images and documents) to perform certain services.[29] An example of a privacy policy controversy is Evernote, where the default was for their employees to be able to read users' content to ascertain the accuracy of their machine learning algorithms.[30] It is important for researchers to be aware of privacy policies and to ensure that research data is secure and can't be read by third parties when using any software.
Electronic lab notebooks
Southampton University’s ELN Market study identified 103 ELNs[31][32][33][17], 72 active and 30 no longer active, either due to discontinuation or acquisition by larger companies. The active ELNs were further investigated to see which domains they supported, as well as their platform and licensing availability (Figs. 1, 2).
The different licensing categories and associated considerations are as follows:
- Paid for: This is a proprietary piece of software that can be purchased, which may use proprietary data formats.
- Paid (with free version): This is a proprietary piece of software that can be purchased, but which also has a version of this software which can be used for free; either as a trial for a fixed period of time, or a version that has reduced functionality.
- Open-source: This is a product where the code behind the actual software has been made openly available so that anyone can redistribute it and edit it as long as they conform to the licensing conditions. Open-source products are often free, but not always, and they may use either standard or proprietary data formats.
- Free: This is a product which is free to use.
These findings illustrate an array of ELNs ranging from supporting specific disciplines (such as eNovalys' LabBook[34], which is aimed at chemists), to providing all-purpose solutions (such as KineMatik's eNovator ELN[35], which aims to provide a multipurpose ELN that can be used in many different areas). However, a common factor is that most of these ELNs require payment. Additionally, slightly over 60 percent of them are web-based/platform-independent, with the rest only available on certain operating systems or without a disclosure of their platform compatibility.
There appears to be a proclivity towards ELNs that make use of pre-existing software. NuGenesis allows users to drag and drop Excel and Word files into their ELN, eLabJournal provides Excel inside its ELN, and Limsophy uses Microsoft Word templates. This illustrates an increasing awareness that scientists do use notebooking software, even if they don't specifically use ELNs. Additionally, it suggests that there is a place for ELNs during the final write-up process, as well as during the physical experimental process. These ideas will be explored further in the "Proposal" section. In addition to the market investigations, current ELN usage was also researched. BioSistemika investigated ELN usage, and the DaM survey looked at attitudes towards ELNs.
ELN usage and barriers to adoption
Despite the saturated ELN market, results from the BioSistemika and DaM surveys indicated that whilst a large percentage of academic users are considering or interested in using ELNs (as shown in Fig. 3; Table 2), they are lacking in uptake in academia.[19][20] Many scientists extensively use computers yet continue to use paper notebooks throughout their experiments, highlighting that computer illiteracy or an aversion to technology cannot fully explain resistance to ELNs.[36][37][38]
The University of Southampton’s Lab Practice Study asked users about their ELN usage and experiences. Some participants had used ELNs such as LocalWiki, LabTrove, Blog3, BioBook (now E-WorkBook), LabBook, and an industrial one on a short-term basis. The industrial ELN was unfavorably described, whilst the other ELNs were only deemed useful for certain purposes. One participant found LabBook very useful for inorganic work but lacking the required functionality for their transport runs. Equally, participants who tried LabTrove and Blog3 found some of the elements useful in certain situations, but all defaulted back to Word documents. One participant suggested that this was the case because the ELN did not contribute in a systematic way to their work.
There are therefore challenges and barriers to adoption of ELNs. Figure 4 and Table 3 illustrate the key barriers that our studies identified, and these are described in more depth in the following sections.
Cost
As shown in Table 3, a large percentage of survey respondents indicated that cost was a significant barrier to ELN adoption.[7][19][20] This includes financial outlay, staff hours, troubleshooting, and the fact that long-term use is likely to require ongoing maintenance and support. There are also concerns about the required database administration and support, with suggestions that having professional IT staff to help with setup and maintenance would be pivotal.
One respondent experienced sharp price increases in database maintenance and upgrade costs after an initial discount. Other concerns are service providers not competing to keep costs down and the potential cost of storage space, indicating disincentives if the university charges groups for storage space. Figure 4 illustrates a willingness to pay up to $50 a month for an ELN, but not $100, suggesting that ELNs reach a point where they are considered "too expensive" (Fig. 5).
Other comments queried whether funding was available for ELNs in universities, suggesting that web-based systems could significantly cut costs, as they require less hardware.
As evidenced by Fig. 2, while most of the ELNs available in the market are proprietary pieces of software that require purchasing, there are also some free and open-source offerings available. The free options would clearly have the advantage of being cheaper to run and test but may have disadvantages depending on the nature of the software. The paid-for and free software model (one of the categories in Fig. 2) will have enterprise users to generate its revenue and is able to offer reduced free versions to other users to generate recognition, providing the benefits and stability of proprietary software with a potential lack of cost. However, the fear is that other standalone free offerings are more likely to disappear, potentially alongside the research data. Some of the inactive free ELNs that were identified as part of the University of Southampton’s ELN Study were listed on online sources[31][9], though with websites that had seemingly vanished with no new obvious location. Open-source software is often free (although not always) and has a significant advantage over proprietary software with respect to its potential longevity. Both open-source and proprietary projects will always be at risk of ceasing to continue, either due to lack of funds or the original developers leaving the project. However, given the licensing of open-source projects, which makes it possible to view and change the source code, other developers are able to access, update, and support the software.
Ease of use
Another challenge is the perceived "ease of use" of paper notebooks compared to electronic systems. Paper notebooks are considered easier to use, input data to, read, and transport, and they're also inexpensive, readily available, "turn on" instantly, have infinite battery life, are socially acceptable during meetings, and require no training and minimal IT support.[7][19][37][5][39][40] By contrast, ELN software for taking notes is considered more difficult to use, more time-consuming, and less flexible, leading to anxieties about ELNs' stability, accessibility, and availability.[41]
Our interactions with researchers and ELN users demonstrate that ease of use is vital to adoption. In the DaM survey, 99% of respondents indicated that ease of use would influence their ELN choice, with almost 80% rating it as very important. One comment reflected the desire for a flexible generic solution, rather than an ELN designed for a specific research area, due to anxieties that their research "doesn’t fit neatly into one category."
Attitudes to ELNs
Adopting an ELN only makes sense if the whole department adopts it, which allows for sharing costs and training; repositories, consistency, and use of standards could also be relevant.[16][17] Several comments reflected an assumption that students and postgrads would reject ELNs through "resistance to change in some groups," noting that some students didn’t like their experiences with ELNs and might consider using them an "additional burden."
Access to ELNs
In the Uses of ELNs in Academia survey, 74% expressed concerns about needing to enter data in both the lab and write-up area, due to a lack of suitable hardware or software capabilities to facilitate ELN usage inside and outside the lab. This can lead to copying and pasting printouts into paper notebooks and manually transcribing data between notebooks and computers, which can result in data loss, transcription errors, and records stored haphazardly.[42][12] Popular suggestions were to use mobile computers or tablets for portability in and out of the lab, and that web-based ELNs could improve accessibility.
Lack of appropriate hardware access in the lab led to 12.5% of participants in the Post Pilot Survey ceasing to use the trialed ELN, resulting in several needing to perform tasks manually. Other anxieties frequently raised included risk of damage or contamination, security, the "hassle" of carrying laptops around, a shortage of computers for sharing, lack of bench space for computers, the ELN not being supported on the chosen mobile platform, and lack of WiFi access. Primary workarounds for these issues appeared to be printing out experiments, writing up experiments retrospectively, or using paper notebooks alongside the ELN.
Software and system integration and compatibility
Researchers use different operating systems, but both the ELN Market study and comments from the DaM survey revealed a lack of availability of ELNs for Macs. It was suggested that iPads could work as a shared notebook due to their ease of transport, although software would need to be compliant with iOS and other mobile platforms, or be web-based. A perceived barrier was linked to integrating ELNs with existing infrastructure. The DaM surveys had similar concerns that users might be expected to purchase new ELN software at each operating system upgrade, which could contribute to system costs, support costs, and additional training requirements.
Electronic pen data entry, integration with digital repositories for archiving purposes, and bibliographic management have also been mentioned with regard to integration with existing tools. The DaM Pilot Program elicited a need for software compatibility, database integration, electronic data, and other "common software" (e.g., Word and Excel), as well as options to purchase add-ons for increased functionality. Users found problems using the ELN on a 64-bit operating system and on Macs, or with ChemDraw, office attachments, and uploading photographs; one said "…it was too cumbersome to import files from our current systems." ELN data input seems to have been a recurring issue, alongside failings in basic expectations about data management that heightened existing anxieties.
Data compatibility and portability
In the DaM Pilot survey, just under 70% expressed concern about the ELN capturing information easily, with 81% considering automatic experiment data capture important. Comments indicated that capturing a range of data is important, while raising concerns about the difficulties of instrument integration, partly due to a lack of standards between different manufacturers. Many comments expressed frustrations about not being able to link to specific experimental data such as spectroscopic results.
Several comments indicated worries about the ability to extract and move data between different ELNs and machines; these concerns relate to price hikes with a provider, longevity of commercial packages, and changing institution. Other comments addressed issues of proprietary formats, including previous bad experiences of "being tied into data formats" or being left with only a PDF of their data, although the desired transfer formats differed between respondents. Some comments embraced the importance of open data and not being tied to a particular commercial package, suggesting an open-source ELN to resolve the problem. This suggests that researchers perceive open-source offerings to be more likely to use standard data formats rather than proprietary formats. Concerns were also expressed regarding accessing databases and notebooks across different machines, suggesting that users expect their information to be stored locally or in a centralized system, and they are concerned about data security.
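As a rough illustration of what "not being tied into data formats" could look like in practice, the sketch below writes a notebook entry to JSON, an open text format, and reads it back without loss. The `NotebookEntry` fields and the `export_entry` and `import_entry` helpers are hypothetical and do not describe the export format of any ELN mentioned in the surveys; respondents' preferred transfer formats differed, and JSON is used here only as one example.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class NotebookEntry:
    """A hypothetical, minimal notebook record used only for illustration."""
    title: str
    date: str  # ISO 8601 date string, e.g. "2017-05-24"
    body: str
    attachments: list = field(default_factory=list)  # file names, not contents


def export_entry(entry):
    """Serialize an entry to human-readable JSON text."""
    return json.dumps(asdict(entry), indent=2, sort_keys=True)


def import_entry(text):
    """Rebuild an entry from JSON text produced by export_entry."""
    return NotebookEntry(**json.loads(text))


entry = NotebookEntry(
    title="Crystallisation attempt 3",
    date="2017-05-24",
    body="Cooled slowly overnight; small needles observed.",
    attachments=["xrd_run_07.csv"],
)
exported = export_entry(entry)
restored = import_entry(exported)
```

Because the exported text is plain JSON, it can be opened in any editor or loaded by another tool, in contrast to the experience of being left with only a PDF.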
What do users do?
What users say they do doesn’t always match their actions; therefore, after establishing the main adoption barriers, we investigated how the researchers actually worked. Four focus groups were run with 24 postgraduate chemists, physicists, and biologists to discuss their current practices. Additionally, four different chemistry labs were observed to see how scientists operated there.
Results
We found that different researchers vary their working patterns and note-taking, and they have contrasting needs when it comes to sharing records with others. Therefore a "one size fits all" approach to tool design wouldn’t be effective. Tools need to provide considerable flexibility and customization to accommodate different needs. The high-level results of these activities across the different disciplines are presented in Table 4 and will be further discussed later on in this section.
Discussion
The biologists and physicists from these focus groups were mostly uniform in their methods, whereas the chemists were more diverse, highlighting differences in their approach even within a single discipline.
Computational chemists used some software, with sporadic use of lab books and scraps of paper, whereas the "wet" chemists had stringently organized lab books for different tasks. One chemist used blogs and Word documents alongside their paper notebooks, and the crystallographers relied heavily on their paper sample books. The inorganic and organic chemists used paper lab notebooks during experiments, and only used lab computers to access the instruments they were linked to. Note-taking differed depending on the situation. For experiments, the lab book was typically used to record observations and initial values. The chemists recorded different types of data including energy values, simulations, temperature, masses, observations, schemas, and protocols. These findings have similarities to Reimer and Douglas’[43] work, illustrating how information recorded remained much the same; however, they also demonstrate that different chemists possess contrasting needs for recording their notes. Constructing one’s own "templates" or other mechanisms for standardizing data capture appears to be common in academic environments[44]; therefore, providing capabilities to facilitate this is likely to be popular.[13][44] Allowing users to edit their own templates poses challenges. This is illustrated by a comment from one of the lab observation participants, who resented being asked to use a template rather than expressing themselves in their own style.
Chemists also differed in how they linked their paper and electronic notes. The physicists and biologists linked them by date, whereas the chemists inconsistently used a variety of codes, reflecting the personal nature of note organization.[44] Despite their differing lab work, there was a common theme of using instruments (e.g., X-ray machines or diffractometers) to read data and linking statements in their lab book to reference electronic data location and any data values that required inputting to other software. In some situations it may be necessary to capture some information on paper, and ELNs therefore need to facilitate the inclusion of such information with the research record.
Different disciplines had varying restrictions on what equipment could be taken into the lab. The biologists didn’t have specific restrictions, although one biochemist mentioned that there were concerns about bringing in outside equipment in case of contamination. The physicists couldn’t bring equipment into their cleanroom to avoid contaminating the environment; contrastingly, the chemists wouldn’t take technology into the lab in order to avoid damaging it with chemicals. Computers in the lab weren’t often online, and most were connected to specific instruments. When asked, participants indicated a reluctance to use instrument-dedicated computers for any other purpose, such as making notes, accessing documents remotely, or using cloud software as they didn’t have network access. One chemist stated that "once you’ve started doing something one way, you don’t want to change it."
Scenarios
To investigate the participants' current searching and backup procedures, they were presented with three scenarios to discuss (illustrated in Fig. 6):
- Imagine you’re trying to locate some work from six months ago. How would you locate your notes and associated data?
- Imagine there’s a fire in your lab and all of your paper notebooks are destroyed. How much work would you lose and how could you go about recovering it?
- If you fell under a bus tomorrow and were temporarily indisposed, how would your supervisor/industry sponsors/colleagues access your work?
For Scenario 1, participants revealed that they organized their lab books chronologically, and the most common method of locating previous work was to go back through their lab book by date to locate work from a particular time period. Similarly, to locate previous work on a computer, participants said that they would search by date to find the appropriate data files or would search by name if that proved unsuccessful.
Scenario 2 provoked different reactions. Some participants were unconcerned at the prospect of losing their lab books and thought reproducing what was needed wouldn’t take too long, as a lot of the information was only "useful in the moment" or a list of things that didn’t work. Other participants gave responses such as "I’d be ruined," "a nightmare," and "might as well stop my Ph.D. now." Particularly with reference to the idea of their labs catching fire, several participants seemed more concerned at the idea of losing their lab samples or compounds, suggesting that perhaps their lab books would not be the biggest loss in a fire.
Scenario 3 revealed that generally participants don’t have measures in place to enable their supervisors to access their work if something happened to them. One of the biologists had a particularly strict supervisor who required their students to photocopy all of their lab books and work, but that was a rare exception. It did however transpire that the participants believed that other group members would probably be able to access their work and give it to their supervisors, but they didn’t believe that they would be able to follow their lab books or the structures they’d put in place to link together their paper and electronic notes.
This continues the earlier theme about participants showing less concern towards backing up their paper-based work. They are obviously aware that these scenarios could occur, but the participants clearly don’t perceive the scenarios as likely or serious enough to merit much pre-emptive preparation, apart from circumstances where their supervisors have put procedures in place. Capturing notes and data electronically has clear backup and archiving benefits. Not only can electronic information be automatically backed up and securely stored, but the information can become accessible across multiple locations. Outdated information can be archived so that it can be retrieved later if needed, or be shared with other researchers through deposition or publication.
What do users want?
Having discussed with the users what they actually do, this section will look at what features the users say they want, with information taken from studies B, D, E, and F.
These have been grouped according to the different categories in the "What do users do?" section in addition to a new category of project activities that came out of this research. These features have also been linked to the associated priorities of the iLabber pilot project for those who found these features very or quite important, and it’s been noted which barriers these features aim to address.
The full breakdown of priorities from the iLabber Pilot Project is shown in Fig. 7.
Proposal
Based on the needs elicited from our user studies, we formulated a proposal of how to construct an ELN environment that would fit with these requirements. The majority of ELNs have been created from scratch, including the underlying "notebook" part[12][17][45]; an alternative would be to build on top of a generic electronic notebook (these are more popular than ELNs), adding domain-specific features. Many of these electronic notebooks already have collaborative cloud-based features and could be further expanded with domain knowledge and semantic web technologies. Additionally, based on our market research, despite the number of available ELNs, only a minority are available as free/open-source, platform-independent entities that scientists can use on any device, suggesting a gap that could be filled with this type of ELN.
Visions
As part of this vision, BioSistemika used their ELN survey to create their own ELN, sciNote.[46] Taking the path towards interoperability, sciNote has been designed in a modular way and released under an open-source license (Mozilla Public License). Based on user needs, they are developing new add-ons while at the same time encouraging the community to develop their own add-ons, similar to the package concept of the R statistical language. In this way every lab will be able to design their own ELN to fit their needs, which will help them manage their project, share research, gather the metadata directly from instruments, and connect with existing software and databases.
Southampton University has looked at the features the users want (shown in Table 5) and formulated how these could be achieved using an electronic notebook platform as a base. There is a large overlap of features between electronic notebooks, ELNs, and SLNs, and the main features required by an ELN already exist in generic electronic notebooking software.[43] Furthermore, using a cloud-based electronic notebook platform would combat some of the accessibility issues and facilitate the collaboration requirements of the users, and incorporating semantic web technologies would provide an improved (semantic) search (the top priority listed in Fig. 7) and allow for metadata/tagging (as requested).
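The metadata/tagging and improved search features mentioned above can be sketched very simply. The following is a minimal illustration, assuming notebook entries are identified by string IDs and tagged with free-text labels; the `TagIndex` class and its method names are hypothetical and are not part of the proposed ELN or of any existing notebook platform. It shows only plain tag matching, whereas a semantic search would additionally exploit relationships between terms.

```python
from collections import defaultdict


class TagIndex:
    """A toy inverted index mapping tags to notebook entry IDs."""

    def __init__(self):
        self._by_tag = defaultdict(set)

    def tag(self, entry_id, *tags):
        """Attach one or more tags to an entry (case-insensitive)."""
        for t in tags:
            self._by_tag[t.lower()].add(entry_id)

    def search(self, *tags):
        """Return the IDs of entries that carry every requested tag."""
        if not tags:
            return set()
        # Use .get so that searching never creates empty index entries.
        matches = [self._by_tag.get(t.lower(), set()) for t in tags]
        return set.intersection(*matches)


index = TagIndex()
index.tag("entry-001", "NMR", "synthesis")
index.tag("entry-002", "NMR", "crystallography")
index.tag("entry-003", "synthesis")

print(sorted(index.search("nmr")))
print(sorted(index.search("NMR", "synthesis")))
```

Searching by tag in this way complements the date-based and name-based searching that participants described for their existing paper and file-based records.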
A cloud-based ELN
There will always be concerns about IP with regard to using cloud-based services. The University of Southampton’s Lab Practice study elicited that users with industry sponsors were less likely to use cloud software. It is thought that once data is "in the cloud," users are no longer in control[47], and that, as with any electronic service, there is the potential for data breaches[48]; however, many precautions are taken. Yet this concern certainly isn’t restricted to electronic data. Some of the biologists from the University of Southampton’s Lab Practice Study said that they didn’t consider their work to be safe at conferences as people may take photographs of their posters and steal their ideas, and some of the chemists were aware that previous members of their research group had been "scooped," which resulted in tightened security measures across the group. Despite this, cloud computing is advantageous in that it can provide large volumes of storage and computing power that are accessible from any location[47][48], and it’s worth noting that only 18.9% of respondents in the iLabber Pilot Project Survey thought that "Better protection of IP" was "Very important," ranking significantly below "Improved search" and "Secure automatic backup of data," both of which lend themselves greatly to our proposed methods.
Proposed features/design
When investigating the features our users want, we realized that approximately 40% of these features are already implemented within cloud-based electronic notebooking software, and the rest of the features are either domain specific or could be achieved using semantic web technologies. Figure 8 shows these desired features elicited from our user studies detailed in Table 5, which are supported by previous ELN research work.[4][7][17][26][43][49][50]
We believe that this approach and the subsequent ELN environment that will be developed can mitigate the current barriers and concerns, as detailed in Table 6.
Therefore, we propose that building a semantic ELN on top of an existing cloud infrastructure or platform would allow us to make use of these pre-existing features, provide a solid notebook base aligned with software scientists already use, and help combat the current adoption barriers. The ability to adapt documents and control input provided by a platform such as Google Docs enables much of the functionality needed for an ELN (see, e.g., Fig. 9).
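As a rough illustration of how metadata tagging and semantic search could work together in such an ELN, the following Python sketch stores tags as subject-predicate-object statements and expands a search term to its narrower terms before matching. Everything named here (the `TripleStore` class, the `hasTag` predicate, the `BROADER` vocabulary, and the entry identifiers) is hypothetical and chosen only for illustration; it is not the design of the proposed system, and a real implementation would more likely rely on RDF and an established ontology.

```python
# Hypothetical vocabulary: each term maps to its broader term.
BROADER = {
    "suzuki coupling": "cross-coupling",
    "cross-coupling": "organic synthesis",
    "pcr": "molecular biology",
}


class TripleStore:
    """A toy in-memory store of (subject, predicate, object) statements."""

    def __init__(self):
        self.triples = set()

    def add(self, subject, predicate, obj):
        self.triples.add((subject, predicate, obj))

    def subjects(self, predicate, obj):
        """Return every subject that has the given predicate and object."""
        return {s for (s, p, o) in self.triples if p == predicate and o == obj}


def narrower_terms(term):
    """Return the term plus every term that has it as a transitive broader term."""
    found = {term}
    changed = True
    while changed:
        changed = False
        for narrow, broad in BROADER.items():
            if broad in found and narrow not in found:
                found.add(narrow)
                changed = True
    return found


def semantic_search(store, term):
    """Find notebook entries tagged with the term or any narrower term."""
    hits = set()
    for candidate in narrower_terms(term):
        hits |= store.subjects("hasTag", candidate)
    return sorted(hits)


# Tag three hypothetical notebook entries.
store = TripleStore()
store.add("entry:001", "hasTag", "suzuki coupling")
store.add("entry:002", "hasTag", "pcr")
store.add("entry:003", "hasTag", "cross-coupling")

# A plain keyword search for "organic synthesis" would match nothing,
# whereas the expanded search also finds entries tagged with narrower terms.
print(semantic_search(store, "organic synthesis"))
```

The point of the sketch is the difference from keyword search: an entry tagged only with a specific reaction type is still found when a researcher searches for the broader field, which is the kind of behavior the "improved search" requirement is asking for.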
Conclusions and future work
Our user studies have made one thing very clear: we cannot currently hope to fully replace the paper lab notebook. Until we have technology that allows a screen to be written on as accurately and easily as paper, and labs have cheap, durable, and easily replaceable tech to use instead of paper, paper will always prevail for some tasks. We also need to stop thinking of ELNs as direct replacements for paper lab notebooks that are only useful during experiments in the lab and consider them in the wider context of the whole experiment process. Therefore, we need to build a system that works with paper and formulate a new digital practice for scientists to use in their current lab environment.
Despite some scientists preferring paper notebooks, they still frequently use technology in their work. Many store data electronically and use note-taking software such as Word and Evernote to write up their notes, Excel to handle their figures and graphs, and specialty software for specific tasks. The cloud is also widely used to back up work and make it available across different locations. Therefore, we need to start considering ELNs that can work in this context and to work out how we can re-use existing successful software to create a better ELN platform.
We propose that we need an ELN environment that can serve as an interface between paper lab notebooks and the electronic documents that scientists create, one that is interoperable and utilizes semantic web and cloud technologies. It would fulfill all of the software needs described in the "What do users want?" section and provide a centralized location for scientists to store their notes. Whilst the ideal long-term goal is that adoption of an ELN alongside extensive laboratory automation removes any need for paper, realistically, given current technology, ELN solutions should work alongside paper for the foreseeable future. We believe that ELNs will significantly improve reproducibility of scientific experiments, contribute to data traceability and data annotation, and enable scientists to collaborate and share results in an intuitive manner. The wider adoption of ELNs will facilitate interoperability, which will ultimately change the ways scientists perform experiments and manage their data. There is great potential for future work in these areas, as an ELN that follows our vision has yet to be created. And as hardware and technology as a whole advance, we will be able to support even more of the experimental process digitally.
Abbreviations
DaM: Dial-a-Molecule
ELN: electronic lab notebook
ELNs: electronic lab notebooks
IP: intellectual property
RDF: resource description framework
SLNs: semantic lab notebooks
Declarations
Authors’ contributions
SK carried out Study C and Study D. CW carried out Study F and was involved with Study E alongside RW who designed and originated the survey. JE, KK and MH carried out Studies A and B. KZ/BioSistemika designed the surveys, did survey analysis and developed a concept of sciNote ELN together with MH. SK, CW and JE all contributed to writing the manuscript. NG and JGF helped draft and revise the manuscript, participated in the design of the research, and the analysis of the results. All authors read and approved the final manuscript.
Author information
SK is a final-year iPh.D. student in Web Science at the University of Southampton. Her primary research focus is Semantic Web technologies, and she is looking at bringing the modern power of the web to chemical research. She has a First Class M.Eng. degree in Computer Science and worked as a software developer before starting her Ph.D. CW has recently completed a Ph.D. with Southampton University investigating note-taking behavior of scientists in the lab and the design of tools to help them capture and make use of their experiment information. Before undertaking research at Southampton, she worked at IBM UK Laboratories as a software engineer specializing in software usability and information architecture. NG is an Associate Professor in Computer Science at the University of Southampton. His primary research interests are in the Semantic Web, hypertext, and distributed information systems. RW is a Professor of Organic Chemistry at the University of Southampton. He originated and leads the ‘Dial-a-Molecule’ Grand Challenge (www.dial-a-molecule.org), which has the 20–40 year aim of making the synthesis of new molecules as quick and easy as it currently is to order a commercial compound. He is interested in total synthesis, flow chemistry, molecular electronics, and cheminformatics. JGF is a Professor of Physical Chemistry at the University of Southampton. His interests span experimental laser spectroscopy and imaging, through chemical informatics and modelling, to the wider area of e-science and the digital economy. JE has a Ph.D. in Microbiology and Biotechnology. She always follows the latest trends in the life sciences and believes that great software will be a crucial tool for wet lab or dry lab scientists in the near future. KZ has a Ph.D. in Genetics. He has created several successful companies, and he strongly believes that platforms that are able to connect seamlessly with instruments and other software will help in creating the digital laboratories of the future. MH has a Ph.D. in Gene expression technologies. Based on his own experience, he has dedicated his career to developing new software products for researchers. KK has a Ph.D. in Virology. She has a special interest in laboratory management improvement and automation, which would enable researchers to dedicate more time to their research and innovation and leave tedious routine laboratory tasks to smart software and instruments.
Acknowledgements
We would like to thank all of the people who took part in the different user studies.
Competing interests
BioSistemika’s ELN Webinar Survey and ELN Survey were done as a part of the company’s ELN market research before it started development of the sciNote open-source electronic lab notebook. The results of BioSistemika’s market research nicely complement the data gathered by the University of Southampton. They highlight the current state of the ELN market and show in which direction ELN solutions, both existing and new, should go in order to meet the expectations of the users. JGF is involved with the LabTrove ELN project (www.labtrove.org).
Ethical approval and consent to participate
Specific ethical approval was obtained for Study 4 (the most recently conducted study). This study was approved by the University of Southampton’s Ethics Committee for the Faculty of Physical Sciences and Engineering. Focus groups—ethics application ERGO/FPSE/18246. Participant observations—ethics application ERGO/FPSE/18448.
Availability of data and materials
The datasets supporting the conclusions of this article will be available in the Pure repository for the University of Southampton, doi:10.5258/SOTON/405190. The datasets supporting the conclusions of this article are included within the article (and its Additional file 1).
Funding
This work was supported by the Web Science Centre for Doctoral Training at the University of Southampton, funded by the U.K. Engineering and Physical Sciences Research Council (EPSRC) under Grant No. EP/G036926/1. This work was also supported by the following groups: Digital Economy IT as a Utility Network + under Grant No. EP/K003569/1, the South East Regional e-Research Consortium under Grant No. EP/F05811X/1, and PLATFORM: End-to-End pipeline for chemical information: from the laboratory to literature and back again, under Grant No. EP/C008863/1. The Dial-a-Molecule work was also supported by the following Grants: EP/H034447/1 and EP/K004840/1. BioSistemika’s ELN Survey and Webinar were financed by BioSistemika LLC as a part of their ELN market research.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional files
13321_2017_221_MOESM1_ESM.zip as Additional file 1: A file describing how to access the focus groups and lab observations transcripts from Study D—University of Southampton Lab Practice Study (Focus Groups & Lab Observations).
References
- ↑ Shankar, K. (2004). "Recordkeeping in the Production of Scientific Knowledge: An Ethnographic Study". Archival Science 4 (3–4): 367–382. doi:10.1007/s10502-005-2600-1.
- ↑ Coppin, P.; Hockema, S.A. (2009). "Learning from the information workspace of an information professional with dyslexia and ADHD". 2009 IEEE Toronto International Conference on Science and Technology for Humanity (TIC-STH) 2009: 801–07. doi:10.1109/TIC-STH.2009.5444390.
- ↑ Boulton, G.; Campbell, P.; Collins, B. et al. (2012) (PDF). Science as an Open Enterprise. The Royal Society. pp. 105. ISBN 9780854039623. https://royalsociety.org/~/media/Royal_Society_Content/policy/projects/sape/2012-06-20-SAOE.pdf.
- ↑ 4.0 4.1 Frey, J.G.; De Roure, D.; Schraefel, M.C. et al. (2003). "Context Slicing the Chemical Aether". First International Workshop on Hypermedia and the Semantic Web, United Kingdom. http://eprints.soton.ac.uk/id/eprint/258790.
- ↑ 5.0 5.1 Yeh, R.; Liao, C.; Klemmer, S. et al. (2006). "ButterflyNet: A mobile capture and access system for field biology research". Proceedings of the SIGCHI Conference on Human Factors in Computing Systems 2006: 571–580. doi:10.1145/1124772.1124859.
- ↑ Borman, S. (1994). "Electronic Laboratory Notebooks May Revolutionize Research Record Keeping". Chemical & Engineering News 72 (21): 10–20. doi:10.1021/cen-v072n021.p010.
- ↑ 7.0 7.1 7.2 7.3 Bird, C.L.; Willoughby, C.; Frey, J.G. (2013). "Laboratory notebooks in the digital era: The role of ELNs in record keeping for chemistry and other sciences". Chemical Society Reviews 42 (20): 8157–75. doi:10.1039/c3cs60122f. PMID 23864106.
- ↑ Hice, R.C. (15 May 2009). "Roadmap to a Clear Definition of ELN". Scientific Computing. Advantage Business Media. https://www.scientificcomputing.com/blog/2009/05/roadmap-clear-definition-eln. Retrieved 07 January 2017.
- ↑ 9.0 9.1 Kaiser, J. (18 January 2017). "Rigorous replication effort succeeds for just two of five cancer papers". Science. American Association for the Advancement of Science.
- ↑ Taylor, K.T. "19. Evolution of Electronic Laboratory Notebooks". In Ekins, S.; Hupcey, M.A.Z.; Williams, A.J. Collaborative Computational Technologies for Biomedical Research. John Wiley & Sons. pp. 301–320. doi:10.1002/9781118026038.ch19. ISBN 9781118026038.
- ↑ Wilson, D.F. (2011). "Purposive variation in recordkeeping in the academic molecular biology laboratory". University of Glasgow. http://theses.gla.ac.uk/2482/. Retrieved 07 January 2017.
- ↑ 12.0 12.1 12.2 12.3 Myers, J.D.; Mendoza, E.S.; Hoopes, B. (2001). Hamza, M.H. ed. "A Collaborative Electronic Laboratory Notebook". Proceedings of the IASTED International Conference, Internet and Multimedia Systems and Applications, August 13-16, 2001, Honolulu, Hawaii, USA: 334–338. http://pdf.aminer.org/000/877/371/computational_experiments_using_distributed_tools_in_a_web_based_electronic.pdf.
- ↑ 13.0 13.1 13.2 Badiola, K.A.; Bird, C.; Brocklesby, W.S. et al. (2015). "Experiences with a researcher-centric ELN". Chemical Science 6 (3): 1614–1629. doi:10.1039/C4SC02128B.
- ↑ Fakas, G.J.; Nguyen, A.V.; Gillet, D. (2005). "The Electronic Laboratory Journal: A Collaborative and Cooperative Learning Environment for Web-Based Experimentation". Computer Supported Cooperative Work 14 (3): 189–216. doi:10.1007/s10606-005-3272-3.
- ↑ 15.0 15.1 Hughes, G.; Mills, H.; De Roure, D. et al. (2004). "The semantic smart laboratory: A system for supporting the chemical eScientist". Organic and Biomolecular Chemistry 2 (22): 3284–93. doi:10.1039/B410075A. PMID 15534706.
- ↑ 16.0 16.1 16.2 16.3 Coles, S.J.; Frey, J.G.; Bird, C.L. et al. (2013). "First steps towards semantic descriptions of electronic laboratory notebook records". Journal of Cheminformatics 5 (1): 52. doi:10.1186/1758-2946-5-52. PMC PMC3878183. PMID 24360292. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3878183.
- ↑ 17.0 17.1 17.2 17.3 17.4 Talbott, T.; Peterson, M.; Schwidder, J. et al. (2005). "Adapting the electronic laboratory notebook for the semantic era". Proceedings of the 2005 International Symposium on Collaborative Technologies and Systems 2005: 136–143. doi:10.1109/ISCST.2005.1553305.
- ↑ Borkum, M.; Lagoze, C.; Frey, J. et al. (2010). "A Semantic eScience Platform for Chemistry". 2010 IEEE Sixth International Conference on e-Science 2010: 316–323. doi:10.1109/eScience.2010.18.
- ↑ 19.0 19.1 19.2 19.3 Goddard, N.H.; Macneil, R.; Ritchie, J. (2009). "eCAT: Online electronic lab notebook for scientific research". Automated Experimentation 1: 4. doi:10.1186/1759-4499-1-4.
- ↑ 20.0 20.1 20.2 Rudolphi, F.; Goossen, L.J. (2012). "Electronic laboratory notebook: The academic point of view". Journal of Chemical Information and Modeling 52 (2): 293–301. doi:10.1021/ci2003895. PMID 22077095.
- ↑ "BioSistemika". BioSistemika d.o.o. https://biosistemika.com/. Retrieved 07 January 2017.
- ↑ Willoughby, C.; Frey, J.G. "From paper notebooks to digital research management: Designing digital tools for laboratory scientists". In preparation.
- ↑ "Evernote". Evernote Corporation. https://evernote.com/. Retrieved 07 January 2017.
- ↑ "OneNote". Microsoft Corporation. https://www.onenote.com/. Retrieved 07 January 2017.
- ↑ 25.0 25.1 Oleksik, G.; Milic-Frayling, N.; Jones, R. (2014). "Study of electronic lab notebook design and practices that emerged in a collaborative scientific environment". Proceedings of the 17th ACM conference on Computer Supported Cooperative Work & Social Computing 2014: 120–133. doi:10.1145/2531602.2531709.
- ↑ 26.0 26.1 Voegele, C.; Bouchereau, B.; Robinot, N. et al. (2013). "A universal open-source electronic laboratory notebook". Bioinformatics 29 (13): 1710–2. doi:10.1093/bioinformatics/btt253. PMID 23645817.
- ↑ 27.0 27.1 Walsh, E.; Choi, I. (2013). "Using Evernote as an electronic lab notebook in a translational science laboratory". Journal of Laboratory Automation 18 (3): 229–34. doi:10.1177/2211068212471834. PMID 23271786.
- ↑ "Google Terms of Service". Google, Inc. https://www.google.com/intl/en/policies/terms/. Retrieved 16 May 2017.
- ↑ "Office 365 Privacy Statement". Microsoft Corporation. https://www.microsoft.com/online/legal/v2/?docid=43. Retrieved 16 May 2017.
- ↑ Conger, K. (14 December 2016). "Evernote’s new privacy policy allows employees to read your notes". TechCrunch. Oath, Inc. https://techcrunch.com/2016/12/14/evernotes-new-privacy-policy-allows-employees-to-read-your-notes/. Retrieved 16 May 2017.
- ↑ 31.0 31.1 "Electronic Laboratory Notebook Products". Atrium Research & Consulting, LLC. 26 December 2016. http://www.atriumresearch.com/eln.html. Retrieved 07 January 2017.
- ↑ "ELN vendor". LIMSwiki.org. Laboratory Informatics Institute, Inc. https://www.limswiki.org/index.php/ELN_vendor. Retrieved 07 January 2017.
- ↑ Rubacha, M.; Rattan, A.K.; Hosselet, S.C. (2011). "A review of electronic laboratory notebooks available in the market today". Journal of Laboratory Automation 16 (1): 90–8. doi:10.1016/j.jala.2009.01.002. PMID 21609689.
- ↑ "eNovalys LabBook". eNovalys, Inc. http://www.enovalys.com/book. Retrieved 16 May 2017.
- ↑ "KineMatik Electronic Lab Notebook". KineMatik Ltd. http://www.kinematik.com/solutions/industry/electronic-laboratory-notebook. Retrieved 16 May 2017.
- ↑ Klokmose, C.N.; Zander, P.O. (2010). "Rethinking Laboratory Notebooks". Proceedings of COOP 2010: 119–139. doi:10.1007/978-1-84996-211-7_8.
- ↑ 37.0 37.1 Mackay, W.E.; Pothier, G.; Letondal, C. et al. (2002). "The missing link: Augmenting biology laboratory notebooks". Proceedings of the 15th Annual ACM Symposium on User Interface Software and Technology: 41–50. doi:10.1145/571985.571992.
- ↑ Tabard, A.; Mackay, W.E.; Eastmond, E. (2008). "From individual to collaborative: The evolution of prism, a hybrid laboratory notebook". Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work: 569–578. doi:10.1145/1460563.1460653.
- ↑ Brandl, P.; Richter, C.; Haller, M. (2010). "NiCEBook: Supporting natural note taking". Proceedings of the SIGCHI Conference on Human Factors in Computing Systems: 599–608. doi:10.1145/1753326.1753417.
- ↑ Cooke, R.; Schraefel, M.C. (2004). "Signature Flip and Clip: Virtually Flipping and Dog Earing Pages in a Digital Lab Book". Proceedings of User Interface Software Technology (UIST), Mexico. https://eprints.soton.ac.uk/259251/.
- ↑ Drake, D.J. (2007). "ELN implementation challenges". Drug Discovery Today: 647–9. doi:10.1016/j.drudis.2007.06.010. PMID 17706546.
- ↑ Bruce, S. (31 December 2002). "A Look at the State of Electronic Lab Notebook Technology". Scientific Computing. Advantage Business Media.
- ↑ 43.0 43.1 43.2 Reimer, Y.J.; Douglas, S.A. (2004). "Ethnography, Scenario-Based Observational Usability Study, and Other Reviews Inform the Design of a Web-Based E-Notebook". International Journal of Human-Computer Interaction 17 (3): 403–426.
- ↑ 44.0 44.1 44.2 Shankar, K. (2007). "Order from chaos: The poetics and pragmatics of scientific recordkeeping". Journal of the Association for Information Science and Technology 58 (10): 1457–1466. doi:10.1002/asi.20625.
- ↑ Zaki, Z.M.; Dew, P.M.; Lau, L.M.S. et al. (2013). "Architecture design of a user-orientated electronic laboratory notebook: A case study within an atmospheric chemistry community". Future Generation Computer Systems 29 (8): 2182–2196. doi:10.1016/j.future.2013.04.011.
- ↑ "sciNote". sciNote, LLC. http://scinote.net/. Retrieved 07 January 2017.
- ↑ 47.0 47.1 di Vimercati, S.De C.; Foresti, S.; Samarati, P. (2015). "Data Security Issues in Cloud Scenarios". In Jajodia, S.; Mazumdar, C. Information Systems Security. Lecture Notes in Computer Science. 9478. pp. 3–10. doi:10.1007/978-3-319-26961-0_1. ISBN 9783319269610.
- ↑ 48.0 48.1 Thiebes, S.; Dehling, T.; Sunyaev, A. (2016). "One Size Does Not Fit All: Information Security and Information Privacy for Genomic Cloud Services". ECIS 2016 Proceedings. https://ssrn.com/abstract=2770946.
- ↑ Schraefel, M.C.; Hughes, G.; Mills, H. et al. (2004). "Breaking the Book: Translating the Chemistry Lab Book into a Pervasive Computing Lab Environment". CHI 2004, Austria. https://eprints.soton.ac.uk/258780/.
- ↑ Schraefel, M.C.; Hughes, G.; Mills, H. et al. (2004). "Making Tea: Iterative Design through Analogy". Designing Interactive Systems, 2004, United States. https://eprints.soton.ac.uk/258672/.
Notes
This presentation is faithful to the original, with only a few minor changes to presentation, including regionalizing spelling. In some cases important information was missing from the references, and that information was added. The original lists references in alphabetical order; this version lists them in order of appearance due to the nature of the wiki. Several URLs and access dates were also cleaned up from the original. The original question mark that appeared at the end of the title was removed from the wiki page title as it causes rendering issues elsewhere; the question mark was retained elsewhere.