'''Publication date''': November 2021


==The "laboratory of the future"==
==Introduction==
The [[laboratory]] of the future” (LOF) makes for an interesting playground of concepts. People's view of the LOF is often colored by their commercial and research interests. Does the future mean tomorrow, next month, six years, or twenty years from now? In reality, it means all of those time spans coupled with the length of a person's tenure in the lab, and the legacy they want to leave behind.
The purpose of this work is to provide one person's perspective on planning for the use of computer systems in the [[laboratory]], and with it a means of developing a direction for the future. Rather than concentrating on “science first, support systems second,” it reverses that order, recommending the construction of a solid support structure before populating the lab with systems and processes that produce knowledge, information, and data (K/I/D).
 
===Intended audience===
This material is intended for those working in laboratories of all types. The biggest benefit will come to those working in startup labs, since they have a clean slate to work with, and to those freshly entering scientific work, since it will help them understand the roles of various systems. Those working in existing labs will also benefit by seeing a different perspective than they may be used to, giving them an alternative path for evaluating their current structure and how they might adjust it to improve operations.
 
However, all labs in a given industry can benefit from this guide, since one of its key points is the development of industry-wide guidelines for solving technology management and planning issues, improving personnel development, and more effectively addressing common projects in automation, instrument communications, and vendor relationships (resulting in lower costs and higher success rates). Such guidelines would also provide a basis for evaluating new technologies (reducing risks to early adopters) and fostering product development with the necessary product requirements for a particular industry.
 
===About the content===
This material follows in the footsteps of more than 15 years of writing and presentation on the topic. That writing and presentation—[[Book:LIMSjournal - Laboratory Technology Special Edition|compiled here]]—includes:
 
*[[LII:Are You a Laboratory Automation Engineer?|''Are You a Laboratory Automation Engineer?'' (2006)]]
*[[LII:Elements of Laboratory Technology Management|''Elements of Laboratory Technology Management'' (2014)]]
*[[LII:A Guide for Management: Successfully Applying Laboratory Systems to Your Organization's Work|''A Guide for Management: Successfully Applying Laboratory Systems to Your Organization's Work'' (2018)]]
*[[LII:Laboratory Technology Management & Planning|''Laboratory Technology Management & Planning'' (2019)]]
*[[LII:Notes on Instrument Data Systems|''Notes on Instrument Data Systems'' (2020)]]
*[[LII:Laboratory Technology Planning and Management: The Practice of Laboratory Systems Engineering|''Laboratory Technology Planning and Management: The Practice of Laboratory Systems Engineering'' (2020)]]
*[[LII:Considerations in the Automation of Laboratory Procedures|''Considerations in the Automation of Laboratory Procedures'' (2021)]]
*[[LII:The Application of Informatics to Scientific Work: Laboratory Informatics for Newbies|''The Application of Informatics to Scientific Work: Laboratory Informatics for Newbies'' (2021)]]
 
While that material covers some of the “where do we go from here” discussions, I want to bring a lot of it together in one spot so that we can see what the entire picture looks like, while still leaving some of the details to the titles above. Admittedly, there have been some changes in thinking over time from what was presented in those pieces. For example, the concept of "laboratory automation engineering" has morphed into "laboratory systems engineering," given that in the past 15 years the scope of laboratory automation and computing has broadened significantly. Additionally, references to "scientific manufacturing" are now replaced with "scientific production," since laboratories tend to produce ideas, knowledge, results, information, and data, not tangible widgets. And as laboratories continue to evolve dynamically, more such changes will likely follow.
 
Of special note are 2019's ''Laboratory Technology Management & Planning'' webinars, which provide additional useful background for what is covered in this guide.
 
==Looking forward and back: Where do we begin?==
===The "laboratory of the future" and laboratory systems engineering===
The “laboratory of the future” (LOF) makes for an interesting playground of concepts. People's view of the LOF is often colored by their commercial and research interests. Does the future mean tomorrow, next month, six years, or twenty years from now? In reality, it means all of those time spans coupled with the length of a person's tenure in the lab, and the legacy they want to leave behind.


However, with those varied time spans we’ll need flexibility and adaptability for [[Information management|managing data]] and [[information]] while also preserving access and utility for the products of lab work, and that requires organization and planning. Laboratory equipment will change, and storage media and data formats will evolve. The instrumentation used to collect data and information will change, and so will the computers and software applications that manage that data and information. Every resource expended in executing lab work has gone toward developing knowledge, information, and data (K/I/D). How are you going to meet that management challenge and retain the expected return on investment (ROI)? Answering that question will be one of the hallmarks of the LOF. It will require a deliberate plan that touches on every aspect of lab work: people, equipment and systems choices, and relationships with vendors and information technology support groups. Some points reside within the lab while others require coordination with corporate groups, particularly when we address long-term storage, ease of access, and security (both physical and electronic).
Modern laboratory work is a merger of science and information technology. Some of the information technology is built into instruments and equipment, the remainder supports those devices or helps manage operations. That technology needs to be understood, planned, and engineered into smoothly functioning systems if labs are to function at a high level of performance.


Given all that, how do we prepare for the LOF, whatever that future turns out to be? One step is the development of “laboratory systems engineering” as a means of bringing structure and discipline to the use of [[Informatics (academic field)|informatics]], robotics, and [[Laboratory automation|automation]] to lab systems.
 
But who is the laboratory systems engineer (LSE)? The LSE is someone conversant in both the laboratory science and IT worlds, able to relate them to each other to the benefit of lab operations while guiding IT in performing its role. A fully dedicated LSE will understand a number of important principles:
 
#Knowledge, information, and data should always be protected, available, and usable.
#[[Data integrity]] is paramount.
#Systems and their underlying components should be supportable, meaning they are proven to meet users' needs (validated), capable of being modified without causing conflicts with results produced by previous versions, documented, upgradable (without major disruption to lab operations), and able to survive upgrades in connected systems.
#Systems should be integrated into lab operations and not exist as isolated entities, unless there are overriding concerns.
#Systems should be portable, meaning they are able to be relocated and installed where appropriate, and not restricted to a specific combination of hardware and/or software that can’t be duplicated.
#There should be a smooth, reproducible (bi-directional, if appropriate), error-free (including error detection and correction) flow of results, from data generation to the point of use or need (see the sketch following this list).
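To make that last principle concrete, here is a minimal sketch in Python of a checksummed result transfer with error detection. The message format and function names are invented for illustration, not taken from any vendor's API; a production system would add retransmission and audit logging, but the principle of verifying before accepting is the same.

<syntaxhighlight lang="python">
import hashlib
import json

def package_result(sample_id, analyte, value, units):
    """Wrap a result with a checksum so the receiver can detect corruption."""
    payload = {"sample_id": sample_id, "analyte": analyte,
               "value": value, "units": units}
    digest = hashlib.sha256(
        json.dumps(payload, sort_keys=True).encode("utf-8")).hexdigest()
    return {"payload": payload, "sha256": digest}

def receive_result(message):
    """Recompute the checksum; reject the transfer if it doesn't match."""
    expected = hashlib.sha256(
        json.dumps(message["payload"], sort_keys=True).encode("utf-8")).hexdigest()
    if expected != message["sha256"]:
        raise ValueError("Transfer error detected; request retransmission")
    return message["payload"]

# The sending system packages a result; the receiving system verifies it.
msg = package_result("S-2021-0415", "pH", 7.02, "pH units")
print(receive_result(msg))
</syntaxhighlight>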
 
===But how did we get here?===
The primary purpose of laboratory work is developing and carrying out scientific methods and experiments, which are used to answer questions. We don’t want to lose sight of that, or the skill needed to do that work. Initially the work was done manually, which inevitably limited the amount of data and information that could be produced, and in turn the rate at which new knowledge could be developed and distributed.
 
However, the introduction of electronic instruments changed that, and the problem shifted from data and information production to data and information utilization (including the development of new knowledge), distribution, and management. That’s where we are today.
 
Science plays a role in the production of data and information, as well as the development of knowledge. In between we have the tools used in data and information collection, storage, [[Data analysis|analysis]], etc. That’s what we’ll be talking about in this document. The equipment and concepts we’re concerned with here are the tools used to assist in conducting that work and working with the results. They are enablers and amplifiers of lab processes. As in almost any application, the right tools used well are an asset.
 
[[File:Fig1 Liscouski DirectLabSysOnePerPersp21.png|400px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="400px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure 1.''' Databases for knowledge, information, and data (K/I/D) are represented as ovals, and the processes acting on them as arrows.<ref>{{Cite book |last=Liscouski, J. |year=2015 |title=Computerized Systems in the Modern Laboratory: A Practical Guide |publisher=PDA/DHI |pages=432 |asin=B010EWO06S |isbn=978-1933722863}}</ref></blockquote>
|-
|}
|}
 
One problem with the distinction between science and informatics tools is that lab personnel understand the science, but they largely don't understand the intricacies of information and computing technologies that comprise the tools they use to facilitate their work. Laboratory personnel are educated in a variety of scientific disciplines, each having its own body of knowledge and practices, each requiring specialization. Shouldn't the same apply to the expertise needed to address the “tools”? On one hand we have [[Chromatography|chromatographers]], [[Spectroscopy|spectroscopists]], physical chemists, toxicologists, etc., and on the other roboticists, database experts, network specialists, and so on. In today's reality these are not IT specialists but rather LSEs who understand how to apply the IT tools to lab work with all the nuances and details needed to be successful.
 
===Moving forward===
If we are going to advance laboratory science and its practice, we need the right complement of experienced people. Moving forward to address laboratory- and organization-wide productivity needs a different perspective; rather than ask how we improve things at the bench, ask how we improve the processes and organization.
 
This guide is about long-term planning for lab automation and K/I/D management. The material in this guide is structured in sections, and each section starts with a summary, so you can read the summary and decide if you want more detail. However, before you toss it on the “later” pile, read the next few bits and then decide what to do.
 
Long-term planning is essential to organizational success. The longer you put it off, the more expensive it will be, the longer it will take to do it, and the more entrenched behaviors you'll have to overcome. Additionally, you won’t be in a position to take advantage of the developing results.
 
It’s time to get past the politics and the inertia and move forward. Someone has to take the lead on this, full-time or part-time, depending on the size of your organization (i.e., if there's more than one lab, know that it affects all of them). Leadership should come from the lab side, not the IT side, as IT people may not have the backgrounds needed and may view everything through their organization's priorities. (However, their support will be necessary for a successful outcome.)
 
The work conducted on the lab bench produces data and information. That is the start of realizing the benefits from research and testing work. The rest depends upon your ability to work with that data and information, which in turn depends on how well your data systems are organized and managed. This culminates in maximizing benefit at the least cost, i.e., ROI. It’s important to you, and it’s important to your organization.
 
Planning has to be done on at least four levels:
 
#Industry-wide (e.g., biotech, mining, electronics, cosmetics, food and beverage, plastics, etc.)
#Within your organization
#Within your lab
#Within your lab processes
 
One important aspect of this planning process—particularly at the top, industry-wide level—is the specification of a framework to coordinate product, process (methods), or standards research and development at the lower levels. This industry-wide framework is ideally not a “this is what you must do” mandate but rather a common structure that can be adapted to make the work easier, and that can serve as a basis for approaching vendors for products and product modifications that will benefit those in your industry, giving vendors confidence that the requests have broader market appeal. If an industry-wide approach isn’t feasible, then larger companies may group together to provide the needed leadership. Note, however, that this should not be perceived as an industry/company vs. vendor effort; rather, this is an industry/company working with vendors. The idea of a large group effort is to demonstrate a consensus viewpoint and that vendors' development efforts won’t be in vain.
 
The development of this framework, among other things, should cover:
 
*Informatics
*Communications (networking, instrument control and data, informatics control and data, etc.)
*Physical security (including power)
*Data integrity and security
*[[Cybersecurity]]
*The FAIR principles (the findability, accessibility, interoperability, and reusability of data<ref name="WilkinsonTheFAIR16">{{cite journal |title=The FAIR Guiding Principles for scientific data management and stewardship |journal=Scientific Data |author=Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J. et al. |volume=3 |pages=160018 |year=2016 |doi=10.1038/sdata.2016.18 |pmid=26978244 |pmc=PMC4792175}}</ref>)
*The application of [[Cloud computing|cloud]] and [[virtualization]] technologies
*Long-term usable access to lab information in databases without vendor controls (i.e., the impact of [[software as a service]] and other software subscription models)
*Bi-directional [[Data exchange|data interchange]] between archived instrument data in standardized formats and vendor software, requiring tamper-proof formats
*Instrument design for automation
*Sample storage management
*Guidance for automation
*Education for lab management and lab personnel
*The conversion of manual methods to semi- or fully automated systems
 
These topics affect both a lab's science personnel and its LSEs. While some topics will be of more interest to the engineers than the scientists, both groups have a stake in the results, as do any IT groups.
 
As digital systems become more entrenched in scientific work, we may need to restructure our thinking from “lab bench” and “informatics” to “data and information sources” and “digital tools for working with, organizing, and managing those elements.” Data and information sources can extend to third-party labs and other published material. We have to move from digital systems causing incremental improvements (today’s approach) to a true revolutionary restructuring of how science is conducted.
 
==Laboratory computing==
'''Key point''': ''Laboratory systems are planned, designed, and engineered. They are not simply a collection of components. Laboratory computing is a transformational technology, one which has yet to fully emerge, in large part because those who work with computing in laboratories aren’t sufficiently educated about it to take full advantage of it.''
 
Laboratory computing has always been viewed as an "add-on" to traditional laboratory work. These add-ons have the potential to improve our work, make it faster, and make it more productive (see Appendix 1 for more details).
 
The common point of view in the discussion of lab computing has been focused on the laboratory scientist or manager, with IT providing a supporting role. That isn’t the only viewpoint available to us, however. Another is that of the laboratory systems engineer (LSE), who focuses on data and information flow. This latter viewpoint should compel us to reconsider the role of computing in the laboratory and the higher-level needs of laboratory operations.
 
Why is this important? Data and information generation may represent the end of the lab bench process, but it’s just the beginning of its use in broader scientific work. The ability to take advantage of those elements in the scope of manufacturing and corporate research and development (R&D) is where the real value is realized. That requires planning for storage, access, and utilization over the long term.
 
The problem with the traditional point of view (i.e., instrumentation first, with computing in a supporting role) is that the data and information landscape is built around the portion of lab work that is the most likely to change (Figure 2). You wind up building an information architecture to meet the requirements of diverse data structures instead of making that architecture part of the product purchase criteria. Systems are installed as needs develop, not as part of a pre-planned information architecture.
 
[[File:Fig2 Liscouski DirectLabSysOnePerPersp21.png|650px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="650px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure 2.''' Hierarchy of lab systems, noting frequency of change</blockquote>
|-
|}
|}
 
Designing an informatics architecture has some things in common with building a house. You create the foundation first, a supporting structure that everything sits on. Adding the framework sets up the primary living space, which can be modified as needed without disturbing the foundation (Figure 3). If you built the living space first and then wanted to install a foundation, you’d have a mess to deal with.
 
[[File:Fig3 Liscouski DirectLabSysOnePerPersp21.png|700px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="700px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure 3.''' Comparison of foundation and living/working space levels in an organization</blockquote>
|-
|}
|}
 
The same holds true with [[laboratory informatics]]. Set the foundation—the [[laboratory information management system]] (LIMS), [[electronic laboratory notebook]] (ELN), [[scientific data management system]] (SDMS), etc.—first, then add the data and information generation systems. That gives you a common set of requirements for making connections and can clarify some issues in product selection and systems integration. It may seem backwards if your focus is on data and information production, but as soon as you realize that you have to organize and manage those products, the benefits will be clear.
 
You might wonder how you go about setting up a LIMS, ELN, etc. before the instrumentation is set. However, it isn’t that much of a problem. You know why your lab is there and what kind of work you plan to do. That will guide you in setting up the foundation. The details of tests can be added as needed. Most of that depends on having your people educated in what the systems are and how to use them.
 
Our comparison between a building and information systems does bring up some additional points. A building's access to utilities runs through control points; water and electricity don’t come in from the public supply to each room but run through central control points that include a distribution system with safety and management features. We need the same thing in labs when it comes to network access. In our current society, access to private information for profit is a fact of life. While there are desirable features of lab systems available through network access (remote checking, access to libraries, updates, etc.), that access should be controlled so that those with malicious intent are denied entry, and data and information are protected. Should instrument systems and office computers have access to corporate and external networks? That’s your decision, and it revolves around how you want your lab run, as well as other applicable corporate policies.
 
The connections layer in Figure 3 is where devices connect to each other and to the major informatics layer. This layer includes two functions: basic networking capability and application-to-application transfer. Take, for example, moving pH measurements to a LIMS or ELN; this is where things can get very messy. You need to define what that transfer looks like and what the standards are to ensure a well-managed system (more on that when we look at industry-wide guidelines).
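As an illustration of that messiness, the following sketch shows the kind of glue code an application-to-application pH transfer otherwise requires. The meter's output line and the LIMS REST endpoint are both hypothetical; real message formats and LIMS APIs vary by vendor.

<syntaxhighlight lang="python">
import json
import re
from urllib import request

# Hypothetical one-line output from a pH meter's serial/ASCII interface,
# e.g., "pH 7.02 25.1C S-2021-0415". Real formats are vendor-specific.
METER_LINE = re.compile(r"pH\s+(?P<ph>\d+\.\d+)\s+(?P<temp>\d+\.\d+)C\s+(?P<sample>\S+)")

def parse_meter_line(line):
    """Turn one raw meter line into a structured result."""
    m = METER_LINE.match(line.strip())
    if not m:
        raise ValueError(f"Unrecognized meter output: {line!r}")
    return {"sample_id": m["sample"], "ph": float(m["ph"]),
            "temperature_c": float(m["temp"])}

def post_to_lims(result, url="http://lims.example/api/results"):
    """POST the parsed result to a (hypothetical) LIMS endpoint."""
    req = request.Request(url, data=json.dumps(result).encode("utf-8"),
                          headers={"Content-Type": "application/json"})
    return request.urlopen(req)  # real code needs authentication and error handling

result = parse_meter_line("pH 7.02 25.1C S-2021-0415")
# post_to_lims(result)  # left commented out; the endpoint is illustrative only
</syntaxhighlight>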
 
To complete the analogy, people do move the living space of a house from one foundation to another, often making for an interesting experience. Similarly, it’s also possible to change the informatics foundation from one product set to another. It means exporting the contents of the database(s) to a product-independent format and then importing into the new system. If you think this is something that might be in your future, make the ability to engage in that process part of the product selection criteria. Like moving a house, it isn’t going to be fun.<ref name="FishOvercoming05">{{cite journal |title=Overcoming the Challenges of a LIMS Migration |journal=Research & Development |author=Fish, M.; Minicuci, D. |volume=47 |issue=2 |year=2005 |url=http://apps.thermoscientific.com/media/SID/Informatics/PDF/Article-Overcoming-the-Challanges.pdf |format=PDF}}</ref><ref name="FishOvercoming13">{{cite web |url=https://www.scientistlive.com/content/overcoming-daunting-business-challenges-lims-migration |title=Overcoming daunting business challenges of a LIMS migration |author=Fish, M.; Minicuci, D. |work=Scientist Live |date=01 April 2013 |accessdate=17 November 2021}}</ref><ref name="FreeLIMSWhat18">{{cite web |url=https://freelims.org/blog/legacy-data-migration-to-lims.html |title=Overcoming the Challenges of Legacy Data Migration |work=FreeLIMS.org |publisher=CloudLIMS.com, LLC |date=29 June 2018 |accessdate=17 November 2021}}</ref> The same holds true for ELN and SDMS.
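As a rough sketch of the export half of such a migration, assuming the outgoing system's database can be read through a standard driver (the table and column names below are invented for illustration; real LIMS schemas differ):

<syntaxhighlight lang="python">
import csv
import sqlite3  # stands in for whatever driver the outgoing system's database requires

def export_results(db_path, out_csv):
    """Dump a results table to a product-independent CSV for re-import elsewhere."""
    conn = sqlite3.connect(db_path)
    cur = conn.execute(
        "SELECT sample_id, test_name, result_value, units, analyst, completed_at "
        "FROM results ORDER BY completed_at")  # invented schema
    with open(out_csv, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow([col[0] for col in cur.description])  # header row
        rows = cur.fetchall()
        writer.writerows(rows)
    conn.close()
    return len(rows)  # row count, for reconciling against the source system
</syntaxhighlight>

A real migration also has to carry audit trails, attachments, and configuration along with the results, which is a large part of what makes it "not fun."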
 
==How automation affects people's work in the lab==
'''Key point''': ''There are two basic ways lab personnel can approach computing: it’s a black box that they don’t understand but is used as part of their work, or they are fully aware of the equipment's capabilities and limitations and know how to use it to its fullest benefit.''
 
While lab personnel may be fully educated in the science behind their work, the computing involved—from pH meters to multi-instrument data systems—is often poorly understood. That is a significant problem because they are responsible for the results that those systems produce, yet they may not be aware of what happens to the signals from the instruments, where the limitations lie, and what can turn a well-executed procedure into junk because an instrument or computer setting wasn’t properly evaluated and used.
 
In reality, automation has both a technical impact on lab work and a personal impact on the people performing it. These are outlined below.
 
Technical impact on work:
 
*It can make routine work easier and more productive, reducing costs and improving ROI (more on that below).
*It can allow work to be performed that might otherwise be too expensive to entertain. There are techniques such as high-throughput screening and statistical experimental design that are useful in laboratory work but might be avoided because the effort of generating the needed data is too labor-intensive and time-consuming. Automated systems can relieve that problem and produce the volumes of data those techniques require.
*It can improve accuracy and reproducibility. Automated systems, properly designed and implemented, are inherently more reproducible than a corresponding manual system.
*It can increase safety by limiting people's exposure to hazardous situations and materials.
*It can also be a financial hole if planning and engineering aren’t properly applied to a project. “Scope creep,” changes in direction, and changes in project requirements and personnel are key reasons that projects are delayed or fail.
 
Impact on the personnel themselves:
 
*It can increase technical specialization, potentially improving work opportunities and people's job satisfaction. Having people move into a new technology area gives them an opportunity to grow both personally and professionally.
*Full automation of a process can cause some jobs to end, or at least change them significantly (more on that below).
*It can elevate routine work to more significant supervisory roles.
 
Most of these impacts are straightforward to understand, but several require further elaboration.
 
===It can make routine work easier and more productive, reducing costs and improving ROI===
This sounds like a standard marketing pitch; is there any evidence to support it? In the 1980s, clinical chemistry labs were faced with a problem: the cost for their services was set on an annual basis without any adjustments permitted for rising costs during that period. If costs rose, income dropped; the more testing they did the worse the problem became. They addressed this problem as a community, and that was a key factor in their success. Clinical chemistry labs do testing on materials taken from people and animals and run standardized tests. This is the kind of environment that automation was created for, and they, as a community, embarked on a total laboratory automation (TLA) program. That program had a number of factors: education, standardization of equipment (the tests were standardized so the vendors knew exactly what they needed in equipment capabilities), and the development of instrument and computer communications protocols that enabled the transfer of data and information between devices (application to application).
 
Organizations such as the American Society for Clinical Laboratory Science (ASCLS) and the American Association for Clinical Chemistry (AACC), among many others, provide members with education and industry-wide support. Other examples include the Clinical and Laboratory Standards Institute (CLSI), a non-profit organization that develops standards for laboratory automation and informatics (e.g., AUTO03-A2 on laboratory automation<ref name="CLSIAUTO03">{{cite web |url=https://clsi.org/standards/products/automation-and-informatics/documents/auto03/ |title=AUTO03 Laboratory Automation: Communications With Automated Clinical Laboratory Systems, Instruments, Devices, and Information Systems, 2nd Edition |publisher=Clinical and Laboratory Standards Institute |date=30 September 2009 |accessdate=17 November 2021}}</ref>), and [[Health Level 7|Health Level Seven, Inc.]], a non-profit organization that provides software standards for [[Data integration|data interoperability]].
 
Given that broad, industry-wide effort to address automation issues, the initial results were as follows<ref name="SarkoziTheEff03">{{cite journal |title=The effects of total laboratory automation on the management of a clinical chemistry laboratory. Retrospective analysis of 36 years |journal=Clinica Chimica Acta |author=Sarkozi, L.; Simson, E.; Ramanathan, L. |volume=329 |issue=1–2 |pages=89–94 |year=2003 |doi=10.1016/S0009-8981(03)00020-2}}</ref>:
 
*Between 1965 and 2000, the Consumer Price Index increased by a factor of 5.5 in the United States.
*During the same 36 years, at Mount Sinai Medical Center's chemistry department, the productivity (indicated as the number of reported test results/employee/year) increased from 10,600 to 104,558 (9.3-fold).
*When expressed in constant 1965 dollars, the total cost per test decreased from $0.79 to $0.15.
 
In addition, the following data (Tables 1 and 2) from Dr. Michael Bissell of Ohio State University provides further insight into the potential increase in labor productivity from implementing TLA in the lab<ref name="BissellTotal14">{{cite web |url=https://www.youtube.com/watch?v=RdwFZyYE_4Q |title=Total Laboratory Automation - Michael Bissell, MD, Ph.D |work=YouTube |publisher=University of Washington |date=15 July 2014 |accessdate=17 November 2021}}</ref>:
 
{|
| style="vertical-align:top;" |
{| class="wikitable" border="1" cellpadding="5" cellspacing="0" width="100%"
|-
  | colspan="4" style="background-color:white; padding-left:10px; padding-right:10px;" |'''Table 1.''' Overall productivity of labor<br />&nbsp;<br />FTE = Full-time equivalent; TLA = Total laboratory automation
|-
  ! style="padding-left:10px; padding-right:10px;" |Ratio
  ! style="padding-left:10px; padding-right:10px;" |Pre-TLA
  ! style="padding-left:10px; padding-right:10px;" |Post-Phase 1
  ! style="padding-left:10px; padding-right:10px;" |Change
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Test/FTE
  | style="background-color:white; padding-left:10px; padding-right:10px;" |50,813
  | style="background-color:white; padding-left:10px; padding-right:10px;" |64,039
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +27%
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Tests/Tech FTE
  | style="background-color:white; padding-left:10px; padding-right:10px;" |80,058
  | style="background-color:white; padding-left:10px; padding-right:10px;" |89,120
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +11%
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Tests/Paid hour
  | style="background-color:white; padding-left:10px; padding-right:10px;" |20.8
  | style="background-color:white; padding-left:10px; padding-right:10px;" |52.9
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +24%
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Tests/Worked hour
  | style="background-color:white; padding-left:10px; padding-right:10px;" |24.4
  | style="background-color:white; padding-left:10px; padding-right:10px;" |30.8
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +26%
|-
|}
|}
 
{|
| style="vertical-align:top;" |
{| class="wikitable" border="1" cellpadding="5" cellspacing="0" width="100%"
|-
  | colspan="4" style="background-color:white; padding-left:10px; padding-right:10px;" |'''Table 2.''' Productivity of labor-processing area<br />&nbsp;<br />FTE = Full-time equivalent; TLA = Total laboratory automation
|-
  ! style="padding-left:10px; padding-right:10px;" |Ratio
  ! style="padding-left:10px; padding-right:10px;" |Pre-TLA
  ! style="padding-left:10px; padding-right:10px;" |Post-Phase 1
  ! style="padding-left:10px; padding-right:10px;" |Change
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Specimens/Processing FTE
  | style="background-color:white; padding-left:10px; padding-right:10px;" |39,899
  | style="background-color:white; padding-left:10px; padding-right:10px;" |68,708
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +72%
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Specimens/Processing paid hour
  | style="background-color:white; padding-left:10px; padding-right:10px;" |19.1
  | style="background-color:white; padding-left:10px; padding-right:10px;" |33.0
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +73%
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |Requests/Processing paid hour
  | style="background-color:white; padding-left:10px; padding-right:10px;" |12.7
  | style="background-color:white; padding-left:10px; padding-right:10px;" |21.5
  | style="background-color:white; padding-left:10px; padding-right:10px;" | +69%
|-
|}
|}
 
Benefits attributable to TLA include:
 
*Improved sample throughput
*Cost reduction
*Less variability in the data
*Reduced and more predictable consumption of materials
*Improved use of people's talents
 
While this data is almost 20 years old, it illustrates the impact of a change from a manual system to an automated lab environment. It also gives us an idea of what might be expected if industry- or community-wide automation programs were developed.
 
Clinical laboratories are not unique in the potential to organize industry-wide standardized aspects of technology and work, and provide education. The same can be done anywhere as long as the ability to perform a particular test procedure isn’t viewed as a unique competitive advantage. The emerging ''[[Cannabis]]'' testing industry represents one such opportunity, among others.
 
The clinical industry has provided a template for the development of laboratory systems and automation. The list of instruments that meet the clinical communications standard continues to grow (e.g., Waters' MassLynx LIMS Interface<ref name="WatersMassLynx16">{{cite web |url=https://www.waters.com/webassets/cms/library/docs/720005731en%20Masslynx%20LIMS%20Interface.pdf |format=PDF |title=MassLynx LIMS Interface |publisher=Waters Corporation |date=November 2016 |accessdate=17 November 2021}}</ref>). Aside from the data dictionary, there is nothing unique about the communications standards that prevents them from being used as a basis for development in other industries. As such, we need to move from an every-lab-for-itself approach to lab systems development toward a more cooperative and synergistic model.
 
===Automation can cause some jobs to end, or at least change them significantly===
One thing that concerns managers and the people under them alike is change. No matter how beneficial the change is for the organization, it raises people’s anxiety levels and can affect their job performance unless they are prepared for it. In that context, questions and concerns staff may have in relation to automating aspects of a job include:
 
*How are these changes going to affect my job and my income? Will it cause me to lose my job? That’s about as basic as it gets, and it can impact people at any organizational level.
*Bringing in new technologies and products means learning new things, and that opens up the possibility that people may fail or not be as effective as they are currently. It can reshuffle the pecking order.
*The process of introducing new equipment, procedures, etc. is going to disrupt the lab's [[workflow]]. The changes may be procedural, structural, or both; how are you going to deal with those issues?
 
Two past examples highlight different approaches. In the first, a multi-instrument automation system was being introduced. Management told the lab personnel what was going to happen and why, and that they would be part of the final acceptance process. If they weren’t happy, the system would be modified to meet their needs. The system was installed, software written to meet their needs, instruments connected, and the system was tested. Everyone was satisfied except one technician, and that proved to be a major roadblock to putting the system into service. The matter was discussed with the lab manager, who didn’t see a problem; as soon as the system was up and running, that technician would be given a new set of responsibilities, something she was interested in. But no one had told her that. As she saw it, once the system came online she was out of a job. (One of the primary methods used in that lab's work was chromatography, with the instrument output recorded on chart paper. Most measurements were done using peak height, but peak area was used for some critical analyses. Those exacting measurements, made with a planimeter, were her responsibility and her unique—as she saw it—contribution to the lab's work. The instrument system replaced the need for this work. The other technicians and chemists had no problem adapting to the data system.) However, a short discussion between her and the lab manager alleviated her concerns.
 
The second example, concerning the implementation of a lab’s first LIMS, was handled a lot differently. The people in the lab knew something was going on but not the details. Individual meetings were held with each member of the lab team to discuss what was being considered and to learn of their impressions and concerns (these sessions were held with an outside consultant, and the results were reported to management in summary form). Once that was completed, the project started, with the lab manager holding a meeting of all lab personnel and IT, describing what was going to be done, why, how the project was going to proceed, and the role of those working in the lab in determining product requirements and selecting the product. The concerns raised in the individual sessions were addressed up-front, and staff all understood that no one was going to lose their job or suffer a pay cut. Yes, some jobs would change, and where appropriate that was discussed with each individual. There was an educational course about what a LIMS was, its role in the lab's work, and how it would improve the lab’s operations. When those sessions were completed, the lab’s personnel looked forward to the project. They were part of the project and the process, rather than having it done without their participation. In short, instead of it happening to them, it happened with them as willing participants.
 
People's attitudes about automation systems, and their willingness to participate in their development, can make a big difference in a project's success or failure. You don’t want people to feel that the incoming system and the questions surrounding it are threatening, or to see it as something that could end their employment. They may not freely participate, or they may leave when you need them the most.
 
All of this may seem daunting for a lab to take on by itself. Large companies may have the resources to handle it, but we need more than a large company to do this right; we need an industry-wide effort.
 
==Development of industry-wide guidelines==
'''Key point''': ''The discussion above about clinical labs illustrates what can be accomplished when an industry group focuses on a technology problem. We need to extend that thinking—and action—to a broader range of industries individually. The benefits of an industry-wide approach to addressing technology and education issues include:''
 
*''providing a wider range of inputs to solving problems;''
*''providing a critical marketing mass to lobby vendors to create products that fit customers’ needs in a particular industry;''
*''developing an organized educational program with emphasis on that industry's requirements;''
*''giving labs (startup and existing) a well-structured reference point to guide them (not dictate) in making technology decisions;''
*''reducing the cost of automation, with improved support; and''
*''where competitive issues aren’t a factor, enabling industry-funded R&D technology development and implementation for production or manufacturing quality, process control (integration of online quality information), and process management.''
 
The proposal: each industry group should define a set of guidelines to assist labs in setting up an information infrastructure. For the most part, large sections would end up similar across multiple industries, as lab operations don't differ that much between some industry sets. The real separation would come in two places: the data dictionaries (data descriptions) and the nature of the testing and automation to implement that testing. Where there is a clear competitive edge to a test or its execution, each company may choose to go it alone, but that still leaves a lot of room for cooperation in method development, addressing both the basic science and its automated implementations, particularly where ASTM, USP, etc. methods are employed.
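For a sense of what the data dictionary piece might look like, here is one hypothetical entry, expressed in Python; the fields shown are illustrative choices, not drawn from any published dictionary.

<syntaxhighlight lang="python">
# A sketch of one entry an industry group's data dictionary might standardize.
PH_RESULT = {
    "name": "ph",
    "label": "pH at 25 °C",
    "type": "decimal",
    "precision": 2,                # reported decimal places
    "valid_range": [0.0, 14.0],    # results outside this range are flagged
    "units": "pH units",
    "method_ref": "ASTM D1293",    # example reference method for pH of water
    "required_context": ["sample_id", "temperature_c", "analyst_id"],
}
</syntaxhighlight>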
 
The benefits of this proposal are noted in the key point above. However, the three most significant ones are arguably:
 
*the development of an organized education program with an emphasis on industry requirements;
*the development of a robust communications protocol for application-to-application transfers; and,
*the ability to lobby vendors from an industry-wide basis for product development, modification, and support.
 
In the "Looking forward and back" section earlier in this guide, we showed a bulleted list of considerations for the development of such a guideline-based framework. What follows is a more organized version of those points, separated into three sections, which all need to be addressed in any industry-based framework. For the purposes of this guide, we'll focus primarily on Section C: "Issues that need concerted attention." Section A on background and IT support, and Section B on lab-specific background information, aren't discussed as they are addressed elsewhere, particularly in the previous works referenced in the Introduction.
 
:'''A.''' General background and routine IT support
::1. Physical security (including power)
::2. Cybersecurity
::3. Cloud and virtualization technologies
 
:'''B.''' Lab-specific background information
::1. Informatics (could be an industry-specific version of ''ASTM E1578 - 18 Standard Guide for Laboratory Informatics''<ref name="ASTME1578_18">{{cite web |url=https://www.astm.org/Standards/E1578.htm |title=ASTM E1578 - 18 Standard Guide for Laboratory Informatics |publisher=ASTM International |date=2018 |accessdate=17 November 2021}}</ref>)
::2. Sample storage management and organization (see Appendix 1, section 1.4 of this guide)
::3. Guidance for automation
::4. Data integrity and security (see S. Schmitt's ''Assuring Data Integrity for Life Sciences''<ref>{{Cite book |editor-last=Schmitt, S. |title=Assuring Data Integrity for Life Sciences |url=https://www.dhibooks.com/assuring-data-integrity-for-life-sciences |publisher=DHI Publishing |pages=385 |isbn=9781933722979}}</ref>, which has broader application outside the life sciences)
::5. The FAIR principles (the findability, accessibility, interoperability, and reusability of data; see Wilkinson ''et al.'' 2016<ref name="WilkinsonTheFAIR16" />)
 
:'''C.''' Issues that need concerted attention:
::1. Education for lab management and lab personnel
::2. The conversion of manual methods to semi- or fully automated methods (see [[LII:Considerations in the Automation of Laboratory Procedures|''Considerations in the Automation of Laboratory Procedures'']] for more)
::3. Long-term usable access to lab information in databases without vendor controls (i.e., the impact of [[software as a service]] and other software subscription models)
::4. Archived instrument data in standardized formats and standardized vendor software (requires tamper-proof formats)
::5. Instrument design for automation (Most instruments and their support software are dual-use, i.e., they work as stand-alone devices via front panels, or through software controls. While this is a useful selling tool—whether for manual or automated use—it means the device is larger, more complex, and more expensive than an automation-only device that uses software [e.g., a smartphone or computer] for everything. Instruments and devices designed for automation should be more compact and permit more efficient automation systems.)
::6. Communications (networking, instrument control and data, informatics control and data, etc.)
 
We'll now go on to expand upon items C-1, C-3, C-4, and C-6.
 
===C-1. Education for lab management and lab personnel===
Laboratory work has become a merger of two disciplines: science and information technology (including robotics). Without the first, nothing happens; without the second, work will proceed, but at a slower, more costly pace. There are different levels of education requirements. Those working at the lab bench need to understand not only the science but also how the instrumentation and supporting computer systems (if any, including those embedded in the instrument) make and transform measurements into results. “The computer did it” is not a satisfactory answer to “how did you get that result,” nor is “magic,” nor is maintaining willful ignorance of exactly how the instrument measurements are taken. Laboratorians are responsible and accountable for the results, and they should be able to explain the process of how they were derived, including how settings on the instrument and computer software affect that work. Lab managers should understand the technologies at the functional level and how the systems interact with each other. They are accountable for the overall integrity of the systems and the data and information they contain. IT personnel should understand how lab systems differ from office applications and why business-as-usual for managing office applications doesn’t work in the lab environment. The computer systems are there to support the instruments, and any changes that may affect that relationship should be initiated with care; the instrument vendor’s support for computer systems upgrades is essential.
 
As such, we need a new personnel position, that of the laboratory systems engineer or LSE, to provide support for the informatics architecture. This isn’t simply an IT person; it should be someone who is fluent in both the science and the information technology applied to lab work. (See [[LII:Laboratory Technology Planning and Management: The Practice of Laboratory Systems Engineering|''Laboratory Technology Planning and Management: The Practice of Laboratory Systems Engineering'']] for more on this topic.)
 
===C-3. Long-term usable access to lab information in databases without vendor controls===
The data and information your lab produces, with the assistance of instrumentation and instrument data systems, is yours, and no one should put limits on your ability to work with it. There is a problem with modern software design: most lab data and information can only be viewed through applications software. In some cases, the files may be used with several applications, but often it is the vendor's proprietary formats that limit access. In those instances, you have to maintain licenses for that software for as long as you need access to the data and information, even if the original application has been replaced by something else. This can happen for a variety of reasons:
 
*Better technology is available from another vendor
*A vendor sold part of its operations to another organization
*Organizations merge
*Completion of a consulting or research contract requires all data to be sent to the contracting organization
 
All of these scenarios, and others, can leave you maintaining multiple versions of similar datasets that people need access to yet don’t want to keep paying license fees for into the future, even as regulatory (FDA, EPA, ISO, etc.) requirements must still be met.
 
All of this revolves around your ability to gain value from your data and information without having to pay for its access. The vendors don’t want to give their software away for free either. What we need is something like the relationship between Adobe Acrobat Reader and the Adobe Acrobat application software. The latter gives you the ability to create, modify, and comment on documents, while the former allows you to view them. The Reader gives anyone the ability to view the contents, just not alter them. We need a “reader” application for instrument data collected and processed by an instrument data system. We need to be able to view the reports, raw data, etc., and export the data in a useful format: everything short of acquiring and processing new data. This gives you the ability to work with your intellectual property and allows it to be viewed by regulatory agencies if that becomes necessary, without incurring unnecessary costs or depriving the vendor of justifiable income.
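In software terms, the "reader" amounts to shipping only the view-and-export surface of the application. The toy sketch below shows that separation; the archived-run file format and field names are invented for illustration.

<syntaxhighlight lang="python">
import json

class InstrumentDataReader:
    """Read-only access to an archived run: view and export, never acquire or reprocess."""

    def __init__(self, path):
        with open(path) as f:
            self._run = json.load(f)  # assumes a JSON-archived run, for illustration

    def report(self):
        meta = self._run["metadata"]
        return f"{meta['instrument']} run {meta['run_id']}: {len(self._run['points'])} points"

    def export_csv(self, out_path):
        with open(out_path, "w") as f:
            f.write("time,signal\n")
            for t, y in self._run["points"]:
                f.write(f"{t},{y}\n")

    # Deliberately absent: acquire(), reprocess(), edit() -- those capabilities
    # stay in the licensed application, just as editing stays in Acrobat.
</syntaxhighlight>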
 
This has become increasingly important as vendors have shifted to a subscription model for software licenses in place of one-time payments with additional charges for voluntary upgrades. One example from another realm illustrates the point. My wife keeps all of her recipes in an application on her iPad. One day she looked for a recipe and received a message that read, roughly, “No access unless you upgrade the app <not free>.” As it turned out, a Google search recommended re-installing the current app instead. It worked, but she upgraded anyhow, just to be safe. It’s your content, but who “owns” it if the software vendor can impose controls on how it is used?
 
As more of our work depends on software, we find ourselves in a constant upgrade loop of new hardware, new operating systems, and new applications just to maintain the status quo. We need more control over what happens to our data. Industry-wide guidelines, backed by the buying power of an industry, could drive vendor policies that mitigate that. Earlier in this document we noted that change is going to continue for a long time, with hardware and software evolving in ways we can’t imagine. Hardware changes (anyone remember floppy disks?) inevitably make almost everything obsolete, so how do we protect our access to data and information? Floppy disks were the go-to media 40 years ago, and since then cassette tapes, magnetic tape, Zip drives, CD-ROMs, DVDs, Syquest drives, and other types of storage media have come and gone. Networked systems, at the moment, are the only consistent and reliable means of exchange and storage as datasets increase in size.
 
One point we have to take into account is that versions of applications will only function on certain versions of operating systems and databases. All three elements are going to evolve, and at some point we’ll have to deal with “old” generations while new ones are coming online. One good answer to that is emulation. Virtualization software such as VMware's products allows packages of operating systems, databases, and applications to operate on computers regardless of their age, with each collection residing in a “container”; we can have several generations of those containers residing on a computer and execute them at will, as if they were still running on the original hardware. If you are looking for the data for a particular sample, go to the container that covers its time period and access it. Emulation packages are powerful tools; using them you can even run Atari 2600 games on a Windows or macOS system.
 
Addressing this issue is generally bigger than what a basic laboratory-based organization can handle, involving policy making and support from information technology support groups and corporate legal and financial management. The policies have to take into account physical locations of servers, support, financing, regulatory support groups, and international laws, even if your company isn’t a multinational (third-party contracting organizations may be). Given this complexity, and the fact that most companies in a given industry will be affected, industry-wide guidelines would be useful.
 
===C-4. Archived instrument data in standardized formats and standardized vendor software===
This has been an area of interest for over 25 years, beginning with the Analytical Instrument Association's work that resulted in a set of ASTM Standard Guides:
 
*''ASTM E2078 - 00(2016) Standard Guide for Analytical Data Interchange Protocol for Mass Spectrometric Data''<ref name="ASTMEE2078_16">{{cite web |url=https://www.astm.org/Standards/E2078.htm |title=ASTM E2078 - 00(2016) Standard Guide for Analytical Data Interchange Protocol for Mass Spectrometric Data |publisher=ASTM International |date=2016 |accessdate=17 November 2021}}</ref>
*''ASTM E1948 - 98(2014) Standard Guide for Analytical Data Interchange Protocol for Chromatographic Data''<ref name="ASTME1948_14">{{cite web |url=https://www.astm.org/Standards/E1948.htm |title=ASTM E1948 - 98(2014) Standard Guide for Analytical Data Interchange Protocol for Chromatographic Data |publisher=ASTM International |date=2014 |accessdate=17 November 2021}}</ref>
 
MilliporeSigma continues to invest in solutions based on the Analytical Information Markup Language (AnIML) standard (an outgrowth of work done at NIST).<ref name="MerckMilli19">{{cite web |url=https://www.emdgroup.com/en/news/bssn-software-06-08-2019.html |title=MilliporeSigma Acquires BSSN Software to Accelerate Customers’ Digital Transformation in the Lab |publisher=Merck KGaA |date=06 August 2019 |accessdate=17 November 2021}}</ref> There have also been a variety of standards programs, all with the goal of moving instrument data into a neutral data format that is free of proprietary interests, allowing it to be used and analyzed as the analyst needs (e.g., JCAMP-DX<ref>{{Cite journal |last=McDonald |first=Robert S. |last2=Wilks |first2=Paul A. |date=1988-01 |title=JCAMP-DX: A Standard Form for Exchange of Infrared Spectra in Computer Readable Form |url=http://journals.sagepub.com/doi/10.1366/0003702884428734 |journal=Applied Spectroscopy |language=en |volume=42 |issue=1 |pages=151–162 |doi=10.1366/0003702884428734 |issn=0003-7028}}</ref> and GAML<ref name="GAML">{{cite web |url=http://www.gaml.org/default.asp |title=Welcome to GAML.org |work=GAML.org |date=22 June 2007 |accessdate=17 November 2021}}</ref>).
 
Data interchange standards can help address issues in two aspects of data analysis: qualitative and quantitative work. In qualitative applications, the exported data can be imported into other packages that provide facilities not found in the original data acquisition system. Examining an infrared spectrum or [[Nuclear magnetic resonance spectroscopy|nuclear magnetic resonance]] (NMR) scan depends upon peak amplitude, shape, and position to provide useful information, and some software (including user-developed software) may provide facilities that the original data acquisition system didn’t. Or it might simply be a matter of having a file to send to someone for review or inclusion in another project.
 
Quantitative analysis is a different matter. Techniques such as chromatography, [[inductively coupled plasma mass spectrometry]] (ICP-MS), and [[atomic absorption spectroscopy]], among others, rely on the measurement of peak characteristics or single-wavelength absorbance measurements in comparison with measurements made on standards. A single chromatogram is useless (for quantitative work) unless the standards are available. (However, it may be useful for qualitative work if retention time references to appropriate known materials are available.) If you are going to export the data for any of the techniques noted, and others as well, you need the full collection of standards and samples for the data to be of any value.
 
Yet there is a problem with some, if not all, of these programs: they trust the integrity of the analyst to use the data honestly. It is possible for people to use these exported formats in ways that circumvent current data integrity practices and falsify results.
 
There are good reasons to want vendor-neutral data formats: data sets can be examined by user-developed software, put into a form where the analysis is not limited by a vendor's product design. Such formats also hold the potential for separating data acquisition from [[data analysis]], as long as all pertinent data and information (e.g., standards and [[Sample (material)|samples]]) are held together in a package that cannot be altered without detection. It may be that something akin to [[blockchain]] technology could be used to register and manage access to datasets on a company-by-company basis (each company having its own registry that would become part of the lab's data architecture).
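
To illustrate the registry idea, here is a minimal sketch assuming a simple hash-chained design. The class, field names, and layout are invented for illustration; a production system would also need access control, persistent storage, and signed entries.

<syntaxhighlight lang="python">
# Minimal sketch (assumed design): a per-company dataset registry where each
# entry chains the previous entry's hash, so tampering is detectable.

import hashlib
import json
import time

def entry_hash(entry):
    """Stable hash of one registry entry."""
    return hashlib.sha256(json.dumps(entry, sort_keys=True).encode()).hexdigest()

class DatasetRegistry:
    def __init__(self):
        self.entries = []

    def register(self, dataset_id, content_hash):
        """Append an entry linked to the hash of the previous entry."""
        prev = entry_hash(self.entries[-1]) if self.entries else "0" * 64
        self.entries.append({"dataset": dataset_id, "content": content_hash,
                             "time": time.time(), "prev": prev})

    def verify(self):
        """Recompute the chain; any altered entry breaks every later link."""
        prev = "0" * 64
        for entry in self.entries:
            if entry["prev"] != prev:
                return False
            prev = entry_hash(entry)
        return True

reg = DatasetRegistry()
reg.register("run-001", "abc123")
reg.register("run-002", "def456")
print(reg.verify())                      # True
reg.entries[0]["content"] = "tampered"
print(reg.verify())                      # False: alteration detected
</syntaxhighlight>
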
 
These standard formats are potentially very useful to labs; they were created by people with a passion for doing something useful and beneficial to the practice of science, sometimes at their own expense. This is another area where a coordinated industry-wide statement of requirements and support would lead to significant advancements in systems development, and enhanced capability for those practicing instrumental science.
 
If these capabilities are important to you, then addressing that need has to be part of an industry-wide conversation and consensus to provide the marketing support to have the work done.
 
===C-6. Communications===
Most computers and electronic devices in the laboratory have communications capability built in: RS-232 (or similar), digital I/O, Ethernet, Bluetooth, Wi-Fi, IEEE-488, etc. What these connection types do is provide the potential for communications; the realization of that potential depends on message formatting, the structure of the contents, and moving beyond proprietary interests to those expressed by an industry-wide community. There are two types of devices that need to be addressed: computer systems that service instruments (instrument data systems), and devices that don’t require an external computer to function (pH meters, balances, etc.) but may have Ethernet ports, serial ASCII, digital I/O, or IEEE-488 connections. In either case, the typical situation is one in which the vendor has determined a communications structure and the user has to adapt their systems to it, often using custom programming to parse messages and take action.
 
The user community needs to determine its needs and make them known, using community-wide buying power as justification for asking for vendor development efforts. As noted earlier, this is not an adversarial situation, but rather a maturation of industry-wide communities working with vendors; the “buying power” comments simply give the vendors the confidence that their development efforts won’t be in vain.
 
In both cases we need application-to-application message structures that meet several needs. They should be able to handle test method identifications, so that the receiving application knows what the message is addressing, as well as whether the content is a list of samples to be processed or samples that have already been processed, with results. Additionally, the worklist message should ideally contain, in attribute-value pairs, the number of samples, with a list of sample IDs to be processed. As for the completed work message, the attribute-value pairs would also ideally contain the number of samples processed, the sample IDs, and the results of the analysis (which could be one or more elements depending on the procedure). Of course, there may be other elements required of these message structures; these are just examples. Ultimately, the final framework could be implemented in a format similar to that used in HTML files: plain text that is easily read and machine- and operating-system-independent.
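
To make the idea concrete, here is a minimal sketch of such a worklist message and a parser for it. The attribute names, method ID, and layout are invented for illustration, not taken from any published standard.

<syntaxhighlight lang="python">
# Minimal sketch (assumed format): a plain-text, attribute-value worklist
# message, readable by both people and machines.

WORKLIST = """\
message_type: worklist
test_method: pH_water_method_01
sample_count: 3
sample_ids: S-001, S-002, S-003
"""

def parse_message(text):
    """Parse 'attribute: value' lines into a dictionary."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(":")
        fields[key.strip()] = value.strip()
    if "sample_ids" in fields:
        fields["sample_ids"] = [s.strip() for s in fields["sample_ids"].split(",")]
    return fields

msg = parse_message(WORKLIST)
assert int(msg["sample_count"]) == len(msg["sample_ids"])
print(msg)
</syntaxhighlight>

Because the structure is standardized, a receiving LIMS or instrument data system would only need such a parser once, rather than one per vendor.
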
 
Mapping the message content to database system structure (LIMS or ELN) could be done using a simple built-in application (within the LIMS or ELN) that would graphically display the received message content on one side of a window, with the database’s available fields on the other side. The two sides would then be graphically mapped to one another, as shown in Figure 4 below. (Note: Since the format of the messages is standardized, we wouldn’t need a separate mapping function to accommodate different vendors.)
 
 
[[File:Fig4 Liscouski DirectLabSysOnePerPersp21.png|600px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="600px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure 4.''' Mapping IDS message contents to database system</blockquote>
|-
|}
|}
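
As a rough sketch of the mapping step in Figure 4, the following uses hypothetical message attribute and LIMS field names; the graphical tool described above would produce an equivalent table behind the scenes.

<syntaxhighlight lang="python">
# Minimal sketch (assumed names): message attributes on one side,
# LIMS database fields on the other, as in Figure 4.

field_map = {
    "test_method": "lims.test_code",
    "sample_ids":  "lims.sample_id",
    "results":     "lims.result_value",
}

def map_to_lims(message, mapping):
    """Translate parsed message attributes into LIMS field assignments."""
    return {mapping[k]: v for k, v in message.items() if k in mapping}

print(map_to_lims({"test_method": "pH_water_method_01",
                   "sample_ids": ["S-001", "S-002"]}, field_map))
</syntaxhighlight>
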
 
The second case—devices like pH meters, etc.—is a little more interesting since these devices don’t have the same facilities a computer system provides. However, in the consumer marketplace this is well-trod ground, using a smartphone or tablet as both an interface and a translation mechanism between small, fixed-function devices and more extensive applications platforms. The only non-standard element is a single-point Bluetooth connection, as shown in Figure 5.
 
 
[[File:Fig5 Liscouski DirectLabSysOnePerPersp21.png|600px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="600px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure 5.''' Using a smartphone or tablet as an interface to a LIMS or ELN. The procedure takes a series of pH measurements for a series of samples.</blockquote>
|-
|}
|}
 
A single-point Bluetooth connection is used to exchange information between the measuring device and the smartphone or tablet. There are multi-point connection devices (the smartphone, for example), but we want to restrict the device to avoid confusion about who is in control of the measuring unit. A setup screen (not shown) would set up the worklist and populate the sample IDs. The “Take Reading” button would read the device and enter the value into the corresponding sample position—taking them in order—and enable the next reading until all samples had been read. “Send” would transmit the formatted message to the LIMS or ELN. Depending on the nature of the device and the procedure being used, the application can take any form; this is just a simple example taking pH measurements for a series of samples. In essence, the device-smartphone combination becomes an instrument data system, and the database setup would be the same as described above.
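
The following sketch approximates the application logic just described. The class, the message layout, and the stubbed Bluetooth read are all hypothetical; a real app would call the meter's actual communications API.

<syntaxhighlight lang="python">
# Minimal sketch (assumed design) of the Figure 5 app: "Take Reading" fills
# the next sample slot in order; "Send" formats the completed worklist.

class PHSession:
    def __init__(self, sample_ids):
        self.readings = {sid: None for sid in sample_ids}
        self.order = list(sample_ids)
        self.position = 0

    def take_reading(self, read_device=lambda: 7.00):  # stub Bluetooth read
        """Read the meter and assign the value to the next sample in order."""
        if self.position >= len(self.order):
            return "all samples read"
        sid = self.order[self.position]
        self.readings[sid] = read_device()
        self.position += 1
        return f"{sid}: {self.readings[sid]}"

    def send(self):
        """Format the completed-work message for the LIMS or ELN."""
        lines = ["message_type: results", f"sample_count: {len(self.order)}"]
        lines += [f"{sid}: {val}" for sid, val in self.readings.items()]
        return "\n".join(lines)

session = PHSession(["S-001", "S-002"])
print(session.take_reading())
print(session.take_reading())
print(session.send())
</syntaxhighlight>
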
 
The bottom line is this: standardized message formats can greatly simplify the interfacing of instrument data systems, LIMS, ELNs, and other laboratory informatics applications. The clinical chemistry community created communications standards that permit meaningful messages to be sent between an instrument data system and a database system, structured so that either end of the link can recognize the message and extract and use its content without the custom programming common to most labs today. There is no reason why the same thing can’t be done in any other industry; it may even be possible to adapt the clinical chemistry protocols. The data dictionary (list of test names and attributes) would have to be adjusted, but that is a minor point that can be handled on an industry-by-industry basis and be incorporated as part of a system installation.
 
What is needed is people coming together as a group and organizing and defining the effort. How important is it to you to streamline the effort of getting systems up and running without custom programming, to work toward the plug-and-play capability that we encounter in consumer systems (an environment where vendors know easy integration is a must or their products won’t sell)?
 
====Additional note on Bluetooth-enabled instruments====
The addition of Bluetooth to a device can result in much more compact systems, making the footprint smaller and reducing the cost of the unit. By using a smartphone to replace the front panel controls, the programming can become much more sophisticated. Imagine a Bluetooth-enabled pH meter and a separate Bluetooth-enabled, electronically controlled titrator. That combination would permit optimized{{Efn|Where the amount of titrant added is adjusted based on the response to the previous addition. This should yield faster titrations with increased accuracy.}} delivery of titrant, making the process faster and more accurate, while also providing a graphical display of the resulting titration curve, putting the full data processing capability of the smartphone and its communications at the service of the experiment. Think about what the addition of a simple clock did for thermostats: it opened the door to programmable thermostats and better controls. What would smartphones controlling simple devices like balances, pH meters, titrators, etc. facilitate?
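
As a sketch of the optimized titrant delivery described in the footnote, the control rule and response curve below are invented for illustration; a real controller would work from the measured pH response to each addition.

<syntaxhighlight lang="python">
# Minimal sketch (assumed control rule): the titrant step shrinks as the
# measured pH approaches the endpoint, which is the kind of adaptive
# behavior a smartphone-controlled titrator could automate.

def titrate(read_ph, dispense, target_ph=7.0, max_step=1.0, min_step=0.01):
    volume = 0.0
    ph = read_ph(volume)
    while ph < target_ph:
        # Smaller additions as we approach the endpoint
        step = max(min_step, min(max_step, (target_ph - ph) * 0.5))
        volume += step
        dispense(step)
        ph = read_ph(volume)
    return volume, ph

# Stub instrument: pH rises linearly with titrant volume (hypothetical curve)
curve = lambda v: 3.0 + 0.9 * v
volume, ph = titrate(read_ph=curve, dispense=lambda s: None)
print(f"endpoint at {volume:.2f} mL, pH {ph:.2f}")
</syntaxhighlight>
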
 
==Laboratory work and scientific production==
'''Key point''': ''Laboratory work is a collection of processes and procedures that have to be carried out for research to progress, or to support product manufacturing and production processes. In order to meet “productivity” goals, get enough work done to make progress, or provide a good ROI, we need labs to transition from manual to automated (partial or full) execution as much as possible. We also need to ensure and demonstrate process stability, so that lab work itself doesn’t create variability and unreliable results.''


==Principles of laboratory systems engineering==
The laboratory systems engineer (LSE) is someone able to understand and be conversant in both the laboratory science and IT worlds, relating them to each other to the benefit of lab operation effectiveness while guiding IT in performing their roles. A fully dedicated LSE will understand a number of important principles:

# Knowledge, information, and data should always be protected, available, and usable.
# [[Data integrity]] is paramount.
# Systems and their underlying components should be supportable, meaning they are proven to meet users' needs (validated), capable of being modified without causing conflicts with results produced by previous versions, documented, upgradable (without major disruption to lab operations), and able to survive upgrades in connected systems.
# Systems should be integrated into lab operations and not exist as isolated entities, unless there are overriding concerns.
# Systems should be portable, meaning they are able to be relocated and installed where appropriate, and not restricted to a specific combination of hardware and/or software that can’t be duplicated.
# There should be a smooth, reproducible (bi-directional, if appropriate), error-free (including error detection and correction) flow of results, from data generation to the point of use or need.

===Different kinds of laboratory work===
There are different kinds of laboratory work. Some consist of a series of experiments or observations that may not be repeated, or are repeated with expected or unplanned variations. Other experiments may be repeated over and over, either without variation or with controlled variations based on gained experience. It is that second set that concerns this writing, because repetition provides a basis and a reason for automation.

Today we are used to automation in lab work, enacted in the form of computers and robotics ranging from mechanical arms to auto-samplers, auto-titration systems, and more. Some of the technology development we've discussed began many decades before, and there's utility in looking back that far to note how technologies have developed. However, while there are definitely forms of automation and robotics available today, they are not always interconnected, integrated, or compatible with each other; they were produced as products either without an underlying integration framework, or with one that was limited to a few cooperating vendors. This can be problematic.

There are two major elements to repetitive laboratory methods: the underlying science and how the procedures are executed. At these methods’ core, however, is the underlying scientific method, be it a chemical, biological, or physical sequence. We automate processes, not things. We don’t, for example, automate an instrument, but rather the process of using it to accomplish something. If you are automating the use of a telescope to obtain the spectra of a series of stars, you build a control algorithm to look at each star in a list, have the instrument position itself properly, record the data using sensors, process it, and then move on to the next star on the list. (You would also build in error detection with messages like "there isn’t any star there," "cover on telescope," and "it isn’t dark yet," along with relevant correction routines.) The control algorithm is the automation, while the telescope and sensors are just tools being used.
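
To make "automate the process, not the thing" concrete, here is a minimal sketch of such a control loop. The telescope and sensor functions are hypothetical stubs standing in for a real instrument API; the loop and its error handling are the automation.

<syntaxhighlight lang="python">
# Minimal sketch: the control algorithm is the automation; the instrument
# calls below are hypothetical stubs, not a real telescope API.

def is_dark():               return True    # stub: observatory conditions
def cover_removed():         return True    # stub: instrument state check
def slew_to(coordinates):    pass           # stub: position the telescope
def target_acquired(star):   return True    # stub: confirm a star is in view
def record_spectrum(star):   return [1.0]   # stub: capture raw sensor data
def process_spectrum(raw):   return sum(raw)  # stub: reduce raw data

def observe(targets):
    """Run the observing process over a target list with error detection."""
    results, errors = {}, []
    for star, coordinates in targets:
        if not is_dark():
            errors.append((star, "it isn't dark yet"))
            continue
        if not cover_removed():
            errors.append((star, "cover on telescope"))
            break  # no point continuing until the cover is removed
        slew_to(coordinates)
        if not target_acquired(star):
            errors.append((star, "there isn't any star there"))
            continue
        results[star] = process_spectrum(record_spectrum(star))
    return results, errors

print(observe([("Vega", (279.2, 38.8)), ("Deneb", (310.4, 45.3))]))
</syntaxhighlight>
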
When building a repetitive laboratory method, the method should be completely described and include aspects such as:
*the underlying science
*a complete list of equipment and materials when implemented as a manual process
*possible interferences
*critical facets of the method
*variables that have to be controlled and their operating ranges (e.g., temperature, pH of solutions, etc.)
*special characteristics of instrumentation
*safety considerations
*recommended sources of equipment and sources to be avoided
*software considerations such as possible products, desirable characteristics, and things to be avoided
*potentially dangerous or hazardous situations
*an end-to-end walkthrough of the methods used
Of course, the method has to be validated. You need to have complete confidence in its ability to produce the results necessary given the input into the process. At this point, the scientific aspects of the work are complete. They may be revisited if problems arise during process automation, but no further changes should be entertained unless you are willing to absorb additional costs and alterations to the schedule. This is a serious matter; one of the leading causes of project failure is “scope creep,” which occurs when incremental changes are made to a process while it is under development. The project becomes a moving target, with seemingly minor changes able to cause major disruptions in the project's design.
===Scientific production===
At this point, we have a proven procedure that we need to execute repeatedly on a set of inputs (samples). We aren’t carrying out "experiments"; that word suggests that something may be changing in the nature of the procedure, and at this point it shouldn’t. Changes to the underlying process invite a host of problems and may forfeit any chance at a successful automation effort.
However, somewhere in the (distant) future something in the process is likely to change. A new piece of technology may be introduced, equipment (including software) may need to be upgraded, or the timing of a step may need to be adjusted. That’s life. How that change is made is very important. Before it is implemented, the process has to be in place long enough to establish that it works reliably, that it produces useful results, and that there is a history of running the same reference sample(s) over and over again in the mix of samples to show that the process is under control with acceptable variability in results (i.e., statistical process control). In manufacturing and production language, this is referred to as "evolutionary operations" (EVOP). But we can pull it all together under the heading “scientific production.”{{Efn|In prior writings, the term “scientific manufacturing” was used. The problem with that term is that we’re not making products but instead producing results. Plus “manufacturing results” has some negative connotations.}}
If your reaction to the previous paragraph is “this is a science lab not a commercial production operation,” you’re getting close to the point. This is a production operation (commercial or not, it depends on the type of lab, contract testing, and other factors) going on within a science lab. It’s just a matter of scale.
===Demonstrating process stability: The standard sample program===
One of the characteristics of a successfully implemented stable process is consistency of the results with the same set of inputs. Basically, if everything is working properly, the same set of samples introduced at the beginning should yield the same results time after time, regardless of whether the implementation is manual or automated. A standard sample program (Std.SP) is a means of demonstrating the stability and operational integrity of a process; it can also tell you if a process is getting out of control. Introduced early in my career, the Std.SP was used to show the consistency of results between analysts in a lab, and to compare the performance of our lab and the quality control labs in the production facilities. The samples were submitted and marked like any similar sample and became part of the workflow. The lab managers maintained control charts of the lab's and each individual's performance on the samples. The combined results would show whether the lab and those working there had things under control or if problems were developing. Those problems might be with an individual, a change in equipment performance, or a change in incoming raw materials used in the analysis.
The Std.SP was the answer to the typical “how do you know you can trust these results?” question, which often came up when a research program or production line was having issues. This becomes more important when automation is used and the sampling and testing throughput increases: if an automated system is having issues, you are just producing bad data faster. The Std.SP is a high-level test of the system's performance. A deviation in the results trend line or a widening of the variability is an indication that something is wrong, which should lead to a detailed evaluation of the system to account for the deviation. A troubleshooting guide should be part of the original method description (containing aspects such as “if something starts going wrong, here’s where to look for problems” or “the test method is particularly sensitive to …”), along with notes made during the implementation process.
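
A minimal sketch of the control-chart arithmetic behind such a program follows, using invented reference-sample results and conventional three-sigma limits; real programs would also watch for trends and runs, not just single excursions.

<syntaxhighlight lang="python">
# Minimal sketch (assumed data): flag a standard-sample result that falls
# outside control limits built from that sample's history.

from statistics import mean, stdev

def control_limits(history, k=3):
    """Center line and +/- k-sigma limits from historical results."""
    m, s = mean(history), stdev(history)
    return m, m - k * s, m + k * s

def check_result(value, history):
    center, lcl, ucl = control_limits(history)
    if not (lcl <= value <= ucl):
        return f"out of control: {value:.2f} outside [{lcl:.2f}, {ucl:.2f}]"
    return "in control"

# Monthly results for the same reference sample (hypothetical, % purity)
history = [99.1, 99.3, 99.0, 99.2, 99.1, 99.4, 99.2, 99.0, 99.3, 99.1]
print(check_result(99.2, history))   # in control
print(check_result(97.8, history))   # out of control -> investigate
</syntaxhighlight>
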
Reference samples are another matter, however. They have to be stable over a long enough period of use to establish a meaningful trend. If the reference samples are stable over a long period of time, you may only need two or three so that the method can be evaluated over different ranges (you may have a higher variability at a lower range than a higher one). If the reference samples are not stable, then their useful lifetime needs to be established and older ones swapped out periodically. There should be some overlap between samples near the end of their useful life and the introduction of new ones so that laboratorians can differentiate between system variation and changes in reference samples. You may need to use more reference samples in these situations.
This inevitably becomes a lot of work, and if your lab utilizes many procedures, it can be time-consuming to manage a Std.SP. But before you decide it is too much work, ask yourself how valuable it is to have a solid, documented answer to the “how do you know you can trust these results?” question. Having a validated process or method is a start, but validation only speaks to the method as it was at implementation. If all of this is new to you, find someone who understands process engineering and statistical process control and learn more from them.
==Where is the future of lab automation going to take us?==
'''Key point''': ''There is no single “future” for laboratories; each lab charts its own path. However, there are things they can all do to be better prepared to meet the next day, month, or decade. Plan for flexibility and adaptability. Keep your data and information accessible, educate your personnel to recognize and take advantage of opportunities, and don’t be afraid of taking risks.''
Where is the future of lab automation going to take us? The simple answer is that the future is in your hands. Lab personnel are going to be making their own choices, and that is both a reflection of reality and part of the problem. Aside from a few industries, labs have approached the subject of automation individually, making their own plans and charting their own courses of action. The problem that approach engenders is that it is both inefficient and wasteful of resources (e.g., funding, time, effort).
Earlier we saw how the clinical chemistry industry handled the problem of containing costs through automation: it made the automation work better. Within any given industry, it’s unlikely that the same type of testing is going to differ markedly; there will be unique demands and opportunities, but they will be the exception. Why not pool resources and solve common problems? It would benefit the end-user in the lab as well as the vendor, encouraging those vendors to build products for a specific market rather than meeting the needs of a single customer. Among the biggest issues in automation and lab computing is communications between devices; the clinical chemistry industry has solved that. Their solution has elements that are specific to their work but they should be easily adjusted to meet the needs of other groups.
We’re in a phase where we have many technologies that work well independently for their intended purpose, but they don't work well together. We’ve been there before, with two examples to be found in computer networking and computer graphics.
Computer networks began to emerge commercially in the early 1970s. Each computer vendor had their own approach to networking (similar hardware but different communications protocols). Within each vendor's ecosystem things worked well, yet bridging across ecosystems was interesting. When network installations occurred at customer sites, the best hardware and software people were involved in planning, laying out the systems, installing software, and getting things to work. Installation time was measured in weeks. Today, a reasonably sophisticated home or business network can be installed in an hour or two, maybe longer depending on the number of machines and components involved, and when you turn it on, you’d be surprised if it didn’t work. What made the difference? Standards were developed for communications protocols. Instead of each vendor having their own protocol, the world adopted TCP/IP and everything changed.
The path of computer graphics development provides further insight. In the 1970s and 80s, when computer graphics hardware began seeing widespread use, every vendor in the market had their own graphics display hardware and software. If you liked one vendor's graphics library, you had to buy their hardware. If you liked their hardware, you had to live with their software. We saw this play out in the early days of the PC, when there were multiple graphics adapter cards (e.g., enhanced graphics adapter [EGA], color graphics adapter [CGA], video graphics array [VGA], Hercules, etc.), and if you had the wrong card installed, the software wouldn’t work (although combination cards allowed the user to switch modes and reduce the number of boards used in the computer). This continued to frustrate users and developers until a standardized, device-independent graphics architecture was developed. Problem solved.
This is a common theme that comes up over and over. A new technology develops, and vendors build products with unique features that don’t play well with each other (i.e., they attempt to achieve a strong market position). Customers get frustrated, standards are developed that convert “don’t play well” to "easy integration," and the market takes off.
How the future of lab automation plays out for you is in your hands, but there are things we can do to make it easier. As we noted earlier, few if any of the developments in lab automation were initially done according to a plan, though we’ve also seen what planning can actually do in at least one setting, the field of clinical chemistry. This highlights the fact that not only do you need a plan for your laboratory, but a plan for automation in your industry would also be worthwhile; it would avoid spending resources individually where pooling is a better approach, and it would give you a stronger voice to work with vendors to create products that meet your industry's needs. A set of industry-wide strategies (e.g., for mining, viniculture, cosmetics, materials research, healthcare, biotech, electronics, etc.) would give you a solid starting point for the use of computing and automation technologies in your lab. There would still be plenty of room for unique applications.
Industry working groups are one need, while getting people educated is another. Determining how much you are willing to rely on automation is an additional point, which we’ll illustrate with a discussion of quality control labs, in the next section.
Before we move on to the next section, however, there are two terms that haven’t been mentioned yet that need to be noted: [[artificial intelligence]] (AI) and big data. I’m not sure I know what AI is (the definition and examples change frequently), or that I’d trust what is available today for serious work. For every article that talks about the wonders it will bring, there is one that talks about the pitfalls of design, built-in bias, and concerns about misapplication. In science, aside from using it as a “did you know about this?” advisory tool, I’d be concerned about using it. If you don’t know how the AI arrived at its answer (one of AI’s characteristics), why would you trust it? Big data, on the other hand, is on a more solid technical footing and should prove useful, though only if your data and information are structured to take advantage of it. You can’t just say “here are all my datasets”; you have to organize them to be usable.
==Applying automation to lab work==
'''Key point''': ''Today, people’s attitudes about work have changed, and with it the world has changed; the [[COVID-19]] [[pandemic]] has forced us to re-evaluate models of work. If nothing else, even though we’re talking about science, the use of technology to conduct scientific work is going to take on a new importance. This won’t be solved one lab at a time, but on an industry-by-industry basis. We need to think about key concepts and their implementation as a community if truly effective solutions are to be found and put into practice.''
In [[LII:Considerations in the Automation of Laboratory Procedures|''Considerations in the Automation of Laboratory Procedures'']], the criteria for "automation readiness" were described. Two significant points were that a laboratory procedure must both be stable and have a duration of use sufficiently long to justify the cost of automation. Beyond that, additional concerns such as safety may come into play, but those two points are the most critical. From those points we can roughly derive the automation needs of research, testing, quality control, and contract laboratories.
'''Research laboratory''': Automation in research is going to be predominately at the task level, i.e., portions of a process being done by automated instruments and devices rather than end-to-end automation. This is due to the two key points noted above: research labs often change procedures, with some being very short-lived, and the cost of full-automation (including validation) may not be worth it unless the equipment is designed to be modular, interconnected, and easily programmed. One exception can be seen with Bayer.<ref name="CNNMicro15">{{cite web |url=https://www.youtube.com/watch?v=_PuTKH7143c |title=Micro robots drive Bayer's high-tech vision |author=CNN Business |work=YouTube |date=17 April 2015 |accessdate=17 November 2021}}</ref> They are using robotics and microplate technology to automate a high-throughput screen system that couldn’t—in their words—be done practically by people; it's too slow and too expensive. Could something like this become more commonplace?
'''Testing laboratory''': Non-routine testing laboratories are going to have a mix of task-level and end-to-end automation. It will depend on the stability of the test process, whether there is sufficient work to justify the cost of automation, and what level of control over the form of the incoming samples is available (see Appendix 1, section 1.3 of this guide). Often samples are submitted for multiple tests, which raises an additional concern: is the last step in a procedure destructive to the sample? Samples may be prepared so that they can be used in more than one test procedure. If the test process is destructive, then the initial preparation has to be divided so there is enough material for all testing; this can complicate the automation process.
'''Quality control laboratory''': [[Quality control]] (QC) labs should be the ideal place for automation to take hold. The procedures are well defined, stable, and entrenched. The benefits in faster throughput and cost reduction should be clear. The biggest potential for problems is in sample preparation and getting material ready for analysis. Given that the form of the samples is known and predictable, it is worthwhile to spend time analyzing and designing a means of working with samples. Some preparation steps may still be too difficult to justify automation; this can be seen with tensile testing of polymers and fibers, where the fabrication of sample parts and the need to wait for the material to stabilize may make parts of the process difficult to deal with, or not financially justifiable. Insertion into the test apparatus may also be an issue. Testing that can be done by spectroscopic, chromatographic, or other easily automated instrumentation may be migrated from the laboratory to in-line testing, raising questions about the nature of the QC process.
'''Contract or independent laboratory''': The ability to apply automation beyond the task level depends on the kind of work being done; is it specialized or highly flexible? The greater the flexibility (e.g., a number of different procedures with a small number of samples submitted at a time) the more task level automation will apply. However, the justification for automation may not be there. Specialized labs (e.g., clinical chemistry labs) or a lab that specializes in a technique with a high demand and throughput should benefit from a move to end-to-end automation since it reduces costs and improves accuracy and turn-around times (TATs).
===More on the quality control lab and its process management===
The evolution of QC labs may change our perspective of how QC testing is accomplished, as well as the nature of QC lab operations. Our current view, and that of the last few decades, may change significantly.
====Early QC testing====
Some form of quality testing was done during the production process in prior decades. For example, I’ve seen production station operators bite plastic materials to get an idea of how well the production process (the expansion of polystyrene beads) was going in the production of Styrofoam cups, though this was not encouraged by management, as butane and pentane, used as expansion agents, could cause health problems. If the cup was too stiff or soft, it would indicate a problem in the molding process. There were other “quick and dirty” production floor tests that were run because the TAT in the lab was producing an unacceptable delay (if you’re making adjustments, you need measurements quickly). People may also have wanted to avoid having a record of production problems and changes.
In another application (anecdotal reporting from a course on plastics manufacturing)—calendering{{Efn|Wikipedia [[wikipedia:Calender|says this]] of calendering: "A calender is a series of hard pressure rollers used to finish or smooth a sheet of material such as paper, textiles, or plastics. Calender rolls are also used to form some types of plastic films and to apply coatings."}} of plastic coated paper—the production operator would strike the laminated paper with a slender bamboo rod and listen to the sound it made to determine the product quality. According to the course instructor, he was adept at this, and no other test could replace this in-process procedure.
As such, it's clear not all "quality" testing went on in the laboratory.
====Normal QC testing====
Traditionally, QC lab operations became separate from production, partly because of the physical conditions needed to conduct the work, and partly to avoid bias in the results. The conclusion: it was important that the reporting lines of communication be separate and independent from production. QC labs would, and do, perform testing on incoming materials to certify them suitable for use and produced products to see if they met product specifications, as well as perform in-process testing. They would also certify products as qualified for shipment, acting as an unbiased evaluator of product quality.
Quality tests were manually implemented until the early 1960s. Then we saw the advent of in-process instrumentation and process chromatographs by Fisher Controls and Mine Safety Appliances, for example. While the instruments themselves were on the production floor, their management and analysis were under the control of the QC lab, at least in the facility I worked in. The instruments' maintenance, calibration, peak measurements, and sample calculations were all done by lab personnel. Since that time, we’ve seen continued growth in in-line testing for production processes. That said, what’s the logical conclusion of increasing automation?
====A thought experiment====
Let's posit that we have a production process whose raw materials are fluids and whose end product is a fluid. We’d be concerned with certifying incoming raw materials as suitable for use, while monitoring the production process for product composition and potential contaminants. The end product would have to be certified for shipment.
If all the testing were chromatographic with in-process instruments and a [[chromatography data system]] (CDS), and/or in-line spectrophotometric and other in-line or in-process tests, with the results becoming part of the process control system (sample IDs would be a hash of sensor type-location-timestamp), what would the role of the QC lab become? Assuring that the equipment was running properly and regularly calibrated, with periodic verification of test results (off-line testing) is one set of possibilities. Is this a desirable developmental direction? We need to look at the benefits and issues that result from this design.
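<!-- -->
As an aside, the sample ID scheme mentioned above is easy to sketch; the hash function choice (SHA-256) and the field layout below are assumptions for illustration.

<syntaxhighlight lang="python">
# Minimal sketch (assumed layout): a sample ID derived from sensor type,
# location, and timestamp, as suggested for in-process instruments.

import hashlib

def sample_id(sensor_type, location, timestamp):
    """Deterministic ID for an in-process measurement."""
    key = f"{sensor_type}|{location}|{timestamp}"
    return hashlib.sha256(key.encode("utf-8")).hexdigest()[:16]

print(sample_id("GC", "line-3/reactor-outlet", "2021-11-17T14:05:00Z"))
</syntaxhighlight>
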
'''Benefits''':
*It provides an integrated system where all process sensor and test result data are immediately available to management. This allows them to detect issues faster and provide a more timely response (better process control), reducing down time and off-spec product. If we want to enter the world of science fiction, we can even imagine a combination AI-[[machine learning]] solution providing closed loop process monitoring and control.
*It signifies a potential cost reduction resulting from smaller labs and lower personnel costs.
'''Issues''':
*There's a loss of independent product evaluation. Basically, you are trusting the system to honestly monitor, report, and reject off-spec incoming and outgoing material. (In the movie version, this is where the soundtrack becomes ominous.) This loss of independent checking may reduce customer confidence in product quality.
*The veracity of statistical process control and correction may suffer.
*System validation could be challenging as the production process has to be validated, each sensor and instrument data system has to be validated, and the integrated system has to be validated, including the statistical process control.
Assuming a system were built, how far are we willing to trust automated systems to function without external oversight? The control room would still be populated with people managing the process and exerting a higher level of scrutiny, but you are still trusting the designers of the system to do very sophisticated work, not only in process design but also in integrating testing as part of the effort. Ideally, you’d want the CDS and other instrument data processing equipment in the control room, where it is more easily used and maintained, rather than on the process floor. The role of the QC lab would then change to that of an overarching quality manager of the entire system, ensuring that equipment functioned properly, testing was accurate, the process and testing were operating within control limits, and the data logs were correct.
Some organizations may be past this point, while for others this may be interesting, bordering on science fiction. The point of this thought experiment is to see what could happen and where your comfort level is with it. How much control do you want to give an automated system and how much do you want to retain? What are the consequences of not providing sufficient oversight? How much bad product could be made?
Also note that this isn’t an all-or-nothing proposition; give it some room to work, see what happens, and if everything is good, give it more. Just build in a big red button that allows you to reboot and revert to manual operations; in other words, don't self-destruct, just remove some critical controls. A lot depends on the nature of the finished product. If the end product is something critical (e.g., a [[medical device]] or therapeutic), you’ll want to be cautious about phasing in automated control systems.
All that said, two additional points should be made. First, be willing to play with the ideas. Turn it into a science fiction project (“sci fi” is just a playground for "what ifs"), removed from reality enough that people can look at it from different perspectives and see what might work and what might not. Then let people play with those ideas; you might learn something. What are all the things that could go right, and what could go wrong (and what can you do about it)? You probably won't have to worry about alien robots, but malware interference is certainly a factor to consider, as is whether to air-gap the network. There is always the possibility of someone causing a problem; the question, of course, is how do you detect and correct it. Second, be willing to model the system. There are a number of modeling packages ideal for this purpose. You can model the behavior and see how different control methods react.
(While this thought experiment used a process involving fluids only, as they are relatively easy to work with, it's worth noting that solid materials become more of an issue, complicating the automation process [see Appendix 1, section 1.4 of this guide]. In some cases sample preparation for testing may be too cumbersome for automation. This would shift the automated-manual testing balance more toward the latter in those cases, introducing delays and disrupting the timing of results to process control.)
==The creation of a "center for laboratory systems engineering"==
'''Key point''': ''Throughout this piece, the need for education has been a consistent theme. Developing and using the technologies in lab work, both scientific and informatics-related, will require people who know what they are doing, specifically educated to carry out the work noted above. We also need a means of pulling things together so that there is a centralized resource to start a learning process and continue development from there.''
Let's propose a "center for laboratory system engineering." This center would firstly prepare people to be effective planners, designers, implementers, supporters, and users of laboratory informatics and automation systems in scientific applications. Additionally, the center would ideally drive innovation and provide assistance to scientific personnel and IT groups seeking to apply and manage such laboratory technologies.
Those goals would be accomplished by:
*Developing and delivering courses for LSEs, lab personnel, and IT support (These courses would cover technical science topics as well as skills in working with people, conflict resolution, and communications. They would be presented both in-person and online or on-demand to reach a broad audience; an intensive summer course with hands-on experience should also be considered.)
*Creating an LSE certification program
*Carrying out research on the application of informatics, robotics, and computer-assisted data collection and processing
*Documenting the best practices for an LSE
*Aggregating and publishing material on the roles and requirements of the LSE
Ideally, this center would be part of a university setting so that it could interact with both science and computer science departments, contribute to their programs, and in turn gain from that association.
==Appendix 1: A very brief historical note==
It would be useful to understand how we arrived at our current state in regards to informatics and automation in science. That will make it easier to understand what we need to do to make advancements. There is one key point to take away from this: in the history of lab automation, products weren’t developed according to a grand plan{{Efn|See the discussion on clinical chemistry in the main text.}} but rather to meet perceived needs and opportunities. Thought processes in this vein have likely included:
*“Here’s a problem that needs to be solved.”
*“If I can figure out how to do X, then I can accomplish Y.”
*“There’s a business opportunity in building product concept X, which will help people do Y and Z."
Sometimes these ideas were voiced by lab personnel, but most of the time they were the result of someone seeing a problem or opportunity and taking the initiative to address it.
===1.1 Collecting data from instruments===
In the late 1960s and early 1970s, instrument companies recognized that connecting a computer to the analog output of an instrument would help lab personnel capture and process data. The form that this product development took depended on how the developer saw the problem. We’re going to look at chromatography{{Efn|If you’re not familiar with the method, Wikipedia's [[chromatography]] page is a good starting point.}} as an example for several reasons: it received the most attention for automation, it’s a data-rich technique that took considerable manual effort to analyze, and it was and still is one of the most popular instrumental techniques in chemical labs. The product solutions provided by the vendor reflected the technology available and their view of the problem that needed to be solved.
The analysis (Figure A1) depends on recognizing and quantifying a series of peaks, each of which represents the amount of a component in a mixture of materials. The size of the peak (measured by area, the better choice, or by height with respect to a baseline) helps quantify the amount, and the time it takes the peak to appear can be used to help identify the component.
[[File:FigA1 Liscouski DirectLabSysOnePerPersp21.png|560px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="560px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure A1.''' Illustration of peaks from chromatograph. Source: public domain</blockquote>
|-
|}
|}
Reference standards are prepared and run along with the samples. The peak response of the standards and their corresponding concentrations are used to draw a calibration curve, and the samples are quantified by comparing peak sizes against that curve (Figure A2).
[[File:FigA2 Liscouski DirectLabSysOnePerPersp21.png|560px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="560px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure A2.''' Samples are quantified by comparison to a calibration curve</blockquote>
|-
|}
|}
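
For readers who want the arithmetic spelled out, here is a minimal sketch of the calibration-curve step, using invented standards data and an ordinary least-squares fit; real systems would also report fit quality and handle nonlinear responses.

<syntaxhighlight lang="python">
# Minimal sketch (assumed data): fit a line to standards (concentration vs.
# peak area), then invert it to quantify samples, as in Figure A2.

# Standards: (concentration in mg/L, measured peak area) -- hypothetical
standards = [(1.0, 1520.0), (2.0, 3070.0), (5.0, 7610.0), (10.0, 15200.0)]

n = len(standards)
sx = sum(c for c, _ in standards)
sy = sum(a for _, a in standards)
sxx = sum(c * c for c, _ in standards)
sxy = sum(c * a for c, a in standards)

# Least-squares line: area = slope * concentration + intercept
slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
intercept = (sy - slope * sx) / n

def quantify(peak_area):
    """Invert the calibration line to get a concentration."""
    return (peak_area - intercept) / slope

print(f"sample with peak area 4560 -> {quantify(4560.0):.2f} mg/L")
</syntaxhighlight>
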
The solutions reflected both the user’s input and the vendor's observations, sometimes hampered by being too close to the problem and not seeing the whole picture. In that same timeframe, computing was expensive, and you had to have a lot of processing done to justify the costs; otherwise, you dropped down to microprocessors and scaled back the size of the problem you could tackle. The microprocessor of choice was the Intel 4004, which was superseded by the Intel 8008 in 1972.
With computing, the chromatographer could detect peaks and quantify the peak height and area, later printing the results on a strip of paper. This was a big help to the chromatographer since determining peak size, and area in particular, was a major hassle. Prior to computerized methods, chromatographers were using:
*mechanical integrators built into the strip chart recorder (that recorded the chromatograph output), which were hard to read and didn’t provide for baseline corrections (critical for accurate results);
*a planimeter, which was time-consuming and demanded careful attention;
*a cut-and-weigh method, where the chromatographer literally cut out a copy of the peak and weighed it on a balance, cutting out a reference area, and comparing it to a calibration chart constructed from a similar procedure; and
*a method that had the chromatographer counting the boxes the peak enclosed on the chart paper, which was not very accurate, though better than peak height in some cases.
Having the quantified peaks printed on a piece of paper meant that the analyst could move quickly to drawing the calibration curve and evaluating the results. However, from the standpoint of being a laborious, time-consuming task, this addressed only a piece of the problem (a major one, but only a part). Some users connected the output of the integrators (RS-232 serial ASCII) to minicomputers, transferred the integrator information, and completed the analytical process in the larger machines. Several integrators could be connected to a minicomputer, so the cost per instrument was reduced. This was a step toward a better solution, but only a step.
Computer vendors wanted to participate in the same market, but the cost of minicomputers put them at a disadvantage unless they could come at it from a different perspective. They looked at the entire data processing problem, including the points mentioned above, plus getting the computer to compute the calibration curve, complete the analysis, print out a report, and store the report and the data for later retrieval (integrators didn’t have that capability). They could also store the raw digitized instrument output for later analysis and display. The cost per instrument dropped when computer vendors began connecting multiple instruments to one system, some from different labs. Nelson Analytical used a local-to-the-instrument box for data collection and control and then forwarded those information packets to a central system for processing. This bigger-picture view of the problem greatly reduced the workload on the analyst. As computing costs dropped and the power of microchips increased, several different approaches emerged from different vendors that had varied perspectives on computing. Most took the big-picture view but worked on a one-instrument-one-computer-station approach, which benefited small labs since they didn’t have to invest in a minicomputer.
The low-cost of microprocessors more readily allowed digital systems to join the lab, to the point where almost every lab device had a computer chip in it (“digital” was a strong marketing point). Now that we had lots of digital data sources, what was the lab going to do with them?
===1.2 The beginning of laboratory informatics systems===
In the early 1970s, [[Vendor:PerkinElmer Inc.|PerkinElmer]] described its instrument supporting computers as “data stations.” Then they announced the “laboratory information management system” or "LIMS," and the next level of informatics hit the lab market. However, “laboratory information management system” was a problematic name for a product that did sample and test tracking. The customers thought it would take into account all of a lab's information, including personnel records, scheduling, budgets, documents, anything that “information” could be attached to, ultimately promising more than it could deliver. It took some time, but eventually something like that happened. From a marketing standpoint, it got people’s attention.
Several other vendors, consulting companies, startups, and computer vendors began LIMS development projects (computer vendors felt that database systems were their turf). This was viewed as a strategic offering: the testing lab's operations would revolve around the LIMS, and that gave the LIMS vendor whose product was chosen a strong marketing position in that company.
The introduction of the LIMS eventually opened the door to other informatics applications. The major classes and functions of said applications are shown in Figure A3.
[[File:FigA3 Liscouski DirectLabSysOnePerPersp21.png|640px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="640px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure A3.''' The starting point for major classes of lab informatics systems and the lab functions they addressed</blockquote>
|-
|}
|}
Missing from the chart are user-development tools like LabVIEW from National Instruments that enabled users to develop data acquisition and control applications via a graphical user interface.
The term “electronic laboratory notebook” or "ELN" has had an interesting history. It’s been applied to at least three types of software before its most current iteration. Laboratory Technologies, Inc. first created LABTECH Notebook, a PC-based software package designed to assist users with communication between a computer and its lab instruments via RS-232 connections. Then there was Wolfram's Mathematica software, an electronic notebook for advanced mathematics. And finally there was Velquest's SmartLAB, an ELN for conducting analyses and recording information from laboratory procedures, becoming the first [[laboratory execution system]] (LES).
Figure A3 showed a nice, clean, somewhat idealized starting point for the introduction of lab informatics. That didn’t last long. Vendors saw what their competition was doing and the opportunities to expand their products' capabilities (and market acceptance). What were clean, neat functionality silos became more complex products to attract the attention of scientists and laboratorians (Figure A4).
[[File:FigA4 Liscouski DirectLabSysOnePerPersp21.png|700px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="700px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure A4.''' Lab informatics capability expansion</blockquote>
|-
|}
|}
These expanded capabilities meant that a single vendor solution could address more of a lab’s needs, simplifying support, installation, and so on through the use of a singular software package. It did, however, make product and vendor selection more of a challenge; you really had to know what you needed. It also raised questions of cross-product compatibility: what instruments connected easily to what LIMS, ELN, or SDMS? If it wasn’t easy, what did you do? Third-party intermediate systems were eventually developed to translate instrument communications, much as intermediaries had done for database systems.
While all this was going on in the expanding digital laboratory space, people still had things to do on the lab bench.
===1.3 Electro-mechanical robotics===
During the 1970s and 80s, vendors noted the amount of time spent doing repetitive tasks at the lab bench. This resulted in two different approaches to product development:
*Dedicated-function robotics: The development of auto-samplers, auto-titrators, and similar devices that were used to carry out a specific task
*General-purpose robotics: The development of an elaborate kit the user had to assemble to robotically complete a task; not too different from a computer programming language, where the components are there and you just have to organize them to make something useful happen
Although each of these approaches was potentially useful, they each presented the user community with different sets of problems.
Dedicated-function robotics devices generally worked well; that wasn’t the issue. The problems arose when they had to connect to other components and instruments. The use of an auto-sampler, for example, required the user to adjust their sample preparation so that the material to be analyzed wound up in the appropriate sample vials. This may have required adjusting the procedure to accommodate a different set of mechanics than they were used to, e.g., sealing the vials or choosing the proper vial for an application. Auto-samplers feed samples into instruments, so there is the matter of setting up control signals for proper coordination with instruments and instrument data systems.
Other devices such as auto-titrators required a reservoir of samples (in the proper vial formats), and the vials had to be organized so that an analysis on sample ID456 was in fact that sample and that the order didn’t get mixed up. The data output could also be a problem. The default for many vendors was a spreadsheet-compatible file, though it was up to the user to make sense of it and integrate it into the workflow. This was probably the best compromise available until more formalized data interchange and communications standards became available. The vendor had no idea how a particular device was going to fit into a procedure's workflow or what was going to happen to the data further downstream, so a CSV file format as a solution was simple, flexible, and easy to work with. It also meant more work on the user's part on the integration front, representing another place where changes may have to be made if a device is replaced in the future.
With the dedicated-function device, which could be as simple as a balance (reading a force exerted on a sensor and converting it to a weight is an automated process), we have potentially isolated elements of automation that are either interconnected by programming or by someone reading data and re-entering it. However, there is no “plug-and-go” capability.
As for the general-purpose robotics devices, they could be broken down into two categories: those that were successful in the broad marketplace and those that weren't. In the 1980s, three vendors were competing for attention in the laboratory robotics market: Zymark, Hewlett-Packard, and PerkinElmer. Each of their general-purpose systems had a moveable arm that could grip items and be programmed to carry out lab procedures. Yet they faced a daunting task: the process of gripping items was problematic. The robot didn’t work with just one gripper or "hand"; it required multiple interchangeable hands that had to be swapped depending on the next sequence of actions and what items had to be grasped. Those items were typically things designed for human use, including flasks, test tubes, and a variety of other equipment. This problem was non-trivial, a result of having the robot work with equipment designed for humans so that the lab didn’t have to buy duplicate components, which would have raised problems of cost and of the ability to carry on work if the robot didn’t function.

Another issue with the grippers and their holders was that they took up space. The Zymark system, for example, had a central robotic arm that could reach items within the shell of a hemispherical volume; grippers took up space that could be used by other devices. Some people were successful in building systems, but not enough to make the products economically viable. In many respects, the core robotics technologies should have been successful, but the peripheral devices were not up to the necessary operational levels; good for humans, lacking for automation.
Another problem was education. The vendors would run courses to train people to use their systems. However, successful implementations required engineering, expertise, and experience, not just experimentation. Further, robust systems, those capable of running for days on end with built-in error detection and correction, were rare but necessary to avoid processing samples into junk. People had the need and the desire to make working systems, but not the process engineering skills to create successful implementations.
The life sciences market, and biotechnology in particular, took a different approach to robotics: they standardized the sample carrier format in the form of the microplate. The physical dimensions of the microplate were constant while the number of sample cells could vary. The image on the right of Figure A5, for example, shows plates with 96, 384, and 1,536 wells, with even higher densities available.
[[File:FigA5 Liscouski DirectLabSysOnePerPersp21.png|700px]]
{{clear}}
{|
| style="vertical-align:top;" |
{| border="0" cellpadding="5" cellspacing="0" width="700px"
|-
  | style="background-color:white; padding-left:10px; padding-right:10px;" |<blockquote>'''Figure A5.''' A microplate sample holder ('''left'''); examples of 96, 384, and 1,536 well holders, care of [https://commons.wikimedia.org/wiki/File:Microplates.jpg WikiMedia Commons] ('''right''')</blockquote>
|-
|}
|}
What did this mean for robotics? Every device that interacted with a microplate could be reliably designed with a single gripper size and a single aperture size for plates to be inserted. Systems would “know” where in the sample space the sample wells were. In short, standardization made it easier to build equipment for a marketplace. And a lot of equipment was built and put into successful use.{{Efn|See Wikipedia's [[wikipedia:Microplate|microplate]] article for more.}}
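To illustrate what “knowing” where the wells are means in practice, here is a small sketch (Python) that computes nominal well-center coordinates from the shared ANSI/SLAS microplate footprint. The dimensions are the published nominal values as best known; the function and names are ours, for illustration only.

<pre>
import re

# Nominal ANSI/SLAS microplate geometry: one 127.76 x 85.48 mm footprint
# for all densities; only the well pitch and the A1 offset change.
PLATE_FORMATS = {
    96:   {"rows": 8,  "cols": 12, "pitch": 9.0,  "a1": (14.38, 11.24)},
    384:  {"rows": 16, "cols": 24, "pitch": 4.5,  "a1": (12.13, 8.99)},
    1536: {"rows": 32, "cols": 48, "pitch": 2.25, "a1": (11.005, 7.865)},
}

def well_center(wells, label):
    """Return the (x, y) center of a well in mm from the plate's top-left
    corner, e.g., well_center(96, "H12") -> (113.38, 74.24)."""
    fmt = PLATE_FORMATS[wells]
    m = re.fullmatch(r"([A-Za-z]+)(\d+)", label)
    if not m:
        raise ValueError(f"Malformed well label: {label}")
    row = 0
    for ch in m.group(1).upper():   # "A" -> 0 ... "Z" -> 25, "AA" -> 26, etc.
        row = row * 26 + (ord(ch) - ord("A") + 1)
    row -= 1
    col = int(m.group(2)) - 1
    if not (0 <= row < fmt["rows"] and 0 <= col < fmt["cols"]):
        raise ValueError(f"{label} is not a valid well on a {wells}-well plate")
    x0, y0 = fmt["a1"]
    return (x0 + col * fmt["pitch"], y0 + row * fmt["pitch"])
</pre>

Because every plate presents the same footprint and a predictable grid, one such table can serve grippers, washers, dispensers, and readers alike.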
The combination of robotics and microplates also worked well in biotechnology because of the prevalence of liquid samples; this point cannot be stressed enough. We are good at working with fluids, including measuring amounts, transferring them, dispensing them, and so on. Solids can be a problem because of cross-contamination if equipment isn’t cleaned, if the solid is tacky, if the particle size isn’t right for the equipment, if it has a tendency to roll, and so on. Solids also have the potential for problems with homogeneity. (You can get layering in liquids, but that can be solved in “two shakes.”)
====1.3.1 Two approaches to sample processing with robotics====
Broadly speaking, there are two approaches to sample processing with robotics: batching and sample-at-a-time runs. The microplate is an example of batch processing. Each tray is handled (dilution, sealing, reading, etc.) at a station, and you don’t have an opportunity to evaluate the results until the analysis for a tray is completed and reported. Other methods such as ICP, mass spectrometry, chromatography, etc. that use auto-injectors can behave in a similar fashion, or be evaluated on a sample-at-a-time basis. It depends on how the process is planned and implemented, and at what stage the results are evaluated.
The sample-at-a-time procedure offers an interesting alternative pathway for automated analysis. This process can include auto-titrators, in addition to the techniques noted, as well as others. Here each sample is processed in stages either one-at-a-time or in overlapping stages. While one sample is being processed in stage 3 (adding a solvent for example), a sample in stage 4 is being injected into an instrument. This means that the results for one sample can be known before the next one is processed. The benefit is that systematic errors can be caught and corrected before all samples are processed. This would reduce waste and improve overall efficiency.
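The following minimal sketch (Python; the stage functions and QC check are hypothetical stand-ins) shows the payoff of the sample-at-a-time approach: a systematic error surfaces after the first bad result, before the remaining samples are consumed.

<pre>
def run_sample_at_a_time(sample_ids, stages, qc_check):
    """Process samples one at a time through an ordered list of stage
    functions (e.g., dilute -> add_solvent -> inject), evaluating each
    result as it arrives rather than after the whole batch."""
    results = {}
    for sample_id in sample_ids:
        material = sample_id
        for stage in stages:
            material = stage(material)
        results[sample_id] = material
        if not qc_check(material):
            # A systematic error is caught here, sparing the samples that
            # haven't been processed yet; in a batch (microplate) workflow
            # this check couldn't run until the whole tray was done.
            print(f"QC failure at {sample_id}; halting run")
            break
    return results
</pre>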
===1.4 Sample storage organization===
Before we leave the topic of samples, we need to address the subject of sample storage organization. This is important, as poor organization and management can nullify any gains from automation. There are two aspects to consider: the organization process itself and the physical nature of the samples.
In the life sciences, [[Biobank|biobanking]] and refrigerated storage management have been actively discussed topics. For example, a white paper commissioned by Titian Software Ltd. titled ''The Essential Guide to Managing Laboratory Samples'' goes into a fair amount of depth on the subject.<ref name="OxerTheEss17">{{cite web |url=https://www.titian.co.uk/the-essential-guide-to-managing-laboratory-samples-web |title=The Essential Guide to Managing Laboratory Samples |author=Oxer, M.; Stroud, T. |publisher=Titian Software Ltd |date=2017 |accessdate=17 November 2021}}</ref> And if you Google “laboratory sample storage management,” you’ll get a sizeable listing of material, including peer-reviewed work by Redrup ''et al.'' titled "Sample Management: Recommendation for Best Practices and Harmonization from the Global Bioanalysis Consortium Harmonization Team.”<ref name=":0">{{Cite journal |last=Redrup |first=Michael J. |last2=Igarashi |first2=Harue |last3=Schaefgen |first3=Jay |last4=Lin |first4=Jenny |last5=Geisler |first5=Lisa |last6=Ben M’Barek |first6=Mohamed |last7=Ramachandran |first7=Subramanian |last8=Cardoso |first8=Thales |last9=Hillewaert |first9=Vera |date=2016-03 |title=Sample Management: Recommendation for Best Practices and Harmonization from the Global Bioanalysis Consortium Harmonization Team |url=http://link.springer.com/10.1208/s12248-016-9869-2 |journal=The AAPS Journal |language=en |volume=18 |issue=2 |pages=290–293 |doi=10.1208/s12248-016-9869-2 |issn=1550-7416 |pmc=PMC4779093 |pmid=26821803}}</ref> The abstract of the work of Redrup ''et al.'' reads in part<ref name=":0" />:
<blockquote>The importance of appropriate sample management in regulated bioanalysis is undeniable for clinical and non-clinical study support due to the fact that if the samples are compromised at any stage prior to analysis, the study results may be affected. Health authority regulations do not contain specific guidance on sample management; therefore, as part of the Global Bioanalysis Consortium (GBC), the A5 team was established to discuss sample management requirements and to put forward recommendations.</blockquote>
In short, you have to have control of your samples and be able to ensure their integrity.
====1.4.1 The nature of incoming samples====
Considerations about the nature of incoming samples don’t get as much press as they deserve. For example, if you are in quality control, the nature of your incoming samples is going to be consistent and determined by the production process. In fact, sample preparation is likely to be integrated with the test procedure’s automation. If the samples are fluids, the impact on an automation process may be small compared to working with solids. One complicating factor with fluids is the need to remove extraneous material so that downstream problems aren’t created. That removal may be accomplished by filtering, settling, centrifugal separation, batch centrifugation, or other means depending on the amount of material and its composition.
Working with solids raises several other issues. First, does the material have to be reduced to a coarse or fine powder before processing? This may be needed to permit precise weighing of amounts or to provide a large surface area for solvent extractions. Second, is fabrication of a sample piece required? Some mechanical properties testing of plastics requires molding test bars, while other tests may require pressing films for spectral analysis. Other issues exist as well; some are industry-specific, for example working with hazardous materials (including toxic substances and radioactive samples) or with samples requiring careful environmental and/or security controls.
In many labs, automation is viewed as something that happens after the samples are logged in. Yet that doesn’t have to be the case. The following paragraphs focus on testing labs because they are the most likely to benefit from automation. They meet the two key criteria for automation implementation: stable procedures and sufficient workload to justify the cost of automation development. That can also happen in research labs, though; in the end it's simply a matter of the nature of the work.
Testing labs (e.g., quality control and non-routine internal and contract labs) can take steps to streamline their sample handling operations, though at the expense of someone else’s labor (just make it worth their effort). For example, those submitting samples can be required to pre-log their samples. This can be accomplished by giving them access to restricted accounts on a LIMS that let them log samples in, and little more. Sample containers can also be standardized with barcodes. The barcodes can then be required as part of the logging process; they are critical to identifying samples that have reached the lab, as well as tracking their physical location. Additionally, sample container sizes and related container forms can be standardized. These should match the requirements for sample handling in automated systems, if possible. (Unless you supply them, your submitters may not have the tools for sealing sample vials, etc.) Finally, all this cooperative effort to standardize sample reception can be incentivized with price breaks and is likely to lead to faster turn-around time (TAT). In other words, give submitters an incentive to work with you, the lab. They are supplying labor that could potentially impact their productivity, so give them a good reason to work with you. (A minimal sketch of what barcode-driven sample reception might look like appears after the list below.)
Testing operations can, as a result, see further benefits:
* Sample storage management and access is improved.
* You’ll be more informed of incoming work.
* Automation and scheduling is enhanced when it begins with the requester instead of post-login.
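As a minimal sketch of that reception step (Python; the record layout and identifiers are hypothetical), a barcode scan at the receiving desk can confirm a sample was pre-logged before it enters the lab's workflow:

<pre>
# Hypothetical pre-logged entries, created by submitters through their
# restricted LIMS accounts before the samples ever ship.
PRELOGGED = {
    "LAB-000123": {"submitter": "Plant 7", "test": "viscosity", "received": False},
    "LAB-000124": {"submitter": "Plant 7", "test": "ash content", "received": False},
}

def receive_sample(barcode):
    """Mark a pre-logged sample as physically received; unknown barcodes
    are rejected so unlogged samples never enter the workflow silently."""
    entry = PRELOGGED.get(barcode)
    if entry is None:
        raise KeyError(f"{barcode} was never pre-logged; hold at receiving desk")
    entry["received"] = True
    return entry
</pre>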
My first professional lab experience was in polymers in an analytical research group (some routine work, a lot of non-routine work). Samples would arrive in a variety of containers (e.g., bags, jars, test tubes, envelopes, fabricated parts taped to sample request forms). Sample matrices would range from pellets, waxes, and powders to liquids, gas cylinders, rolls of film, and more. Classifying those samples, figuring out where to put them, locating them, and preparing them for work (which often involved grinding them in a Wiley mill) was a major time sink, sufficiently so that the actual analysis was a smaller part of the overall workflow. As such, anything you can do to streamline that process will help productivity and contribute to a successful automation project.


==Abbreviations, acronyms, and initialisms==
'''CDS''': Chromatography data system
'''ELN''': Electronic laboratory notebook
'''EVOP''': Evolutionary operations
'''ICP''': Inductively coupled plasma mass spectrometry
'''K/I/D''': Knowledge, information, and data
'''LIMS''': Laboratory information management system
'''LOF''': Laboratory of the future
'''LSE''': Laboratory systems engineer
'''NMR''': Nuclear magnetic resonance
'''QC''': Quality control
'''ROI''': Return on investment
'''SDMS''': Scientific data management system
'''Std.SP''': Standard sample program
'''TAT''': Turn-around time
'''TLA''': Total laboratory automation

==Footnotes==
{{Reflist|group=lower-alpha}}

==References==
{{Reflist|colwidth=30em}}
<!---Place all category tags here-->
[[Category:LII:Guides, white papers, and other publications]]

While that material covers some of the “where do we go from here” discussions, I want to bring a lot of it together in one spot so that we can see what the entire picture looks like, while still leaving some of the details to the titles above. Admittedly, there have been some changes in thinking over time from what was presented in those pieces. For example, the concept of "laboratory automation engineering" has morphed into "laboratory systems engineering," given that in the past 15 years the scope of laboratory automation and computing has broadened significantly. Additionally, references to "scientific manufacturing" are now replaced with "scientific production," since laboratories tend to produce ideas, knowledge, results, information, and data, not tangible widgets. And as laboratories continue to evolve, more such changes will likely follow.

Of special note are 2019's Laboratory Technology Management & Planning webinars, which provide additional useful background for what is covered in this guide.

Looking forward and back: Where do we begin?

The "laboratory of the future" and laboratory systems engineering

The “laboratory of the future” (LOF) makes for an interesting playground of concepts. People's view of the LOF is often colored by their commercial and research interests. Does the future mean tomorrow, next month, six years, or twenty years from now? In reality, it means all of those time spans coupled with the length of a person's tenure in the lab, and the legacy they want to leave behind.

However, with those varied time spans we’ll need flexibility and adaptability for managing data and information while also preserving access and utility for the products of lab work, and that requires organization and planning. Laboratory equipment will change and storage media and data formats will evolve. The instrumentation used to collect data and information will change, and so will the computers and software applications that manage that data and information. Every resource that has been expended in executing lab work has been to develop knowledge, information, and data (K/I/D). How are you going to meet that management challenge and retain the expected return on investment (ROI)? Answering it will be one of the hallmarks of the LOF. It will require a deliberate plan that touches on every aspect of lab work: people, equipment and systems choices, and relationships with vendors and information technology support groups. Some points reside within the lab while others require coordination with corporate groups, particularly when we address long-term storage, ease of access, and security (both physical and electronic).

During discussions of the LOF, some people focus on the technology behind the instruments and techniques used in lab work, and those technologies will continue to impress us with their sophistication. However, the bottom line of those conversations is the ability to produce results: K/I/D.

Modern laboratory work is a merger of science and information technology. Some of the information technology is built into instruments and equipment, the remainder supports those devices or helps manage operations. That technology needs to be understood, planned, and engineered into smoothly functioning systems if labs are to function at a high level of performance.

Given all that, how do we prepare for the LOF, whatever that future turns out to be? One step is the development of “laboratory systems engineering” as a means of bringing structure and discipline to the use of informatics, robotics, and automation to lab systems.

But who is the laboratory systems engineer (LSE)? The LSE is someone able to understand and be conversant in both the laboratory science and IT worlds, relating them to each other to the benefit of lab operation effectiveness while guiding IT in performing their roles. A fully dedicated LSE will understand a number of important principles:

  1. Knowledge, information, and data should always be protected, available, and usable.
  2. Data integrity is paramount.
  3. Systems and their underlying components should be supportable, meaning they are proven to meet users' needs (validated), capable of being modified without causing conflicts with results produced by previous versions, documented, upgradable (without major disruption to lab operations), and able to survive upgrades in connected systems.
  4. Systems should be integrated into lab operations and not exist as isolated entities, unless there are overriding concerns.
  5. Systems should be portable, meaning they are able to be relocated and installed where appropriate, and not restricted to a specific combination of hardware and/or software that can’t be duplicated.
  6. There should be a smooth, reproducible (bi-directional, if appropriate), error-free (including error detection and correction) flow of results, from data generation to the point of use or need.

But how did we get here?

The primary purpose of laboratory work is developing and carrying out scientific methods and experiments, which are used to answer questions. We don’t want to lose sight of that, or the skill needed to do that work. Initially the work was done manually, which inevitably limited the amount of data and information that could be produced, and in turn the rate at which new knowledge could be developed and distributed.

However, the introduction of electronic instruments changed that, and the problem shifted from data and information production to data and information utilization (including the development of new knowledge), distribution, and management. That’s where we are today.

Science plays a role in the production of data and information, as well as the development of knowledge. In between we have the tools used in data and information collection, storage, analysis, etc. Those tools, and the concepts behind their use, are the subject of this document; they are enablers and amplifiers of lab processes. As in almost any application, the right tools used well are an asset.

Figure 1. Databases for knowledge, information, and data (K/I/D) are represented as ovals, and the processes acting on them as arrows.[1]

One problem with the distinction between science and informatics tools is that lab personnel understand the science, but they largely don't understand the intricacies of information and computing technologies that comprise the tools they use to facilitate their work. Laboratory personnel are educated in a variety of scientific disciplines, each having its own body of knowledge and practices, each requiring specialization. Shouldn't the same apply to the expertise needed to address the “tools”? On one hand we have chromatographers, spectroscopists, physical chemists, toxicologists, etc., and on the other roboticists, database experts, network specialists, and so on. In today's reality these are not IT specialists but rather LSEs who understand how to apply the IT tools to lab work with all the nuances and details needed to be successful.

Moving forward

If we are going to advance laboratory science and its practice, we need the right complement of experienced people. Moving forward to address laboratory- and organization-wide productivity needs a different perspective; rather than ask how we improve things at the bench, ask how we improve the processes and organization.

This guide is about long-term planning for lab automation and K/I/D management. The material in this guide is structured in sections, and each section starts with a summary, so you can read the summary and decide if you want more detail. However, before you toss it on the “later” pile, read the next few bits and then decide what to do.

Long-term planning is essential to organizational success. The longer you put it off, the more expensive it will be, the longer it will take to do it, and the more entrenched behaviors you'll have to overcome. Additionally, you won’t be in a position to take advantage of the developing results.

It’s time to get past the politics and the inertia and move forward. Someone has to take the lead on this, full-time or part-time, depending on the size of your organization (i.e., if there's more than one lab, know that it affects all of them). Leadership should come from the lab side, not the IT side, as IT people may not have the backgrounds needed and may view everything through their organization's priorities. (However, their support will be necessary for a successful outcome.)

The work conducted on the lab bench produces data and information. That is the start of realizing the benefits from research and testing work. The rest depends upon your ability to work with that data and information, which in turn depends on how well your data systems are organized and managed. This culminates in maximizing benefit at the least cost, i.e., ROI. It’s important to you, and it’s important to your organization.

Planning has to be done on at least four levels:

  1. Industry-wide (e.g., biotech, mining, electronics, cosmetics, food and beverage, plastics, etc.)
  2. Within your organization
  3. Within your lab
  4. Within your lab processes

One important aspect of this planning process—particularly at the top, industry-wide level—is the specification of a framework to coordinate product, process (methods), or standards research and development at the lower levels. This industry-wide framework is ideally not a “this is what you must do” mandate but rather a common structure that can be adapted to make the work easier. It also serves as a basis for approaching vendors for products and product modifications that will benefit those in your industry, giving vendors confidence that the requests have broader market appeal. If an industry-wide approach isn’t feasible, then larger companies may group together to provide the needed leadership. Note, however, that this should not be perceived as an industry/company vs. vendor effort; rather, this is an industry/company working with vendors. The idea of a large group effort is to demonstrate a consensus viewpoint and that vendors' development efforts won’t be in vain.

The development of this framework, among other things, should cover:

  • Informatics
  • Communications (networking, instrument control and data, informatics control and data, etc.)
  • Physical security (including power)
  • Data integrity and security
  • Cybersecurity
  • The FAIR principles (the findability, accessibility, interoperability, and reusability of data[2])
  • The application of cloud and virtualization technologies
  • Long-term usable access to lab information in databases without vendor controls (i.e., the impact of software as a service and other software subscription models)
  • Bi-directional data interchange between archived instrument data in standardized formats and vendor software, requiring tamper-proof formats
  • Instrument design for automation
  • Sample storage management
  • Guidance for automation
  • Education for lab management and lab personnel
  • The conversion of manual methods to semi- or fully automated systems

These topics affect both a lab's science personnel and its LSEs. While some topics will be of more interest to the engineers than the scientists, both groups have a stake in the results, as do any IT groups.

As digital systems become more entrenched in scientific work, we may need to restructure our thinking from “lab bench” and “informatics” to “data and information sources” and “digital tools for working with, organizing, and managing those elements.” Data and information sources can extend to third-party labs and other published material. We have to move from digital systems causing incremental improvements (today’s approach) to a truly revolutionary restructuring of how science is conducted.

Laboratory computing

Key point: Laboratory systems are planned, designed, and engineered. They are not simply a collection of components. Laboratory computing is a transformational technology, one which has yet to fully emerge, in large part because those who work with computing in laboratories aren’t educated about it well enough to take full advantage of it.

Laboratory computing has always been viewed as an "add-on" to traditional laboratory work. These add-ons have the potential to improve our work, make it faster, and make it more productive (see Appendix 1 for more details).

The common point-of-view in the discussion of lab computing has been focused on the laboratory scientist or manager, with IT providing a supporting role. That isn’t the only viewpoint available to us, however. Another is that of the laboratory systems engineer (LSE), who focuses on data and information flow. This latter viewpoint should compel us to reconsider the role of computing in the laboratory and the higher-level needs of laboratory operations.

Why is this important? Data and information generation may represent the end of the lab bench process, but it’s just the beginning of its use in broader scientific work. The ability to take advantage of those elements in the scope of manufacturing and corporate research and development (R&D) is where the real value is realized. That requires planning for storage, access, and utilization over the long term.

The problem with the traditional point-of-view (i.e., instrumentation first, with computing in a supporting role) is that the data and information landscape is built to support the portion of lab work that is the most likely to change (Figure 2). You wind up building an information architecture to meet the requirements of diverse data structures instead of making that architecture part of the product purchase criteria. Systems are installed as needs develop, not as part of a pre-planned information architecture.

Figure 2. Hierarchy of lab systems, noting frequency of change

Designing an informatics architecture has some things in common with building a house. You create the foundation first, a supporting structure that everything sits on. Adding the framework sets up the primary living space, which can be modified as needed without disturbing the foundation (Figure 3). If you built the living space first and then wanted to install a foundation, you’d have a mess to deal with.

Figure 3. Comparison of foundation and living/working space levels in an organization

The same holds true with laboratory informatics. Set the foundation—the laboratory information management system (LIMS), electronic laboratory notebook (ELN), scientific data management system (SDMS), etc.—first, then add the data and information generation systems. That gives you a common set of requirements for making connections and can clarify some issues in product selection and systems integration. It may seem backwards if your focus is on data and information production, but as soon as you realize that you have to organize and manage those products, the benefits will be clear.

You might wonder how you go about setting up a LIMS, ELN, etc. before the instrumentation is set. However, it isn’t that much of a problem. You know why your lab is there and what kind of work you plan to do. That will guide you in setting up the foundation. The details of tests can be added as needed. Most of that depends on having your people educated in what the systems are and how to use them.

Our comparison between a building and information systems does bring up some additional points. A building's access to utilities runs through control points; water and electricity don’t come in from the public supply to each room but run through central control points that include a distribution system with safety and management features. We need the same thing in labs when it comes to network access. In our current society, access to private information for profit is a fact of life. While there are desirable features of lab systems available through network access (remote checking, access to libraries, updates, etc.), they should be controlled so that those with malicious intent are prevented access, and data and information are protected. Should instrument systems and office computers have access to corporate and external networks? That’s your decision and revolves around how you want your lab run, as well as other applicable corporate policies.

The connections layer in Figure 3 is where devices connect to each other and the major informatics layer. This layer includes two functions: basic networking capability and application-to-application transfer. Take for example moving pH measurements to a LIMS or ELN; this is where things can get very messy. You need to define what that is and what the standards are to ensure a well-managed system (more on that when we look at industry-wide guidelines).
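As a sketch of what that application-to-application function might look like (Python; the message schema and the LIMS intake socket are hypothetical, since defining them is exactly what a standard would have to do), consider:

    import json, socket

    def send_measurement(lims_host, lims_port, sample_id, ph_value):
        """Package a pH reading as a small self-describing message and hand
        it to the LIMS's intake listener. The schema here is made up; the
        point is that the connections layer defines it once, for every
        device in the lab, instead of each instrument script improvising."""
        message = {
            "schema": "lab/measurement/v1",   # hypothetical schema name
            "sample_id": sample_id,
            "quantity": "pH",
            "value": ph_value,
        }
        with socket.create_connection((lims_host, lims_port), timeout=5) as conn:
            conn.sendall(json.dumps(message).encode("utf-8") + b"\n")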

To complete the analogy, people do move the living space of a house from one foundation to another, often making for an interesting experience. Similarly, it’s also possible to change the informatics foundation from one product set to another. It means exporting the contents of the database(s) to a product-independent format and then importing into the new system. If you think this is something that might be in your future, make the ability to engage in that process part of the product selection criteria. Like moving a house, it isn’t going to be fun.[3][4][5] The same holds true for ELN and SDMS.

How automation affects people's work in the lab

Key point: There are two basic ways lab personnel can approach computing: it’s a black box that they don’t understand but is used as part of their work, or they are fully aware of the equipment's capabilities and limitations and know how to use it to its fullest benefit.

While lab personnel may be fully educated in the science behind their work, the role of computing—from pH meters to multi-instrument data systems—may be viewed with a lack of understanding. That is a significant problem because they are responsible for the results that those systems produce, and they may not be aware of what happens to the signals from the instruments, where the limitations lie, and what can turn a well-executed procedure to junk because an instrument or computer setting wasn’t properly evaluated and used.

In reality, automation has both a technical impact on their work and an impact on themselves. These are outlined below.

Technical impact on work:

  • It can make routine work easier and more productive, reducing costs and improving ROI (more on that below).
  • It can allow work to be performed that might otherwise be too expensive to entertain. There are techniques such as high-throughput screening and statistical experimental design that are useful in laboratory work but might be avoided because the effort of generating the needed data is too labor-intensive and time-consuming. Automated systems can relieve that problem and produce the volumes of data those techniques require.
  • It can improve accuracy and reproducibility. Automated systems, properly designed and implemented, are inherently more reproducible than a corresponding manual system.
  • It can increase safety by limiting people's exposure to hazardous situations and materials.
  • It can also be a financial hole if proper planning and engineering aren’t applied to a project. “Scope creep,” changes in direction, and changes in project requirements and personnel are key reasons that projects are delayed or fail.

Impact on the personnel themselves:

  • It can increase technical specialization, potentially improving work opportunities and people's job satisfaction. Having people move into a new technology area gives them an opportunity to grow both personally and professionally.
  • Full automation of a process can cause some jobs to end, or at least change them significantly (more on that below).
  • It can elevate routine work to more significant supervisory roles.

Most of these impacts are straightforward to understand, but several require further elaboration.

It can make routine work easier and more productive, reducing costs and improving ROI

This sounds like a standard marketing pitch; is there any evidence to support it? In the 1980s, clinical chemistry labs were faced with a problem: the cost for their services was set on an annual basis without any adjustments permitted for rising costs during that period. If costs rose, income dropped; the more testing they did the worse the problem became. They addressed this problem as a community, and that was a key factor in their success. Clinical chemistry labs do testing on materials taken from people and animals and run standardized tests. This is the kind of environment that automation was created for, and they, as a community, embarked on a total laboratory automation (TLA) program. That program had a number of factors: education, standardization of equipment (the tests were standardized so the vendors knew exactly what they needed in equipment capabilities), and the development of instrument and computer communications protocols that enabled the transfer of data and information between devices (application to application).

Organizations such as the American Society for Clinical Laboratory Science (ASCLS) and the American Association for Clinical Chemistry (AACC), as well as many others, provide members with education and industry-wide support. Other examples include the Clinical and Laboratory Standards Institute (CLSI), a non-profit organization that develops standards for laboratory automation and informatics (e.g., AUTO03-A2 on laboratory automation[6]), and Health Level Seven, Inc., a non-profit organization that provides software standards for data interoperability.

Given that broad, industry-wide effort to address automation issues, the initial results were as follows[7]:

  • Between 1965 and 2000, the Consumer Price Index increased by a factor of 5.5 in the United States.
  • During the same 36 years, at Mount Sinai Medical Center's chemistry department, the productivity (indicated as the number of reported test results/employee/year) increased from 10,600 to 104,558 (9.3-fold).
  • When expressed in constant 1965 dollars, the total cost per test decreased from $0.79 to $0.15.

In addition, the following data (Tables 1 and 2) from Dr. Michael Bissell of Ohio State University provides further insight into the potential increase in labor productivity from implementing TLA in the lab[8]:

Table 1. Overall productivity of labor (FTE = full-time equivalent; TLA = total laboratory automation)

  Ratio               Pre-TLA   Post-Phase 1   Change
  Tests/FTE           50,813    64,039         +27%
  Tests/Tech FTE      80,058    89,120         +11%
  Tests/Paid hour     20.8      52.9           +24%
  Tests/Worked hour   24.4      30.8           +26%

Table 2. Productivity of labor-processing area (FTE = full-time equivalent; TLA = total laboratory automation)

  Ratio                            Pre-TLA   Post-Phase 1   Change
  Specimens/Processing FTE         39,899    68,708         +72%
  Specimens/Processing paid hour   19.1      33.0           +73%
  Requests/Processing paid hour    12.7      21.5           +69%

Through TLA, labs have seen:

  • Improved sample throughput
  • Cost reduction
  • Less variability in the data
  • Reduced and more predictable consumption of materials
  • Improved use of people's talents

While this data is almost 20 years old, it illustrates the impact of a change from a manual system to an automated lab environment. It also gives us an idea of what might be expected if industry- or community-wide automation programs were developed.

Clinical laboratories are not unique in the potential to organize industry-wide standardized aspects of technology and work, and provide education. The same can be done anywhere as long as the ability to perform a particular test procedure isn’t viewed as a unique competitive advantage. The emerging Cannabis testing industry represents one such opportunity, among others.

The clinical industry has provided a template for the development of laboratory systems and automation. The list of instruments that meet the clinical communications standard continues to grow (e.g., Waters' MassLynx LIMS Interface[9]). There is nothing unique about the communications standards that prevents them from being used as a basis for development in other industries, aside from the data dictionary. As such, we need to move from an every-lab-for-itself approach to lab systems development toward a more cooperative and synergistic model.

Automation can cause some jobs to end, or at least change them significantly

Change is one of the things that concerns both managers and the people under them. No matter how beneficial the change is for the organization, it raises people’s anxiety levels and can affect their job performance unless they are prepared for it. In that context, questions and concerns staff may have in relation to automating aspects of a job include:

  • How are these changes going to affect my job and my income? Will it cause me to lose my job? That’s about as basic as it gets, and it can impact people at any organizational level.
  • Bringing in new technologies and products means learning new things, and that opens up the possibility that people may fail or not be as effective as they are currently. It can reshuffle the pecking order.
  • The process of introducing new equipment, procedures, etc. is going to disrupt the lab's workflow. The changes may be procedural, structural, or both; how are you going to deal with those issues?

Two past examples will highlight different approaches. In the first, a multi-instrument automation system was being introduced. Management told the lab personnel what was going to happen and why, and that they would be part of the final acceptance process. If they weren’t happy, the system would be modified to meet their needs. The system was installed, software written to meet their needs, instruments connected, and the system was tested. Everyone was satisfied except one technician, and that proved to be a major roadblock to putting the system into service. The matter was discussed with the lab manager, who didn’t see the problem; as soon as the system was up and running, that technician would be given a new set of responsibilities, something she was interested in. But no one told her that. As she saw it, once the system came on line she was out of a job. (One of the primary methods used in that lab's work was chromatography, with the instrument output recorded on chart paper. Most measurements were done using peak height, but peak area was used for some critical analyses. Those exacting measurements, made with a planimeter, were her responsibility and her unique—as she saw it—contribution to the lab's work. The instrument system replaced the need for this work. The other technicians and chemists had no problem adapting to the data system.) However, a short discussion between her and the lab manager alleviated her concerns.

The second example was handled a lot differently, and was concerned with the implementation of a lab’s first LIMS. The people in the lab knew something was going on but not the details. Individual meetings were held with each member of the lab team to discuss what was being considered and to learn of their impressions and concerns (these sessions were held with an outside consultant, and the results were reported to management in summary form). Once that was completed, the project started, with the lab manager holding a meeting of all lab personnel and IT, describing what was going to be done, why, how the project was going to proceed, and the role of those working in the lab in determining product requirements and product selection. The concerns raised in the individual sessions were addressed up-front, and staff all understood that no one was going to lose their job or suffer a pay cut. Yes, some jobs would change, and where appropriate that was discussed with each individual. There was an educational course about what a LIMS was, its role in the lab's work, and how it would improve the lab’s operations. When those sessions were completed, the lab’s personnel looked forward to the project. They were part of the project and the process, rather than having it done without their participation. In short, instead of it happening to them, it happened with them as willing participants.

People's attitudes about automation systems and being willing participants in their development can make a big difference in a project's success or failure. You don’t want people to feel that the incoming system and the questions surrounding it are threatening, or seen as something that is potentially going to end their employment. They may not freely participate or they may leave when you need them the most.

All of this may seem daunting for a lab to take on by itself. Large companies may have the resources to handle it, but we need more than a large company to do this right; we need an industry-wide effort.

Development of industry-wide guidelines

Key point: The discussion above about clinical labs illustrates what can be accomplished when an industry group focuses on a technology problem. We need to extend that thinking—and action—to a broader range of industries individually. The benefits of an industry-wide approach to addressing technology and education issues include:

  • providing a wider range of inputs to solving problems;
  • providing a critical marketing mass to lobby vendors to create products that fit customers’ needs in a particular industry;
  • developing an organized educational program with emphasis on that industry's requirements;
  • giving labs (startup and existing) a well-structured reference point to guide them (not dictate) in making technology decisions;
  • reducing the cost of automation, with improved support; and
  • where competitive issues aren’t a factor, enabling industry-funded R&D technology development and implementation for production or manufacturing quality, process control (integration of online quality information), and process management.

The proposal: each industry group should define a set of guidelines to assist labs in setting up an information infrastructure. For the most part, large sections would end up similar across multiple industries, as there isn’t that much behavioral distinction between some industry sets. The real separation would come in two places: the data dictionaries (data descriptions) and the nature of the testing and automation to implement that testing. Where there is a clear competitive edge to a test or its execution, each company may choose to go it alone, but that still leaves a lot of room for cooperation in method development, addressing both the basic science and its automated implementations, particularly where ASTM, USP, etc. methods are employed.

The benefits of this proposal are noted in the key point above. However, the three most significant ones are arguably:

  • the development of an organized education program with an emphasis on industry requirements;
  • the development of a robust communications protocol for application-to-application transfers; and,
  • the ability to lobby vendors from an industry-wide basis for product development, modification, and support.

In the "Looking forward and back" section earlier in this guide, we showed a bulleted list of considerations for the development of such a guideline-based framework. What follows is a more organized version of those points, separated into three sections, which all need to be addressed in any industry-based framework. For the purposes of this guide, we'll focus primarily on Section C: "Issues that need concerted attention." Section A on background and IT support, and Section B on lab-specific background information, aren't discussed as they are addressed elsewhere, particularly in the previous works referenced in the Introduction.

A. General background and routine IT support
1. Physical security (including power)
2. Cybersecurity
3. Cloud and virtualization technologies
B. Lab-specific background information
1. Informatics (could be an industry-specific version of ASTM E1578 - 18 Standard Guide for Laboratory Informatics[10])
2. Sample storage management and organization (see Appendix 1, section 1.4 of this guide)
3. Guidance for automation
4. Data integrity and security (see S. Schmitt's Assuring Data Integrity for Life Sciences[11], which has broader application outside the life sciences)
5. The FAIR principles (the findability, accessibility, interoperability, and reusability of data; see Wilkinson et al. 2016[2])
C. Issues that need concerted attention:
1. Education for lab management and lab personnel
2. The conversion of manual methods to semi- or full-automated methods (see Considerations in the Automation of Laboratory Procedures for more)
3. Long-term usable access to lab information in databases without vendor controls (i.e., the impact of software as a service and other software subscription models)
4. Archived instrument data in standardized formats and standardized vendor software (requires tamper-proof formats)
5. Instrument design for automation (Most instruments and their support software are dual-use, i.e., they work as stand-alone devices via front panels or through software controls. While this is a useful selling tool, it means the device is larger, more complex, and more expensive than an automation-only device that uses software [e.g., on a smartphone or computer] for everything. Instruments and devices designed for automation should be more compact and permit more efficient automation systems.)
6. Communications (networking, instrument control and data, informatics control and data, etc.)

We'll now go on to expand upon items C-1, C-3, C-4, and C-6.

C-1. Education for lab management and lab personnel

Laboratory work has become a merger of two disciplines: science and information technology (including robotics). Without the first, nothing happens; without the second, work will proceed but at a slower, more costly pace. There are different levels of education requirements. Those working at the lab bench not only need to understand the science, but also how the instrumentation and supporting computer systems (if any, including those embedded in the instrument) make and transform measurements into results. “The computer did it” is not a satisfactory answer to “how did you get that result,” nor is “magic," nor is maintaining willful ignorance of exactly how the instrument measurements are taken. Laboratorians are responsible and accountable for the results, and they should be able to explain the process of how they were derived, including how settings on the instrument and computer software affect that work. Lab managers should understand the technologies at the functional level and how the systems interact with each other. They are accountable for the overall integrity of the systems and the data and information they contain. IT personnel should understand how lab systems differ from office applications and why business-as-usual for managing office applications doesn’t work in the lab environment. The computer systems are there to support the instruments, and any changes that may affect that relationship should be initiated with care; the instrument vendor’s support for computer systems upgrades is essential.

As such, we need a new personnel position, that of the laboratory systems engineer or LSE, to provide support for the informatics architecture. This isn’t simply an IT person; it should be someone who is fluent in both the science and the information technology applied to lab work. (See Laboratory Technology Planning and Management: The Practice of Laboratory Systems Engineering for more on this topic.)

C-3. Long-term usable access to lab information in databases without vendor controls

The data and information your lab produces, with the assistance of instrumentation and instrument data systems, is yours, and no one should put limits on your ability to work with it. There is a problem with modern software design: most lab data and information can only be viewed through applications software. In some cases, the files may be used with several applications, but often it is the vendor's proprietary formats that limit access. In those instances, you have to maintain licenses for that software for as long as you need access to the data and information, even if the original application has been replaced by something else. This can happen for a variety of reasons:

  • Better technology is available from another vendor
  • A vendor sold part of its operations to another organization
  • Organizations merge
  • Completion of a consulting or research contract requires all data to be sent to the contracting organization

All of these, and others, are reasons organizations wind up maintaining multiple versions of similar datasets: people need access to the data but don’t want to maintain software licenses indefinitely, even as they must still meet regulatory (FDA, EPA, ISO, etc.) requirements.

All of this revolves around your ability to gain value from your data and information without having to pay for its access. The vendors don’t want to give their software away for free either. What we need is something like the relationship between Adobe Acrobat Reader and the Adobe Acrobat application software. The latter gives you the ability to create, modify, comment, etc. documents, while the former allows you to view them. The Reader gives anyone the ability to view the contents, just not alter it. We need a “reader” application for instrument data collected and processed by an instrument data system. We need to be able to view the reports, raw data, etc. and export the data in a useful format, everything short of acquiring and processing new data. This gives you the ability to work with your intellectual property and allows it to be viewed by regulatory agencies if that becomes necessary, without incurring unnecessary costs or depriving the vendor of justifiable income.

This has become increasingly important as vendors have shifted to a subscription model for software licenses in place of one-time payments with additional charges for voluntary upgrades. One example from another realm illustrates the point. My wife keeps all of her recipes in an application on her iPad. One day she looked for a recipe and received a message that read, roughly, “No access unless you upgrade the app <not free>.” As it turned out, a Google search recommended re-installing the current app instead. It worked, but she upgraded anyhow, just to be safe. It’s your content, but who “owns” it if the software vendor can impose controls on how it is used?

As more of our work depends on software, we find ourselves in a constant upgrade loop of new hardware, new operating systems, and new applications just to maintain the status quo. We need more control over what happens to our data. Industry-wide guidelines backed by the buying power of an industry could encourage vendor policies that mitigate this. Earlier in this document we noted that change is going to continue for a long time, with hardware and software evolving in ways we can’t imagine. Hardware changes (anyone remember floppy disks?) inevitably make almost everything obsolete, so how do we protect our access to data and information? Floppy disks were the go-to media 40 years ago, and since then cassette tapes, magnetic tape, Zip drives, CD-ROMs, DVDs, SyQuest drives, and other types of storage media have come and gone. Networked systems, at the moment, are the only consistent and reliable means of exchange and storage as datasets increase in size.

One point we have to take into account is that versions of applications will only function on certain versions of operating systems and databases. All three elements are going to evolve, and at some point we’ll have to deal with “old” generations while new ones are coming online. One good answer to that is emulation. Virtualization software such as VMware allows packages of operating systems, databases, and applications to operate on computers regardless of their age, with each collection residing in a “container”; we can have several generations of those containers residing on a computer and execute them at will, as if they were still running on the original hardware. If you are looking for the data for a particular sample, go to the container that covers its time period and access it. Emulation packages are powerful tools; using them you can even run Atari 2600 games on a Windows or OS X system.

Addressing this issue is generally bigger than what a basic laboratory-based organization can handle, involving policy making and support from information technology support groups and corporate legal and financial management. The policies have to take into account physical locations of servers, support, financing, regulatory support groups, and international laws, even if your company isn’t a multinational (third-party contracting organizations may be). Given this complexity, and the fact that most companies in a given industry will be affected, industry-wide guidelines would be useful.

C-4. Archived instrument data in standardized formats and standardized vendor software

This has been an area of interest for over 25 years, beginning with the Analytical Instrument Association's work that resulted in a set of ASTM Standard Guides:

  • ASTM E2078 - 00(2016) Standard Guide for Analytical Data Interchange Protocol for Mass Spectrometric Data[12]
  • ASTM E1948 - 98(2014) Standard Guide for Analytical Data Interchange Protocol for Chromatographic Data[13]

MilliporeSigma continues to invest in solutions based on the Analytical Information Markup Language (AnIML) standard (an outgrowth of work done at NIST).[14] There have also been a variety of standards programs, all of which have the goal of moving instrument data into a neutral data format that is free of proprietary interests, allowing it to be used and analyzed as the analyst needs (e.g., JCAMP-DX[15] and GAML[16]).

Data interchange standards can help address issues in two aspects of data analysis: qualitative and quantitative work. In qualitative applications, the exported data can be imported into other packages that provide facilities not found in the original data acquisition system. Examining an infrared spectrum or nuclear magnetic resonance (NMR) scan depends upon peak amplitude, shape, and position to provide useful information, and some software (including user-developed software) may provide facilities that the original data acquisition system didn’t. It might also be a matter of having a file to send to someone for review or inclusion in another project.

Quantitative analysis is a different matter. Techniques such as chromatography, inductively coupled plasma mass spectrometry (ICP), and atomic absorption spectroscopy, among others, rely on the measurement of peak characteristics or single wavelength absorbance measurements in comparison with measurements made on standards. A single chromatogram is useless (for quantitative work) unless the standards are available. (However, it may be useful for qualitative work if retention time references to appropriate known materials are available.) If you are going to export the data for any of the techniques noted, and others as well, you need the full collection of standards and samples for the data to be of any value.

Yet there is a problem with some, if not all, of these programs: they trust the integrity of the analyst to use the data honestly. It is possible for people to use these exported formats in ways that circumvent current data integrity practices and falsify results.

There are good reasons to want vendor-neutral data formats so that data sets can be examined by user-developed software, putting the data into a form where the analysis is not limited by a vendor's product design. It also holds the potential for separating data acquisition from data analysis, as long as all pertinent data and information (e.g., standards and samples) were held together in a package that could not be altered without detection. It may be that something akin to blockchain technology could be used to register and manage access to datasets on a company-by-company basis (each company having its own registry that would become part of the lab's data architecture).

These standard formats, created by people with a passion for doing something useful and beneficial to the practice of science (sometimes at their own expense), are potentially very useful to labs. This is another area where a coordinated industry-wide statement of requirements and support would lead to significant advancements in systems development and enhanced capability for those practicing instrumental science.

If these capabilities are important to you, then addressing that need has to be part of an industry-wide conversation and consensus to provide the marketing support to have the work done.

C-6. Communications

Most computers and electronic devices in the laboratory have communications capability built in: RS-232 (or similar), digital I/O, Ethernet port, Bluetooth, Wi-Fi, IEEE-488, etc. What these connection types do is provide the potential for communications, and the realization of that potential depends on message formatting, structure of the contents, and moving beyond proprietary interests to those expressed by an industry-wide community. There are two types of devices that need to be addressed: computer systems that service instruments (instrument data systems), and devices that don’t require an external computer to function (pH meters, balances, etc.) but may have Ethernet ports, serial ASCII, digital I/O, or IEEE-488 connections. In either case, the typical situation is one in which the vendor has determined a communications structure and the user has to adapt their systems to it, often using custom programming to parse messages and take action.

The user community needs to determine its needs and make them known using community-wide buying power as justification for asking for vendor development efforts. As noted earlier, this is not an adversarial situation, but rather a maturation of industry-wide communities working with vendors; the “buying power” comments simply give vendors the confidence that their development efforts won’t be in vain.

In both cases we need application-to-application message structures that meet several needs. They should be able to handle test method identifications, so that the receiving application knows what the message is addressing, as well as whether the content is a list of samples to be processed or samples that have already been processed, with results. Additionally, the worklist message should ideally contain, in attribute-value pairs, the number of samples, with a list of sample IDs to be processed. As for the completed work message, the attribute-value pairs would also ideally contain the number of samples processed, the sample IDs, and the results of the analysis (which could be one or more elements depending on the procedure). Of course, there may be other elements required of these message structures; these were just examples. Ultimately, the final framework could be implemented in a format similar to that used in HTML files: plain-text that is easily read and machine- and operating-system-independent.
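
For illustration only, a worklist message and the corresponding completed-work message might look like the following in plain text; the tag names and layout are hypothetical, not a published standard:

    method_id: pH-001
    message_type: worklist
    sample_count: 3
    sample_ids: S-1001, S-1002, S-1003

    method_id: pH-001
    message_type: results
    sample_count: 3
    S-1001: 6.92
    S-1002: 7.01
    S-1003: 6.88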

Mapping the message content to database system structure (LIMS or ELN) could be done using a simple built-in application (within the LIMS or ELN) that would graphically display the received message content on one side of a window, with the database’s available fields on the other side. The two sides would then graphically be mapped to one another, as shown in Figure 4 below. (Note: Since the format of the messages is standardized, we wouldn’t need a separate mapping function to accommodate different vendors.)


Fig4 Liscouski DirectLabSysOnePerPersp21.png

Figure 4. Mapping IDS message contents to database system
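
To make the mapping idea concrete, here is a minimal sketch (all names hypothetical) of how a standardized message's attributes might be routed to LIMS fields once the map has been defined:

    # A single attribute-to-field map, defined once (perhaps through the
    # graphical tool described above), routes any incoming standardized
    # message into the database. All names here are hypothetical.
    FIELD_MAP = {
        "method_id":  "lims_test_code",
        "sample_ids": "lims_sample_id",
        "results":    "lims_result_value",
    }

    def map_message(message: dict) -> dict:
        """Translate standardized message attributes into LIMS field names."""
        return {FIELD_MAP[k]: v for k, v in message.items() if k in FIELD_MAP}

    incoming = {"method_id": "pH-001", "sample_ids": ["S-1001"], "results": [6.92]}
    print(map_message(incoming))
    # {'lims_test_code': 'pH-001', 'lims_sample_id': ['S-1001'], 'lims_result_value': [6.92]}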

The second case—devices like pH meters, etc.—is a little more interesting since the devices don’t have the same facilities available as a computer system provides. However, in the consumer marketplace, this is well-trod ground, using a smartphone or tablet as an interface, with translation mechanisms between small, fixed-function devices and more extensive application platforms. The only non-standard element is a single-point Bluetooth connection, as shown in Figure 5.


Fig5 Liscouski DirectLabSysOnePerPersp21.png

Figure 5. Using a smartphone or tablet as an interface to a LIMS or ELN. The procedure takes a series of pH measurements for a series of samples.

A single-point Bluetooth connection is used to exchange information between the measuring device and the smartphone or tablet. There are multi-point connection devices (the smartphone, for example), but we want to restrict the device to avoid confusion about who is in control of the measuring unit. A setup screen (not shown) would set up the worklist and populate the sample IDs. The “Take Reading” button would read the device and enter the value into the corresponding sample position—taking them in order—and enable the next reading until all samples had been read. “Send” would transmit the formatted message to the LIMS or ELN. Depending on the nature of the device and the procedure being used, the application can take any form; this is just a simple example taking pH measurements for a series of samples. In essence, the device-smartphone combination becomes an instrument data system, and the database setup would be the same as described above.
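
A minimal sketch of the logic behind those two buttons might look like the following, assuming stand-in functions for the Bluetooth read and the message transmission (all names hypothetical):

    import random

    def read_ph() -> float:
        # Stand-in for a Bluetooth read from the pH meter.
        return round(random.uniform(4.0, 9.0), 2)

    def run_worklist(sample_ids):
        readings = {}
        for sample_id in sample_ids:          # readings are taken in order
            readings[sample_id] = read_ph()   # one press of "Take Reading"
        return readings

    def send_to_lims(method_id, readings):
        # Format the completed-work message described earlier.
        lines = [f"method_id: {method_id}", f"sample_count: {len(readings)}"]
        lines += [f"{sid}: {value}" for sid, value in readings.items()]
        print("\n".join(lines))               # stand-in for transmission

    send_to_lims("pH-001", run_worklist(["S-1001", "S-1002", "S-1003"]))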

The bottom line is this: standardized message formats can greatly simplify the interfacing of instrument data systems, LIMS, ELNs, and other laboratory informatics applications. The clinical chemistry community created communications standards that permit meaningful messages to be sent between an instrument data system and a database system, structured so that either end of the link recognizes the message and can extract and use the message content without the custom programming common to most labs today. There is no reason why the same thing can’t be done in any other industry; it may even be possible to adapt the clinical chemistry protocols. The data dictionary (list of test names and attributes) would have to be adjusted, but that is a minor point that can be handled on an industry-by-industry basis and be incorporated as part of a system installation.

What is needed is people coming together as a group and organizing and defining the effort. How important is it to you to streamline the effort in getting systems up and running without custom programming, to work toward a plug-and-play capability that we encounter in consumer systems (an environment where vendors know easy integration is a must or their products won’t sell)?

Additional note on Bluetooth-enabled instruments

The addition of Bluetooth to a device can result in much more compact systems, making the footprint smaller and reducing the cost of the unit. By using a smartphone to replace the front panel controls, the programming can become much more sophisticated. Imagine a Bluetooth-enabled pH meter and a separate Bluetooth-enabled, electronically controlled titrator. That combination would permit optimized[a] delivery of titrant, making the process faster and more accurate, while also providing a graphical display of the resulting titration curve, putting the full data processing capability of the smartphone and its communications at the service of the experiment. Think about what the addition of a simple clock did for thermostats: it opened the door to programmable thermostats and better controls. What would smartphones controlling simple devices like balances, pH meters, titrators, etc. facilitate?

Laboratory work and scientific production

Key point: Laboratory work is a collection of processes and procedures that have to be carried out for research to progress, or to support product manufacturing and production processes. In order to meet “productivity” goals, get enough work done to make progress, or provide a good ROI, we need labs to transition from manual to automated (partial or full) execution as much as possible. We also need to ensure and demonstrate process stability, so that lab work itself doesn’t create variability and unreliable results.

Different kinds of laboratory work

There are different kinds of laboratory work. Some consist of a series of experiments or observations that may not be repeated, or are repeated with expected or unplanned variations. Other experiments may be repeated over and over either without variation or with controlled variations based on gained experience. It is that second set that concerns this writing, because repetition provides a basis and a reason for automation.

Today we are used to automation in lab work, enacted in the form of computers and robotics ranging from mechanical arms to auto-samplers, auto-titration systems, and more. Some of the technology development we've discussed began many decades before, and there's utility in looking back that far to note how technologies have developed. However, while there are definitely forms of automation and robotics available today, they are not always interconnected, integrated, or compatible with each other; they were produced as products either without an underlying integration framework, or with one that was limited to a few cooperating vendors. This can be problematic.

There are two major elements to repetitive laboratory methods: the underlying science and how the procedures are executed. At these methods’ core, however, is the underlying science, be it a chemical, biological, or physical sequence. We automate processes, not things. We don’t, for example, automate an instrument, but rather the process of using it to accomplish something. If you are automating the use of a telescope to obtain the spectra of a series of stars, you build a control algorithm to look at each star in a list, have the instrument position itself properly, record the data using sensors, process it, and then move on to the next star on the list. (You would also build in error detection with messages like "there isn’t any star there," "cover on telescope," and "it isn’t dark yet," along with relevant correction routines.) The control algorithm is the automation, while the telescope and sensors are just tools being used.

When building a repetitive laboratory method, the method should be completely described and include aspects such as:

  • the underlying science
  • a complete list of equipment and materials when implemented as a manual process
  • possible interferences
  • critical facets of the method
  • variables that have to be controlled and their operating ranges (e.g., temperature, pH of solutions, etc.)
  • special characteristics of instrumentation
  • safety considerations
  • recommended sources of equipment and sources to be avoided
  • software considerations such as possible products, desirable characteristics, and things to be avoided
  • potentially dangerous, hazardous situations
  • an end-to-end walkthrough of the methods used

Of course, the method has to be validated. You need to have complete confidence in its ability to produce the results necessary given the input into the process. At this point, the scientific aspects of the work are complete. They may be revisited if problems arise during process automation, but no further changes should be entertained unless you are willing to absorb additional costs and alterations to the schedule. This is a serious matter; one of the leading causes of project failure is “scope creep,” which occurs when incremental changes are made to a process while it is under development. The project becomes a moving target, where seemingly minor changes can cause a major disruption in the project's design.

Scientific production

At this point, we have a proven procedure that we need to execute repeatedly on a set of inputs (samples). We aren’t carrying out "experiments"; that word suggests that something may be changing in the nature of the procedure, and at this point it shouldn’t. Changes to the underlying process invite a host of problems and may forfeit any chance at a successful automation effort.

However, somewhere in the (distant) future something in the process is likely to change. A new piece of technology may be introduced, equipment (including software) may need to be upgraded, or the timing of a step may need to be adjusted. That’s life. How that change is made is very important. Before it is implemented, the process has to be in place long enough to establish that it works reliably, that it produces useful results, and that there is a history of running the same reference sample(s) over and over again in the mix of samples to show that the process is under control with acceptable variability in results (i.e., statistical process control). In manufacturing and production language, this is referred to as "evolutionary operations" (EVOP). But we can pull it all together under the heading “scientific production.”[b]

If your reaction to the previous paragraph is “this is a science lab not a commercial production operation,” you’re getting close to the point. This is a production operation (commercial or not, it depends on the type of lab, contract testing, and other factors) going on within a science lab. It’s just a matter of scale.

Demonstrating process stability: The standard sample program

One of the characteristics of a successfully implemented stable process is consistency of the results with the same set of inputs. Basically, if everything is working properly, the same set of samples introduced at the beginning should yield the same results time after time, regardless of whether the implementation is manual or automated. A standard sample program (Std.SP) is a means of demonstrating the stability and operational integrity of a process; it can also tell you if a process is getting out of control. Introduced early in my career, the Std.SP was used to show the consistency of results between analysts in a lab, and to compare the performance of our lab and the quality control labs in the production facilities. The samples were submitted and marked like any similar sample and became part of the workflow. The lab managers maintained control charts of the lab's and each individual's performance on the samples. The combined results would show whether the lab and those working there had things under control or if problems were developing. Those problems might be with an individual, a change in equipment performance, or a change in incoming raw materials used in the analysis.

The Std.SP was the answer to the typical “how do you know you can trust these results?” question, which often came up when a research program or production line was having issues. This becomes more important when automation is used and the sampling and testing throughput increases. If an automated system is having issues, you are just producing bad data faster. The Std.SP is a high-level test of the system's performance. A deviation of the results trend line or a widening of the variability is an indication that something is wrong, which should lead to a detailed evaluation of the system to account for the deviations. A troubleshooting guide (containing guidance such as “if something starts going wrong, here’s where to look for problems” or “the test method is particularly sensitive to …”) should be part of the original method description, along with notes made during the implementation process.
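
As a minimal sketch of what such a control chart check might look like (the reference values are hypothetical), consider:

    # Minimal control-chart check for a standard sample program: flag any
    # result outside mean +/- 3 standard deviations of the reference history.
    from statistics import mean, stdev

    reference = [5.02, 4.98, 5.01, 5.00, 4.99, 5.03, 4.97, 5.01]
    center, sigma = mean(reference), stdev(reference)
    lower, upper = center - 3 * sigma, center + 3 * sigma

    for run, result in enumerate([5.00, 5.02, 5.14], start=1):
        status = "in control" if lower <= result <= upper else "OUT OF CONTROL"
        print(f"run {run}: {result:.2f} ({status})")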

Reference samples are another matter, however. They have to be stable over a long enough period of use to establish a meaningful trend. If the reference samples are stable over a long period of time, you may only need two or three so that the method can be evaluated over different ranges (you may have a higher variability at a lower range than a higher one). If the reference samples are not stable, then their useful lifetime needs to be established and older ones swapped out periodically. There should be some overlap between samples near the end of their useful life and the introduction of new ones so that laboratorians can differentiate between system variation and changes in reference samples. You may need to use more reference samples in these situations.

This inevitably becomes a lot of work, and if your lab utilizes many procedures, it can be time-consuming to manage a Std.SP. But before you decide it is too much work, ask yourself how valuable it is to have a solid, documented answer to the “how do you know you can trust these results?” question. Having a validated process or method is a start, but that only holds at the beginning of a method’s implementation. If all of this is new to you, find someone who understands process engineering and statistical process control and learn more from them.

Where is the future of lab automation going to take us?

Key point: There is no single “future” for laboratories; each lab charts its own path. However, there are things they can all do to be better prepared to meet the next day, month, or decade. Plan for flexibility and adaptability. Keep your data and information accessible, educate your personnel to recognize and take advantage of opportunities, and don’t be afraid of taking risks.

Where is the future of lab automation going to take us? The simple answer is that the future is in your hands. Lab personnel are going to be making their own choices, and that is both a reflection of reality and part of the problem. Aside from a few industries, labs have approached the subject of automation individually, making their own plans and charting their own course of action. The problem that approach engenders is that it is both inefficient and wasteful of resources (e.g., funding, time, and effort).

Earlier we saw how the clinical chemistry industry handled the problem of containing costs through automation: it made the automation work better. Within any given industry, it’s unlikely that the same type of testing is going to differ markedly; there will be unique demands and opportunities, but they will be the exception. Why not pool resources and solve common problems? It would benefit the end-user in the lab as well as the vendor, encouraging those vendors to build products for a specific market rather than meeting the needs of a single customer. Among the biggest issues in automation and lab computing is communications between devices; the clinical chemistry industry has solved that. Their solution has elements that are specific to their work but they should be easily adjusted to meet the needs of other groups.

We’re in a phase where we have many technologies that work well independently for their intended purpose, but they don't work well together. We’ve been there before, with two examples to be found in computer networking and computer graphics.

Computer networks began to emerge commercially in the early 1970s. Each computer vendor had their own approach to networking (similar hardware but different communications protocols). Within each vendor's ecosystem things worked well, yet bridging across ecosystems was interesting. When network installations occurred at customer sites, the best hardware and software people were involved in planning, laying out the systems, installing software, and getting things to work. Installation time was measured in weeks. Today, a reasonably sophisticated home or business network can be installed in an hour or two, maybe longer depending on the number of machines and components involved, and when you turn it on, you’d be surprised if it didn’t work. What made the difference? Standards were developed for communications protocols. Instead of each vendor having their own protocol, the world adopted TCP/IP and everything changed.

The path of computer graphics development provides further insight. In the 1970s and 80s, when computer graphics hardware began seeing widespread use, every vendor in the market had their own graphics display hardware and software. If you liked one vendor's graphics library, you had to buy their hardware. If you liked their hardware you had to live with their software. We saw this play out in the early days of the PC, when there were multiple graphics adapter cards (e.g., enhanced graphics adapter [EGA], color graphics adapter [CGA], video graphics array [VGA], Hercules, etc.), and if you had the wrong card installed, the software wouldn’t work (although combination cards allowed the user to switch modes and reduce the number of boards used in the computer). This continued to frustrate users and developers, until a standardized graphics architecture was developed that was device-independent. Problem solved.

This is a common theme that comes up over and over. A new technology develops, and vendors build products with unique features that don’t play well with each other (i.e., each vendor attempts to achieve a strong market position). Customers get frustrated and standards are developed that convert “don’t play well” to "easy integration," and the market takes off.

How the future of lab automation is going to play out for you is in your hands, but there are things we can do to make it easier. As we noted earlier, few if any of the developments in lab automation were initially done according to a plan, though we’ve also seen what planning can actually do in at least one setting, the field of clinical chemistry. This highlights the fact that not only do you need a plan for your laboratory, but a plan for automation in your industry would also be worthwhile; it would avoid spending resources individually where pooling is a better approach, and it would give you a stronger voice to work with vendors to create products that meet your industry's needs. A set of industry-wide strategies (e.g., for mining, viniculture, cosmetics, materials research, healthcare, biotech, electronics, etc.) would give you a solid starting point for the use of computing and automation technologies in your lab. There still would be plenty of room for unique applications.

Industry working groups are one need, while getting people educated is another. Determining how much you are willing to rely on automation is an additional point, which we’ll illustrate with a discussion of quality control labs, in the next section.

Before we move on to the next section, however, there are two terms that haven’t been mentioned yet that need to be noted: artificial intelligence (AI) and big data. I’m not sure I know what AI is (the definition and examples change frequently), or that I’d trust what is available today for serious work. For every article that talks about the wonders it will bring, there is one that talks about the pitfalls of design, built-in bias, and concerns about misapplication. In science, aside from using it as a “did you know about this?” advisory tool, I’d be concerned about using it. If you don’t know how the AI arrived at its answer (one of AI’s characteristics), why would you trust it? Big data, on the other hand, is on a more solid technical footing and should prove useful, though only if your data and information are structured to take advantage of it. You can’t just say “here are all my datasets”; you have to organize them to be usable.

Applying automation to lab work

Key point: Today, people’s attitudes about work have changed, and with it the world has changed; the COVID-19 pandemic has forced us to re-evaluate models of work. If nothing else, even though we’re talking about science, the use of technology to conduct scientific work is going to take on a new importance. This won’t be solved one lab at a time, but on an industry-by-industry basis. We need to think about key concepts and their implementation as a community if truly effective solutions are to be found and put into practice.

In Considerations in the Automation of Laboratory Procedures, the criteria for "automation readiness" were described. Two significant points were that a laboratory procedure must both be stable and have a duration of use long enough to justify the cost of automation. Beyond that, additional concerns such as safety may come into play, but those two points were the most critical. From those points we can roughly derive the automation needs of research, testing, quality control, and contract laboratories.

Research laboratory: Automation in research is going to be predominantly at the task level, i.e., portions of a process being done by automated instruments and devices rather than end-to-end automation. This is due to the two key points noted above: research labs often change procedures, with some being very short-lived, and the cost of full automation (including validation) may not be worth it unless the equipment is designed to be modular, interconnected, and easily programmed. One exception can be seen with Bayer.[17] They are using robotics and microplate technology to automate a high-throughput screening system that, in their words, couldn’t be done practically by people; it's too slow and too expensive. Could something like this become more commonplace?

Testing laboratory: Non-routine testing laboratories are going to have a mix of task-level and end-to-end automation. It will depend on the stability of the test process, whether there is sufficient work to justify the cost of automation, and what level of control over the form of the incoming samples is available (see Appendix 1, section 1.3 of this guide). Often samples are submitted for multiple tests, which raises an additional concern: is the last step in a procedure destructive to the sample? Samples may be prepared so that they can be used in more than one test procedure. If the test process is destructive, then the initial preparation has to be divided so there is enough material for all testing; this can complicate the automation process.

Quality control laboratory: Quality control (QC) labs should be the ideal place for automation to take hold. The procedures are well defined, stable, and entrenched. The benefits in faster throughput and cost reduction should be clear. The biggest place for problems is in sample preparation and getting material ready for analysis. Given that the form of the samples is known and predictable, it is worthwhile to spend time analyzing and designing a means of working with samples. There are going to be some preparation steps that may be too difficult to justify automation; this can be seen with tensile testing of polymers and fibers. Fabricating sample parts and waiting for the material to stabilize may make parts of the process difficult to deal with, or not financially justifiable. Insertion into the test apparatus may also be an issue. Testing that can be done by spectroscopic, chromatographic, or other easily automated instrumentation may be migrated from the laboratory to in-line testing, raising questions about the nature of the QC process.

Contract or independent laboratory: The ability to apply automation beyond the task level depends on the kind of work being done; is it specialized or highly flexible? The greater the flexibility (e.g., a number of different procedures with a small number of samples submitted at a time) the more task level automation will apply. However, the justification for automation may not be there. Specialized labs (e.g., clinical chemistry labs) or a lab that specializes in a technique with a high demand and throughput should benefit from a move to end-to-end automation since it reduces costs and improves accuracy and turn-around times (TATs).

More on the quality control lab and its process management

The evolution of QC labs may change our perspective of how QC testing is accomplished, as well as the nature of QC lab operations. Our current view, and that of the last few decades, may change significantly.

Early QC testing

Some form of quality testing was done during the production process in prior decades. For example, I’ve seen production station operators bite plastic materials to get an idea of how well the production process, and the expansion of polystyrene beads, was going in the production of Styrofoam cups (though this was not encouraged by management as butane and pentane, used as expansion agents, could cause health problems). If the cup was too stiff or soft, it would indicate a problem in the molding process. There were other “quick and dirty” production floor tests that were run because the TAT in the lab was producing an unacceptable delay (if you’re making adjustments, you need measurements quickly). People may also have wanted to avoid having a record of production problems and changes.

In another application (anecdotal reporting from a course on plastics manufacturing)—calendering[c] of plastic coated paper—the production operator would strike the laminated paper with a slender bamboo rod and listen to the sound it made to determine the product quality. According to the course instructor, he was adept at this, and no other test could replace this in-process procedure.

As such, it's clear not all "quality" testing went on in the laboratory.

Normal QC testing

Traditionally, QC lab operations became separate from production, partly because of the physical conditions needed to conduct the work, and partly to avoid bias in the results. The conclusion: it was important that the reporting lines of communication be separate and independent from production. QC labs would, and do, perform testing on incoming materials to certify them suitable for use and produced products to see if they met product specifications, as well as perform in-process testing. They would also certify products as qualified for shipment, acting as an unbiased evaluator of product quality.

Quality tests were manually implemented until the early 1960s. Then we saw the advent of in-process instrumentation and process chromatographs by Fisher Controls and Mine Safety Appliances, for example. While the instruments themselves were on the production floor, their management and analysis were under the control of the QC lab, at least in the facility I worked in. The instruments' maintenance, calibration, peak measurements, and sample calculations were all done by lab personnel. Since that time, we’ve seen continued growth in in-line testing for production processes. That said, what’s the logical conclusion of increasing automation?

A thought experiment

Let's posit that we have a production process whose raw materials are fluids and the end product is a fluid. We’d be concerned with certifying incoming raw materials as suitable for use, while monitoring the production process for product composition and potential contaminants. The end product would have to be certified for shipment.

If all the testing were chromatographic with in-process instruments and a chromatography data system (CDS), and/or in-line spectrophotometric and other in-line or in-process tests, with the results becoming part of the process control system (sample IDs would be a hash of sensor type-location-timestamp), what would the role of the QC lab become? Assuring that the equipment was running properly and regularly calibrated, with periodic verification of test results (off-line testing), is one set of possibilities. Is this a desirable developmental direction? We need to look at the benefits and issues that result from this design.
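
(As an aside on the sample IDs just mentioned, the following is one way such an ID might be constructed; the fields, separator, and digest length are illustrative assumptions.)

    # Hash the sensor type, its location, and the reading's timestamp to
    # derive a sample ID. Field choices and digest length are illustrative.
    import hashlib

    def sensor_sample_id(sensor_type: str, location: str, timestamp: str) -> str:
        raw = f"{sensor_type}|{location}|{timestamp}"
        return hashlib.sha256(raw.encode()).hexdigest()[:16]

    print(sensor_sample_id("GC", "line-2-outlet", "2021-11-04T13:25:00Z"))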

Benefits:

  • It provides an integrated system where all process sensor and test result data are immediately available to management. This allows them to detect issues faster and provide a more timely response (better process control), reducing downtime and off-spec product. If we want to enter the world of science fiction, we can even imagine a combined AI and machine learning solution providing closed-loop process monitoring and control.
  • It signifies a potential cost reduction resulting from smaller labs and lower personnel costs.

Issues:

  • There's a loss of independent product evaluation. Basically, you are trusting the system to honestly monitor, report, and reject off-spec incoming and outgoing material. In the movie version, this is where the soundtrack becomes ominous. This loss of independent checking may reduce customer confidence in product quality.
  • The veracity of statistical process control and correction may suffer.
  • System validation could be challenging as the production process has to be validated, each sensor and instrument data system has to be validated, and the integrated system has to be validated, including the statistical process control.

Assuming such a system were built, how far are we willing to trust automated systems to function without external oversight? The control room would still be populated with people managing the process and exerting a higher level of scrutiny, but you are still trusting the designers of the system to do very sophisticated work, not only in process design but also in integrating testing as part of the effort. Ideally you’d want the CDS and other instrument data processing equipment in the control room, where it is more easily used and maintained, rather than on the process floor. The role of the QC lab would then change to that of an overarching quality manager of the entire system, ensuring that equipment functioned properly, testing was accurate, the process and testing were operating within control limits, and the data logs were correct.

Some organizations may be past this point, while for others this may be interesting, bordering on science fiction. The point of this thought experiment is to see what could happen and where your comfort level is with it. How much control do you want to give an automated system and how much do you want to retain? What are the consequences of not providing sufficient oversight? How much bad product could be made?

Also note that this isn’t an all-or-nothing proposition; give it some room to work, see what happens, and if everything is good, give it more. Just build in a big red button that allows you to reboot and revert to manual operations; in other words, don't self-destruct, just remove some critical controls. A lot depends on the nature of the finished product. If the end product is something critical (e.g., a medical device or therapeutic), you’ll want to be cautious about phasing in automated control systems.

All that said, two additional points should be made. First, be willing to play with the ideas. Turn it into a science fiction project (“sci fi” is just a playground for "what ifs"), removing it from reality enough that people can look at it from different perspectives and see what might work and what might not. Then let people play with those ideas; you might learn something. What are all the things that could go right, and what could go wrong (and what can you do about it)? You probably won't have to worry about alien robots, but malware interference is certainly a factor, as is the question of whether to air-gap the network. There is always the possibility of someone causing a problem; the question of course is how do you detect and correct it. Second, be willing to model the system. There are a number of modeling packages ideal for this purpose. You can model the behavior and see how different control methods react.

(While this thought experiment used a process involving fluids only, as they are relatively easy to work with, it's worth noting that solid materials become more of an issue, complicating the automation process [see Appendix 1, section 1.4 of this guide]. In some cases sample preparation for testing may be too cumbersome for automation. This would shift the automated-manual testing balance more toward the latter in those cases, introducing delays and disrupting the timing of results to process control.)

The creation of a "center for laboratory systems engineering"

Key point: Throughout this piece, the need for education has been a consistent theme. Developing and using the technologies in lab work, both scientific and informatics-related, will require people who know what they are doing, specifically educated to carry out the work noted above. We also need a means of pulling things together so that there is a centralized resource to start a learning process and continue development from there.

Let's propose a "center for laboratory system engineering." This center would firstly prepare people to be effective planners, designers, implementers, supporters, and users of laboratory informatics and automation systems in scientific applications. Additionally, the center would ideally drive innovation and provide assistance to scientific personnel and IT groups seeking to apply and manage such laboratory technologies.

Those goals would be accomplished by:

  • Developing and delivering courses for LSEs, lab personnel, and IT support (These courses would cover technical science topics as well as skills in working with people, conflict resolution, and communications. They would be presented both in-person and online or on-demand to reach a broad audience; an intensive summer course with hands-on experience should also be considered.)
  • Creating an LSE certification program
  • Carrying out research on the application of informatics, robotics, and computer-assisted data collection and processing
  • Documenting the best practices for an LSE
  • Aggregating and publishing material on the roles and requirements of the LSE

Ideally, this center would be part of a university setting so that it could interact with both science and computer science departments, contribute to their programs, and in turn gain from that association.

Appendix 1: A very brief historical note

It would be useful to understand how we arrived at our current state in regards to informatics and automation in science. That will make it easier to understand what we need to do to make advancements. There is one key point to take away from this: in the history of lab automation, products weren’t developed according to a grand plan[d] but rather to meet perceived needs and opportunities. Thought processes in this vein have likely included:

  • “Here’s a problem that needs to be solved.”
  • “If I can figure out how to do X, then I can accomplish Y.”
  • “There’s a business opportunity in building product concept X, which will help people do Y and Z."

Sometimes these ideas were voiced by lab personnel, but most of the time they were the result of someone seeing a problem or opportunity and taking the initiative to address it.

1.1 Collecting data from instruments

In the late 1960s and early 1970s, instrument companies recognized that connecting a computer to the analog output of an instrument would help lab personnel capture and process data. The form that this product development took depended on how the developer saw the problem. We’re going to look at chromatography[e] as an example for several reasons: it received the most attention for automation, it’s a data-rich technique that took considerable manual effort to analyze, and it was and still is one of the most popular instrumental techniques in chemical labs. The product solutions provided by vendors reflected the technology available and their view of the problem that needed to be solved.

The analysis (Figure A1) depends on recognizing and quantifying a series of peaks, each of which represents the amount of a component in a mixture of materials. The size of the peak (measured by area, which is preferred, or by height with respect to a baseline) helps quantify the amount, and the time it takes the peak to appear can be used to help identify the component.


FigA1 Liscouski DirectLabSysOnePerPersp21.png

Figure A1. Illustration of peaks from chromatograph. Source: public domain

Reference standards are prepared and run along with the samples. The peak response of the standards and their corresponding concentrations are used to draw a calibration curve, and the samples are quantified by comparing peak sizes against that curve (Figure A2).


FigA2 Liscouski DirectLabSysOnePerPersp21.png

Figure A2. Samples are quantified by comparison to a calibration curve
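
As a minimal sketch of that calibration step (with hypothetical standard and sample values), a least-squares fit of peak area against concentration might look like this:

    # Fit peak area vs. known concentration for the standards, then
    # quantify a sample from the fitted line. Values are hypothetical.
    def fit_line(xs, ys):
        n = len(xs)
        mx, my = sum(xs) / n, sum(ys) / n
        slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
                sum((x - mx) ** 2 for x in xs)
        return slope, my - slope * mx        # slope, intercept

    # Standards: (concentration, peak area)
    conc = [1.0, 2.0, 5.0, 10.0]
    area = [102.0, 205.0, 498.0, 1010.0]
    slope, intercept = fit_line(conc, area)

    sample_area = 356.0
    print(f"estimated concentration: {(sample_area - intercept) / slope:.2f}")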

The solutions reflected both the user’s input and the vendor's observations, along with the effects of being too close to the problem and not seeing the whole picture. In that same timeframe, computing was expensive and you had to have a lot of processing done to justify the costs. Otherwise you dropped down to microprocessors and scaled back the size of the problem you could tackle. The microprocessor of choice was the Intel 4004, which was superseded by the Intel 8008 in 1972.

With computing, the chromatographer could detect peaks and quantify the peak height and area, later printing the results on a strip of paper. This was a big help to the chromatographer since determining peak size, and area in particular, was a major hassle. Prior to computerized methods, chromatographers were using:

  • mechanical integrators built into the strip chart recorder (that recorded the chromatograph output), which were hard to read and didn’t provide for baseline corrections (critical for accurate results);
  • a planimeter, which was time-consuming and demanded careful attention;
  • a cut-and-weigh method, where the chromatographer literally cut out a copy of the peak and weighed it on a balance, cutting out a reference area, and comparing it to a calibration chart constructed from a similar procedure; and
  • a method that had the chromatographer counting the boxes the peak enclosed on the chart paper, which was not very accurate, though better than peak height in some cases.

Having the quantified peaks printed on a piece of paper meant that the analyst could move quickly to drawing the calibration curve and evaluating the results. However, from the standpoint of being a laborious, time-consuming task, this is only a piece of the problem (a major one, but only a part). Some users connected the output of the integrators (RS-232 serial ASCII) to minicomputers, transferred the integrator information, and completed the analytical process in the larger machines. Several integrators could be connected to a mini-computer so the cost per instrument was reduced. This was a step toward a better solution, but only a step.

Computer vendors wanted to participate in the same market, but the cost of minicomputers put them at a disadvantage unless they could come at it from a different perspective. They looked at the entire data processing problem, including the points mentioned above, plus getting the computer to compute the calibration curve, complete the analysis, print out a report, and store the report and the data for later retrieval (integrators didn’t have that capability). They could also store the raw digitized instrument output for later analysis and display. The cost per instrument dropped when computer vendors began connecting multiple instruments to one system, some from different labs. Nelson Analytical used a local-to-the-instrument box for data collection and control and then forwarded those information packets to a central system for processing. This bigger-picture view of the problem greatly reduced the workload on the analyst. As computing costs dropped and the power of microchips increased, several different approaches emerged from different vendors that had varied perspectives on computing. Most took the big-picture view but worked on a one-instrument-one-computer-station approach, which benefited small labs since they didn’t have to invest in a minicomputer.

The low cost of microprocessors more readily allowed digital systems to join the lab, to the point where almost every lab device had a computer chip in it (“digital” was a strong marketing point). Now that we had lots of digital data sources, what was the lab going to do with them?

1.2 The beginning of laboratory informatics systems

In the early 1970s, PerkinElmer described its instrument supporting computers as “data stations.” Then they announced the “laboratory information management system” or "LIMS," and the next level of informatics hit the lab market. However, “laboratory information management system” was a problematic name for a product that did sample and test tracking. The customers thought it would take into account all of a lab's information, including personnel records, scheduling, budgets, documents, anything that “information” could be attached to, ultimately promising more than it could deliver. It took some time, but eventually something like that happened. From a marketing standpoint, it got people’s attention.

Several other vendors, consulting companies, startups, and computer vendors began LIMS development projects (computer vendors felt that database systems were their turf). This was viewed as a strategic offering: the testing lab's operations would revolve around the LIMS, and that gave the LIMS vendor whose product was chosen a strong marketing position in that company.

The introduction of the LIMS eventually opened the door to other informatics applications. The major classes and functions of said applications are shown in Figure A3.


FigA3 Liscouski DirectLabSysOnePerPersp21.png

Figure A3. The starting point for major classes of lab informatics systems and the lab functions they addressed

Missing from the chart are user-developer tools like LabView from National Instruments that enabled users to develop data acquisition and control applications via a graphical user interface.

The term “electronic laboratory notebook” or "ELN" has had an interesting history. It’s been applied to at least three types of software before its most current iteration. Laboratory Technologies, Inc. first created LABTECH Notebook, a PC-based software package designed to assist users with the communication between a computer and its lab instruments via RS-232 connections. Then there was Wolfram's Mathematica software, an electronic notebook for advanced mathematics. And finally there was VelQuest's SmartLAB, an ELN for conducting analyses and recording information from laboratory procedures, becoming the first laboratory execution system (LES).

Figure A3 showed a nice clean, somewhat idealized, starting point for the introduction of lab informatics. That didn’t last long. Vendors saw what their competition was doing and the opportunities to expand their products' capabilities (and market acceptance). What were clean, neat functionality silos became more complex products designed to attract the attention of scientists and laboratorians (Figure A4).


FigA4 Liscouski DirectLabSysOnePerPersp21.png

Figure A4. Lab informatics capability expansion

These expanded capabilities meant that a single-vendor solution could address more of a lab’s needs, simplifying support, installation, and so on through the use of a single software package. It did, however, make product and vendor selection more of a challenge; you really had to know what you needed. It also raised questions of cross-product compatibility: what instruments connected easily to what LIMS, ELN, or SDMS? If it wasn’t easy, what did you do? Third-party intermediate systems were eventually developed to translate instrument communications, much as similar intermediaries had done for database systems.

While all this was going on in the expanding digital laboratory space, people still had things to do on the lab bench.

1.3 Electro-mechanical robotics

During the 1970s and 80s, vendors noted the amount of time spent doing repetitive tasks at the lab bench. This resulted in two different approaches to product development:

  • Dedicated-function robotics: The development of auto-samplers, auto-titrators, and similar devices that were used to carry out a specific task
  • General-purpose robotics: The development of an elaborate kit the user had to assemble to robotically complete a task; not too different from a computer programming language: the components are there, you just have to organize them to make something useful happen

Although each of these approaches was potentially useful, they each presented the user community with different sets of problems.

The dedicated-function robotics device generally worked well; that wasn’t the issue. The problem arose when they had to connect to other components and instruments. The use of an auto-sampler, for example, required the user to adjust their sample preparation so that the material to be analyzed wound up in the appropriate sample vials. This may have required adjusting the procedure to accommodate a different set of mechanics than they were used to, e.g., sealing the vials or choosing the proper vial for an application. Auto-samplers feed samples into instruments so there is a matter of setting up control signals for proper coordination with instruments and instrument data systems.

Other devices such as auto-titrators required a reservoir of samples (in the proper vial formats), and the vials had to be organized so that an analysis on sample ID456 was in fact that sample and that the order didn’t get mixed up. The data output could also be a problem. The default for many vendors was a spreadsheet-compatible file, though it was up to the user to make sense of it and integrate it into the workflow. This was probably the best compromise available until more formalized data interchange and communications standards became available. The vendor had no idea how a particular device was going to fit into a procedure's workflow or what was going to happen to the data further downstream, so a CSV file format as a solution was simple, flexible, and easy to work with. It also meant more work on the user's part on the integration front, representing another place where changes may have to be made if a device is replaced in the future.
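
The following is a small sketch of the integration work this implies, parsing a hypothetical auto-titrator CSV export into records keyed by sample ID; the column names are illustrative, as real exports vary by vendor:

    import csv, io

    # A hypothetical auto-titrator export; real column names vary by vendor.
    export = """sample_id,titrant_ml,result_pct
    ID456,12.35,98.2
    ID457,11.90,97.6
    """

    results = {row["sample_id"].strip(): float(row["result_pct"])
               for row in csv.DictReader(io.StringIO(export.replace("    ", "")))}
    print(results)  # {'ID456': 98.2, 'ID457': 97.6}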

With the dedicated-function device, which could be as simple as a balance (the process of reading a force exerted on a sensor and converting it to a weight is an automated process), we have potentially isolated elements of automation that are either interconnected by programming or by someone reading data and re-entering it. However, there is no “plug-and-go” capability.

As for general-purpose robotics devices, they could be broken down into two categories: those that were successful in the broad marketplace and those that weren't. In the 1980s, three vendors were competing for attention in the laboratory robotics market: Zymark, Hewlett-Packard, and PerkinElmer. Each of their general-purpose systems had a moveable arm that could grip items and be programmed to carry out lab procedures. Yet they faced a daunting task: the process of gripping items was problematic. The robot didn’t work with just one gripper or "hand"; it required multiple interchangeable hands that had to be swapped depending on the next sequence of actions and what items had to be grasped. Those items were typically things designed for human use, including flasks, test tubes, and a variety of other equipment. This problem was non-trivial, a result of having the robot work with equipment designed for humans so that the lab didn’t have to buy duplicate components, which would have raised problems of cost and of carrying on work if the robot didn’t function. Another issue with the grippers and their holders was that they took up space. The Zymark system, for example, had a central robotic arm that could reach items within the shell of a hemispherical volume; grippers took up space that could be used by other devices. Some people were successful in building systems, but not enough to make the products economically viable. In many respects, the core robotics technologies should have been successful, but the peripheral devices were not up to the necessary operational levels; good for humans, lacking for automation.

Another problem was education. The vendors would run courses to train people to use their systems. However, successful implementations required engineering, expertise, and experience, not just experimentation. Further, robust systems (those capable of running days on end with built-in error detection and correction) were rare but necessary to avoid processing samples into junk. People had the need and the desire to make working systems, but not the process engineering skills to create successful implementations.

The life sciences market, and biotechnology in particular, took a different approach to robotics: they standardized the sample carrier format in the form of the microplate. The physical dimensions of the microplate were constant while the number of sample cells could vary. The image on the right of Figure A5, for example, shows plates with 96, 384, and 1,536 wells, with even higher densities available.


FigA5 Liscouski DirectLabSysOnePerPersp21.png

Figure A5. A microplate sample holder (left); examples of 96, 384, and 1,536 well holders, care of WikiMedia Commons (right)

What did this mean for robotics? Every device that interacted with a microplate could be reliably designed with a single gripper size and a single aperture size for plates to be inserted. Systems would “know” where in the sample space the sample wells were. In short, standardization made it easier to build equipment for a marketplace. And a lot of equipment was built and put into successful use.[f]

The combination of robotics and microplates also worked well in biotechnology because of the prevalence of liquid samples; this point cannot be stressed enough. We are good at working with fluids, including measuring amounts, transferring them, dispensing them, and so on. Solids can be a problem because of cross-contamination if equipment isn’t cleaned, if the solid is tacky, if the particle size isn’t right for the equipment, if it has a tendency to roll, and so on. Solids also have the potential for problems with homogeneity. (You can get layering in liquids, but that can be solved in “two shakes.”)

1.3.1 Two approaches to sample processing with robotics

Broadly speaking, there are two approaches to sample processing with robotics: batching and sample-at-a-time runs. The microplate is an example of batch processing. Each tray is handled (dilution, sealing, reading, etc.) at a station, and you don’t have an opportunity to evaluate the results until the analysis for a tray is completed and reported. Other methods such as ICP, mass spectrometry, chromatography, etc. that use auto-injectors can behave in a similar fashion, or be evaluated on a sample-at-a-time basis. It depends on how the process is planned and implemented, and at what stage the results are evaluated.

The sample-at-a-time procedure offers an interesting alternative pathway for automated analysis. In addition to the techniques noted, this process can include auto-titrators and others. Here each sample is processed in stages, either one at a time or in overlapping stages. While one sample is being processed in stage 3 (adding a solvent, for example), a sample in stage 4 is being injected into an instrument. This means that the results for one sample can be known before the next one is processed. The benefit is that systematic errors can be caught and corrected before all samples are processed, reducing waste and improving overall efficiency.
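
A toy illustration of that advantage, with hypothetical values and limits, might look like this:

    # Check each result as it comes off the instrument and stop the run
    # when a systematic error appears, rather than discovering it only
    # after a whole batch has been processed.
    def process(sample_id):
        # Stand-in for the staged preparation/injection/measurement.
        return {"S-3": 42.0}.get(sample_id, 5.0)  # S-3 simulates a bad result

    for sample_id in ["S-1", "S-2", "S-3", "S-4", "S-5"]:
        result = process(sample_id)
        if not 4.0 <= result <= 6.0:              # expected range for the method
            print(f"{sample_id}: {result} out of range; halting run")
            break
        print(f"{sample_id}: {result} ok")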

1.4 Sample storage organization

Before we leave the topic of samples, we need to address the subject of sample storage organization. This is important, as poor organization and management can nullify any gains from automation. There are two aspects to consider: the organization process itself and the physical nature of the samples.

In the life sciences, biobanking and refrigerated storage management have been actively discussed topics. For example, a white paper commissioned by Titian Software Ltd. titled The Essential Guide to Managing Laboratory Samples goes into a fair amount of depth on the subject.[18] And if you Google “laboratory sample storage management,” you’ll get a sizeable listing of material, including peer-reviewed work by Redrup et al. titled "Sample Management: Recommendation for Best Practices and Harmonization from the Global Bioanalysis Consortium Harmonization Team.”[19] The abstract of the work of Redrup et al. reads in part[19]:

<blockquote>The importance of appropriate sample management in regulated bioanalysis is undeniable for clinical and non-clinical study support due to the fact that if the samples are compromised at any stage prior to analysis, the study results may be affected. Health authority regulations do not contain specific guidance on sample management; therefore, as part of the Global Bioanalysis Consortium (GBC), the A5 team was established to discuss sample management requirements and to put forward recommendations.</blockquote>

In short, you have to have control of your samples and be able to ensure their integrity.
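As a small illustration of what that control can look like in software, the sketch below tracks a sample's location and transfer history. The record fields and location scheme are hypothetical; a real LIMS or biobanking system adds security, audit trails, and environmental monitoring on top of this basic idea.

<syntaxhighlight lang="python">
# Hypothetical sketch of sample custody tracking. Field names and the
# location scheme are invented; real systems add security, audit trails,
# and temperature monitoring on top of this idea.

from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class Sample:
    barcode: str
    location: str                      # e.g., "Freezer-2/Rack-B/Slot-14"
    history: list = field(default_factory=list)

    def move(self, new_location, operator):
        """Record every transfer so the sample's whereabouts and handling
        chain can always be reconstructed."""
        self.history.append({
            "when": datetime.now(timezone.utc).isoformat(),
            "from": self.location,
            "to": new_location,
            "by": operator,
        })
        self.location = new_location

s = Sample("S-000123", "Receiving")
s.move("Freezer-2/Rack-B/Slot-14", operator="jdoe")
print(s.location, len(s.history))      # Freezer-2/Rack-B/Slot-14 1
</syntaxhighlight>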

====1.4.1 The nature of incoming samples====

Considerations about the nature of incoming samples don't get as much press as they deserve. For example, if you are in quality control, the nature of your incoming samples is going to be consistent and determined by the production process; in fact, sample preparation is likely to be integrated with the test procedure's automation. If the samples are fluids, the impact on an automation process may be small compared to working with solids. One complicating factor with fluids is the need to remove extraneous material so that downstream problems aren't created. That removal may be accomplished by filtering, settling, centrifugal separation (continuous or batch), or other means, depending on the amount of material and its composition.

Working with solids raises several other issues. First, does the material have to be reduced to a coarse or fine powder before processing? This may be needed to permit precise weighing or to provide a large surface area for solvent extractions. Second, is fabrication of a sample piece required? Some mechanical properties testing of plastics requires molded test bars, and other tests may require pressing films for spectral analysis. Further issues are industry-specific, for example working with hazardous materials (including toxic substances and radioactive samples) or with samples requiring careful environmental and/or security controls.

In many labs, automation is viewed as something that happens after the samples are logged in, yet that doesn't have to be the case. The following paragraphs focus on testing labs because they are the most likely to benefit from automation: they meet the two key criteria for automation implementation, namely stable procedures and sufficient workload to justify the cost of automation development. Research labs can meet those criteria as well; in the end, it's simply a matter of the nature of the work.

Testing labs (e.g., quality control and non-routine internal and contract labs) can take steps to streamline their sample handling operations, though unfortunately at the expense of someone else's labor, so make it worth their effort. For example:

*Those submitting samples can be required to pre-log them. This can be accomplished by giving submitters access to restricted accounts on a LIMS that let them log samples in, and little more (as sketched below).
*Sample containers can be standardized and barcoded. The barcodes can then be required as part of the logging process; they are critical to identifying samples that have reached the lab, as well as tracking their physical location.
*Sample container sizes and related container forms can be standardized, ideally matched to the requirements for sample handling in automated systems. (Unless you supply them, your submitters may not have the tools for sealing sample vials, etc.)
*All this cooperative effort to standardize sample reception can be incentivized with price breaks and is likely to lead to faster turnaround time (TAT).

In other words, give submitters an incentive to work with you, the lab. They are supplying labor that could potentially impact their own productivity, so give them a good reason to work with you.
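A minimal sketch of the pre-log validation mentioned above follows. The field names, container list, and workflow are invented for illustration; an actual LIMS defines its own registration interface and business rules.

<syntaxhighlight lang="python">
# Hypothetical submitter-side pre-log check. The required fields and the
# approved-container list are invented; a real LIMS defines its own
# registration interface and validation rules.

REQUIRED_FIELDS = ("barcode", "submitter", "matrix", "container", "tests")
APPROVED_CONTAINERS = {"20mL-vial", "250mL-jar", "1L-bottle"}

def prelog_sample(record):
    """Validate a submitter's pre-log record before it enters the queue."""
    missing = [f for f in REQUIRED_FIELDS if not record.get(f)]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    if record["container"] not in APPROVED_CONTAINERS:
        raise ValueError("non-standard container; see submission guidelines")
    return {**record, "status": "pre-logged"}

entry = prelog_sample({
    "barcode": "S-000124", "submitter": "plant-3", "matrix": "liquid",
    "container": "20mL-vial", "tests": ["assay-A"],
})
print(entry["status"])                 # pre-logged
</syntaxhighlight>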

Testing operations can, as a result, see further benefits:

*Sample storage management and access are improved.
*You'll be better informed of incoming work.
*Automation and scheduling are enhanced when they begin with the requester instead of post-login.

My first professional lab experience was in polymers in an analytical research group (some routine work, a lot of non-routine work). Samples would arrive in a variety of containers (e.g., bags, jars, test tubes, envelopes, fabricated parts taped to sample request forms). Sample matrices would range from pellets, waxes, and powders to liquids, gas cylinders, rolls of film, and more. Classifying those samples, figuring out where to put them, locating them, and preparing them for work (which often involved grinding them in a Wiley mill) was a major time sink, sufficiently so that the actual analysis was a smaller part of the overall workflow. As such, anything you can do to streamline that process will help productivity and contribute to a successful automation project.

==Abbreviations, acronyms, and initialisms==

CDS: Chromatography data system

ELN: Electronic laboratory notebook

EVOP: Evolutionary operations

ICP: Inductively coupled plasma (spectrometry)

K/I/D: Knowledge, information, and data

LIMS: Laboratory information management system

LOF: Laboratory of the future

LSE: Laboratory systems engineer

NMR: Nuclear magnetic resonance

QC: Quality control

ROI: Return on investment

SDMS: Scientific data management system

Std.SP: Standard sample program

TAT: Turn-around time

TLA: Total laboratory automation

==Footnotes==

  a. Where the amount of titrant added is adjusted based on the response to the previous addition. This should yield faster titrations with increased accuracy.
  b. In prior writings, the term "scientific manufacturing" was used. The problem with that term is that we're not making products but instead producing results. Plus, "manufacturing results" has some negative connotations.
  c. Wikipedia says this of calendering: "A calender is a series of hard pressure rollers used to finish or smooth a sheet of material such as paper, textiles, or plastics. Calender rolls are also used to form some types of plastic films and to apply coatings."
  d. See the discussion on clinical chemistry in the main text.
  e. If you're not familiar with the method, Wikipedia's chromatography page is a good starting point.
  f. See Wikipedia's microplate article for more.

==About the author==

Initially educated as a chemist, author Joe Liscouski (joe dot liscouski at gmail dot com) is an experienced laboratory automation/computing professional with over forty years of experience in the field, including the design and development of custom and commercial automation systems, LIMS, robotics, and data interchange standards. He also consults on the use of computing in laboratory work. He has held symposia on validation and presented technical material and short courses on laboratory automation and computing in the U.S., Europe, and Japan. He has worked and consulted in pharmaceutical, biotech, polymer, medical, and government laboratories. His current work centers on helping companies establish planning programs for lab systems, develop effective support groups, and apply automation and information technologies in research and quality control environments.

==References==

  1. Liscouski, J. (2015). Computerized Systems in the Modern Laboratory: A Practical Guide. PDA/DHI. pp. 432. ASIN B010EWO06S. ISBN 978-1933722863. 
  2. Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J. et al. (2016). "The FAIR Guiding Principles for scientific data management and stewardship". Scientific Data 3: 160018. doi:10.1038/sdata.2016.18. PMC 4792175. PMID 26978244. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792175. 
  3. Fish, M.; Minicuci, D. (2005). "Overcoming the Challenges of a LIMS Migration" (PDF). Research & Development 47 (2). http://apps.thermoscientific.com/media/SID/Informatics/PDF/Article-Overcoming-the-Challanges.pdf. 
  4. Fish, M.; Minicuci, D. (1 April 2013). "Overcoming daunting business challenges of a LIMS migration". Scientist Live. https://www.scientistlive.com/content/overcoming-daunting-business-challenges-lims-migration. Retrieved 17 November 2021. 
  5. "Overcoming the Challenges of Legacy Data Migration". FreeLIMS.org. CloudLIMS.com, LLC. 29 June 2018. https://freelims.org/blog/legacy-data-migration-to-lims.html. Retrieved 17 November 2021. 
  6. "AUTO03 Laboratory Automation: Communications With Automated Clinical Laboratory Systems, Instruments, Devices, and Information Systems, 2nd Edition". Clinical and Laboratory Standards Institute. 30 September 2009. https://clsi.org/standards/products/automation-and-informatics/documents/auto03/. Retrieved 17 November 2021. 
  7. Sarkozi, L.; Simson, E.; Ramanathan, L. (2003). "The effects of total laboratory automation on the management of a clinical chemistry laboratory. Retrospective analysis of 36 years". Clinica Chimica Acta 329 (1–2): 89–94. doi:10.1016/S0009-8981(03)00020-2. 
  8. "Total Laboratory Automation - Michael Bissell, MD, Ph.D". YouTube. University of Washington. 15 July 2014. https://www.youtube.com/watch?v=RdwFZyYE_4Q. Retrieved 17 November 2021. 
  9. "MassLynx LIMS Interface" (PDF). Waters Corporation. November 2016. https://www.waters.com/webassets/cms/library/docs/720005731en%20Masslynx%20LIMS%20Interface.pdf. Retrieved 17 November 2021. 
  10. "ASTM E1578 - 18 Standard Guide for Laboratory Informatics". ASTM International. 2018. https://www.astm.org/Standards/E1578.htm. Retrieved 17 November 2021. 
  11. Schmitt, S., ed. Assuring Data Integrity for Life Sciences. DHI Publishing. pp. 385. ISBN 9781933722979. https://www.dhibooks.com/assuring-data-integrity-for-life-sciences. 
  12. "ASTM E2078 - 00(2016) Standard Guide for Analytical Data Interchange Protocol for Mass Spectrometric Data". ASTM International. 2016. https://www.astm.org/Standards/E2078.htm. Retrieved 17 November 2021. 
  13. "ASTM E1948 - 98(2014) Standard Guide for Analytical Data Interchange Protocol for Chromatographic Data". ASTM International. 2014. https://www.astm.org/Standards/E1948.htm. Retrieved 17 November 2021. 
  14. "MilliporeSigma Acquires BSSN Software to Accelerate Customers’ Digital Transformation in the Lab". Merck KGaA. 6 August 2019. https://www.emdgroup.com/en/news/bssn-software-06-08-2019.html. Retrieved 17 November 2021. 
  15. McDonald, Robert S.; Wilks, Paul A. (1 January 1988). "JCAMP-DX: A Standard Form for Exchange of Infrared Spectra in Computer Readable Form". Applied Spectroscopy 42 (1): 151–162. doi:10.1366/0003702884428734. ISSN 0003-7028. http://journals.sagepub.com/doi/10.1366/0003702884428734. 
  16. "Welcome to GAML.org". GAML.org. 22 June 2007. http://www.gaml.org/default.asp. Retrieved 17 November 2021. 
  17. CNN Business (17 April 2015). "Micro robots drive Bayer's high-tech vision". YouTube. https://www.youtube.com/watch?v=_PuTKH7143c. Retrieved 17 November 2021. 
  18. Oxer, M.; Stroud, T. (2017). "The Essential Guide to Managing Laboratory Samples". Titian Software Ltd. https://www.titian.co.uk/the-essential-guide-to-managing-laboratory-samples-web. Retrieved 17 November 2021. 
  19. Redrup, Michael J.; Igarashi, Harue; Schaefgen, Jay; Lin, Jenny; Geisler, Lisa; Ben M'Barek, Mohamed; Ramachandran, Subramanian; Cardoso, Thales et al. (1 March 2016). "Sample Management: Recommendation for Best Practices and Harmonization from the Global Bioanalysis Consortium Harmonization Team". The AAPS Journal 18 (2): 290–293. doi:10.1208/s12248-016-9869-2. ISSN 1550-7416. PMC 4779093. PMID 26821803. http://link.springer.com/10.1208/s12248-016-9869-2.