LETTER
Extraordinary Command Line: Basic Data Editing Tools for Biologists Dealing with Sequence Data
Magda Mielczarek1, 2, *
Article Information
Identifiers and Pagination:
Year: 2020Volume: 13
First Page: 137
Last Page: 145
Publisher ID: TOBIOIJ-13-137
DOI: 10.2174/1875036202013010137
Article History:
Received Date: 07/9/2020Revision Received Date: 18/11/2020
Acceptance Date: 30/11/2020
Electronic publication date: 31/12/2020
Collection year: 2020
open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Abstract
The command line is a standard way of using the Linux operating system. It contains many features essential for efficiently handling data editing and analysis processes. Therefore, it is very useful in bioinformatics applications. Commands allow for rapid manipulation of large ASCII files or very numerous files, making basic command line programming skills a critical component in modern life science research. The following article is not a guide to Linux commands. In this manuscript, in contrast to many various Linux manuals, we aim to present basic command line tools helpful in handling biological sequence data. This manuscript provides a collection of simple and popular hacks dedicated to users with very basic experience in the area of the Linux command line. It includes a description of data formats and examples of editing of four types of data formats popular in bioinformatics applications.