Skip to content

Latest commit

 

History

History
159 lines (116 loc) · 8.51 KB

02_command_line.md

File metadata and controls

159 lines (116 loc) · 8.51 KB

Introduction to the Command Line

This document outlines basic usage of the command line. For Linux and Mac users, these commands should work in Terminal. For Windows users, these should work in Git Bash.

What is the command line?

The Command Line Interface (CLI) is a way of interacting with your computer using text-based commands. This is different from the way most people interact with their computers, using their mouse and a Graphical User Interface (GUI).

Why should I use it?

Once you become comfortable with the basics, it can be a more powerful way to use your computer. You're able to do many things more quickly and programatically.

General format for commands

<command> -<options> <arguments>

  • <command> is the action we want the computer to take
  • <options> (or "flags") modify the behavior of the command
  • <arguments> are the things we want the command to act on

For Linux and Mac users, you can get view the manual for a command by typing man <command>. For Windows users, you can view the help page by typing <command> --help.

Tips

  • If there are spaces in file or directory names, use a "" to "escape" the space characters, or just put the entire file path in quotes.
  • After typing the first few letters of a file or directory name, you can hit Tab to auto-complete the name. (This often auto-escapes spaces for you.)
  • Use the up and down arrow keys to navigate previously entered commands.

File paths

A relative file path specifies the path to a file, taking into account your current working directory. For example, if you were to give someone "relative" directions to your house, you would give them directions from their current location (the relative path from where they are to where you are).

An absolute file path specifies the complete path to a file, ignoring your current working directory. For example, if you were to give someone "absolute" directions to your house, you would start by telling them to be on earth, then go to your continent, then go to your country, then go to your region, etc.

Basic commands

pwd
  • prints working directory (the directory you are currently in)
ls
  • lists files and subdirectories in your working directory
  • ls -a lists all files, including hidden files
  • ls -l lists the files in a long format with extra information (permissions, size, last modified date, etc.)
  • ls * also lists the contents of subdirectories (one level deep) in your working directory
  • ls <path> lists files in a specific directory (without changing your working directory)
clear
  • clears all output from your console
cd
  • cd <path> changes directory to the path you specify, which can be a relative path or an absolute path
  • cd .. moves you "up" one directory (to the parent directory)
  • cd moves you to your "home" directory
mkdir
  • mkdir <dirname> makes a new directory called <dirname>
touch
  • touch <filename> creates an empty file called <filename>
  • This is useful for creating empty files to be edited at a later time.
  • You can create multiple empty files with a single command: touch <filename1> <filename2> <filename3> ...
rm -i
  • rm <filename> removes (deletes) a file permanently
  • rm -i <filename> removes files in interactive mode, in which you are prompted to confirm that you really want to delete the file. It's best to always use rm -i.
  • rm -ir <dirname> removes a directory and recursively deletes all of its contents
mv
  • mv <filename> <new path> moves a file from its current location to <new path>
  • mv <filename> <new filename> renames a file without changing its location
cp
  • cp <filename> <new path> copies a file from its current location to <new path>, leaving the original file unchanged
  • cp <filename> <new filename> copies a file without changing its location

Pre-class exercise

  • Open your command line interface.
  • Navigate to your Desktop, and confirm you are there:
    • Print your working directory (it should end with Desktop).
    • List your files and subdirectories (they should match what you see on your Desktop).
  • Create a directory called project.
  • Navigate to the project directory, and create the following files in it: draft_paper.md, plot1.png, plot2.png.
  • Create two subdirectories in the project directory: code, data
  • Navigate to the code subdirectory, and create the following files in it: processing.py, exploration.py.
  • Navigate to the data subdirectory, and create the following files in it: original.csv, clean.csv, other.csv.
  • Make a copy of draft_paper.md called final_paper.md.
  • Rename plot1.png as scatterplot.png, and rename plot2.png as histogram.png.
  • Create a subdirectory called viz, and then move scatterplot.png and histogram.png to viz.
  • Delete other.csv from the data subdirectory.
  • Navigate back to project, and then print out (with a single command) all of its files, subdirectories, and the contents of those subdirectories. The output should look similar to this image.
  • Viewing this collapsible tree diagram may help you to visualize the directory structure that we have created.

Intermediate commands

head
  • head <filename> prints the head (the first 10 lines) of the file
  • head -n20 <filename> prints the first 20 lines of the file
  • This is useful for previewing the contents of a large file without opening it.
tail
  • tail <filename> prints the tail (the last 10 lines) of the file
cat
  • cat <filename> prints the entire file
less
  • less <filename> allows you to page or scroll through the file
  • Hit the spacebar to go down a page, use the arrow keys to scroll up and down, and hit q to exit.
wc
  • wc <filename> returns the count of lines, words, and characters in a file
  • wc -l <filename> only counts lines, wc -w <filename> only counts words, and wc -c <filename> only counts characters
  • A "word" is defined as any set of characters delimited by a space.
find
  • find <path> -name <name> will recursively search the specified path (and its subdirectories) and find files and directories with a given <name>
    • Use . for the <path> to refer to the working directory.
  • For the <name>, you can search for an exact match, or use wildcard characters to search for a partial match:
    • * specifies any number of any characters, such as find . -name *.py or find . -name *data*.*
    • ? specifies one character, such as find . -name ??_*.*
grep
  • grep <pattern> <filename> searches a file for a regular expression pattern and prints the matching lines
    • The pattern should be in quotation marks to allow for multiple words.
    • The pattern is case-sensitive by default, but you can use the -i option to ignore case.
    • You can use wildcards in the filename to search multiple files, but it only searches the working directory (not subdirectories).
  • grep -r <pattern> <path> does a recursive search of the path (checks subdirectories) for matches within files
    • Use . for the <path> to refer to the working directory.
  • grep <pattern> does a global search (of your entire computer) for matches
    • Hit Ctrl + c if you want to cancel the search.
  • Much more complex string-matching patterns can be used.
|
  • <command 1> | <command 2> pipes the results from <command 1> into <command 2>, and then the results of <command 2> are printed to the console
>
  • <command> > <filename> takes the output of <command> and saves it in <filename>
  • This will overwrite the file if it already exists.
>>
  • <command> >> <filename> takes the output of <command> and appends it to <filename>
  • This will create the file if it does not yet exist.

Advanced commands

cut
  • cut -f1,2 <filename> cuts a tab-delimited file into columns and returns the first two fields
  • cut -f1,2 -d, <filename> indicates that the file is delimited by commas
sort
  • sort <filename> sorts a file by the first field
uniq
  • uniq <filename> discards all but one of the successive identical lines (thus it only keeps unique lines)
  • uniq -c <filename> also records the count of the number of occurrences
  • Because lines must be successive to be counted as identical, you will usually use sort before uniq.