Git for n00b
A detailed tutorial of Git for people knowing very few about versions systems. You’ll understand utility of such program and how we use modern version control system. I try to stay as pragmatic as possible.
Begin with conclusion
Here is the list of sufficient and necessary command to use Git. There is very few. It is normal not to understand immediately but it is to gives you an idea. Even if this article is long, 95% of Git usage is in these 7 commands:
Get a project from the web:
git clone ssh://server/path/to/project
Everyday Git usage:
# get modifications from other git pull # read what was done git log # Make local changes to files hack, hack, hack... # list the modified files git status # show what I've done git diff # tell git to version a new file git add new/file # commit its own modifications # to its local branch git commit -a -m "Fix bug #321" # send local modifications to other git push
This article is written for people knowing very few about versionning systems. It is also written for those who had didn’t followed progress since CVS or subversion (SVN). This is why, in a first time I’ll explain quickly which are the goal of such systems. Secondly, I’ll explain how to install and configure Git. Then, I give the command for each feature a DCVS must have.
Git for what?
If you just want to use Git immediately, just read dark part. You read this part later to understand correctly foundations of version systems and not doing strange things.
Git is a DCVS, which means a Decentralized Concurrent Versions System. Let’s analyze each part of this long term:
Firstly, versions system manage files. When somebody work with files without a versions system, the following happens frequently:
When you modify a somehow critical file you don’t want to loose. You copy naturally this file with another name. For example:
$ cp fichier_important.c fichier_important.c.bak
In consequence of what, the new file, play the role of backup. If you break everything, you can always return in the last state by overwriting your modifications. Of course, this method is not very professional and is a bit limited. If you make many modifications, you’ll end with many files with strange names like:
fichier_important.c.bak fichier_important.c.old fichier_important.c.Bakcup fichier_important.c.BAK.2009-11-14 fichier_important.c.2009.11.14 fichier_important.c.12112009 old.fichier_important.c
If you want to make it works correctly, you’ll have to use naming convention. Files take many place even if you modify most of time only some lines.
Fortunately, versions system are here to help.
You only have to signal you want a new version of a file and the versions system will do the job for you. It will record the backup where it could be easily recovered. Generally, systems version do it better than you, making the backup only of the modified lines and not the total file.
Once upon a time versions were managed for each file separately. I think about CVS. Then it naturally appears projects are a coherent set of files. Recover each file separately was a tedious work. This is why versions number passed from files to the entire project.
It is therefore possible to say, “I want to get back three days earlier”.
What gives versions system? (I didn’t mention everything at all)
- automatic backups: back in time,
- gives the ability to see differences between each version,
- put a tag on some version to be able to refer to them easily,
- gives the ability to see an historic of all modifications. Generally the user must add a comment for each new version.
Version Systems are already useful to manage its own projects. They help to organize and resolve partially backup problems. I say partially because you have to backup your repository on a decent file system. But versions system are really interesting is on projects done by many people.
Let’s begin by an example, a two person project ; Alex and Beatrice. On a file containing a Lovecraft’s gods list:
Cthulhu Shubniggurath Yogsototh
Cthulhu Shubniggurath Soggoth Yogsototh
after that he send the file on the project server. Then on the server there is the Alex file:
A bit later, Beatrice who had not get the Alex file on the server make the modification:
Cthulhu Dagon Shubniggurath Yogsototh
Beatrice send her file on the server
Alex modification is lost. One more time, versions system are here to help.
A version system would had merge the two files at the time Beatrice send the file on the server. And like by magic, on the server the file would be:
Cthulhu Dagon Shubniggurath Soggoth Yogsototh
In real life, at the moment Beatrice want to send her modifications, the versions system alert her a modification had occurred on the server. Then she uses a command which pull the modification from the server to her local computer. And this command update her file. After that, Beatrice send again the new file on the server.
In what Concurrent Versions System help?
- get without any problem others modifications,
- send without any problem its own modifications to others,
- manage conflicts. I didn’t speak about it, but sometimes a conflict can occur (when two different people modify the same line on a file for example). SVC help to resolve such problem. More on that later,
- help to know who done what and when.
This word became popular only recently about CVS. And it mainly means two things:
First, until really recently (SVN), you’ll have to be connected to the distant server to get informations about a project. Like get the history. New decentralized systems work with a local REPOSITORY (directory containing backups and many informations linked to the versions system functionalities). Hence, one can view the history of a project without the need of being connected.
All instances of a project can live independently.
To be more precise, DCVS are base on the branch notion.
Practically, it has great importance. It means, everybody work separately, and the system help to glue all their work.
It is even more than just that. It help to code independently each feature and bug fixes. Under other system it was far more difficult.
I develop my project. I’m ameliorating something. An urgent bug is reported.
With a DCVS I can easily, get back to the version with the bug. Fix it. Send the fix. Get back to my feature work. And even, use the fix for the new version with my new feature.
In a not decentralized version system, doing such a thing is possible but not natural. Decentralization means it become natural to use a branch for each separable work.
Advantages given by DCVS:
- Ability to work offline,
- Ability to create many atomic patches,
- Help the maintenance of many different versions of the same application.
Let’s resume what we can easily do with DCVS:
- back in time,
- list differences between versions,
- name some versions to refer to them easily
- show history of modifications
- get others modifications,
- send its modifications to others,
- know who done what and when,
- conflicts management.
- Easily manipulate branches
Now let’s see how to obtain all these things easily with Git.
Before usage, configuration
Under Linux Ubuntu or Debian:
$ sudo apt-get install git
Under Mac OS X:
$ sudo port selfupdate $ sudo port install git-core
Save the following file as your
[color] branch = auto diff = auto status = auto [alias] st = status co = checkout br = branch lg = log --pretty=oneline --graph logfull = log --pretty=fuller --graph --stat -p unstage = reset HEAD # there should be an article on what this command do uncommit = !zsh -c '"if (($0)); then nb=$(( $0 - 1 )); else nb=0; fi; i=0; while ((i<=nb)); do git revert -n --no-edit HEAD~$i; ((i++)); done; git commit -m \"revert to $0 version(s) back\""' undomerge = reset --hard ORIG_HEAD conflict = !gitk --left-right HEAD...MERGE_HEAD # under Mac OS X, you should use gitx instead # conflict = !gitx --left-right HEAD...MERGE_HEAD [branch] autosetupmerge = true
You can achieve the same result using for each entry the command:
git config --global. Next, configure your name and your email. For example, if your name is John Doe and your email is
email@example.com. Launch the following commands:
$ git config --global user.name John Doe $ git config --global user.email firstname.lastname@example.org
Here it is. Base configuration is over. The file containing alias will help to type shorter commands.
Get a project
If a project is already versionned with Git you should have an
URL of the sources. Then use the following command:
$ cd ~/Projets $ git clone git://main.server/path/to/file
If there is no git server but you’ve got an
ssh access. Just replace the
ssh://user@host. In order not to type your password each time, use:
$ ssh-keygen -t rsa
Reply to question and do not enter a password. Then copy your keys to the distant server. This is not the safest way to do this. The safest being, using
The easiest way if you have
me@locahost$ ssh-copy-id ~/.ssh/id_rsa.pub email@example.com
me@locahost$ scp ~/.ssh/id_rsa.pub firstname.lastname@example.org: me@locahost$ ssh email@example.com password: firstname.lastname@example.org$ cat id_rsa.pub >> ~/.ssh/authorized_keys email@example.com$ rm id_rsa.pub firstname.lastname@example.org$ logout
Now you don’t need to write your password to access the
Creating a new project
Suppose you already have a project with files. Then it is really easy to version it.
$ cd /path/to/project $ git init $ git add . $ git commit -m "Initial commit"
Let do a small remark. If you don’t want to version every file. Typically intermediate compilation file, swap files… Then you need to exclude them. Just before launching the
git add . command. You need to create a
.gitignore file in the root directory of your project. This file will contain all exclude pattern. For example:
*.o *.bak *.swp *~
Now, if you want to create a repository on a distant server, it must not be in
bare mode. The repository will contain only versionning informations, but not the files of the project. To achieve that:
$ cd /path/to/local/project $ git clone --bare . ssh://server/path/to/project
Others will be able to get your modifications.
git clone ssh://server/path/to/project
Abstract of the second step
You now have a local directory on your computer. It is versionned and you can say it is, because there is a
.git directory at the root (and the root only) of your project. This directory contain all necessary informations for Git to version your project.
Now you only need to know how to use it.
Here we go!
Here is one from many way to use Git. This method is sufficient to work on a project. Not there is many other workflows.
Work with Git immediately:
- Get modification done by others
- See details of these modifications
- Many times:
- Make an atomic modification
- Verify details of this modification:
- Add some file to be versionned if necessary:
git add [file],
- Save you modifications
git commit -a -m "message",
- Send your modifications to others:
git push(redo a
git pullif push return an error).
With these few commands you can use Git. Even if it is sufficient, you need to know one more thing before really begin ; How to manage conflicts.
Conflicts can arise when you change the same line of code on the same file from another branch you’re merging. It can seems a bit intimidating, but with Git this kind of thing is really simple to handle.
You start from the following file
and you modify one line
Zoot the pure
except during this time, another user had also modified the same line and had done a
Zoot, just Zoot
Now when you do a:
$ git pull remote: Counting objects: 5, done. remote: Total 3 (delta 0), reused 0 (delta 0) Unpacking objects: 100% (3/3), done. From /home/yogsototh/tmp/conflictTest d3ea395..2dc7ffb master -> origin/master Auto-merging foo CONFLICT (content): Merge conflict in foo Automatic merge failed; fix conflicts and then commit the result.
foo now contains:
<<<<<<< HEAD:foo Zoot the pure ======= Zoot, just Zoot >>>>>>> 2dc7ffb0f186a407a1814d1a62684342cd54e7d6:foo
To resolve the conflict you only have to edit the file for example, writing:
Zoot the not so pure
and to commit
git commit -a -m "conflict resolved"
Why Git is cool?
Because with Git you can work on many part of some project totally independently. This is the true efficiency of decentralisation.
Each branch use the same directory. Then you can easily change your branch. You can also change branch when some files are modified. You can then dispatch your work on many different branches and merge them on one master branch at will.
git rebase you can decide which modifications should be forget or merged into only one modification.
What does it mean for real usage? You can focus on coding. For example, you can code, a fix for bug b01 and for bug b02 and code a feature f03. Once finished you can create a branch by bug and by feature. And finally you can merge these modifications on a main branch.
All was done to code and decide how to organize your versions after. In other VCS it is not as natural as in Git.
With Git you can depend of many different sources. Then, there is not necessarily a ‘master’ repository where everybody puts its modifications.
What changes the most with Git when you come from SVN, it’s the idea of a centralized project on one server. With Git many people could work on the same project but not necessarily on the same repository as main reference. One can easily fix a bug and send a patch to many different versions of a project.
Command for each functionality
- get others modifications,
- send modifications to others,
- get back in time,
- list differences between each version,
- name some versions in order to refer easily to them,
- write an historic of modifications,
- know who did what and when,
- manage conflicts,
- easily manage branches.
get others modifications
$ git pull
send modifications to others
$ git push
or more generally
$ git pull $ git push
get back in time
For all tree
$ git checkout
$ git revert
revert three version before (see my
$ git uncommit 3
Undo the las merge (if something goes wrong)
$ git revertbeforemerge
For one file
$ git checkout file $ git checkout VersionHash file $ git checkout HEAD~3 file
list differences between each version
list files being modified
$ git status
differences between last version files and local files
$ git diff
differences between some version and local files
$ git diff VersionHash fichier
name some version to refer to them in the future
$ git tag 'toto'
show historic of modifications
$ git log $ git lg $ git logfull
know who did what and when
$ git blame fichier
$ git conflict
To create a branch:
$ git branch branch_name
To change the current branch:
$ git checkout branch_name