Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2177 |
Symbol | |
ID | 8137513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2546209 |
End bp | 2547219 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869792 |
Product | transglutaminase domain protein |
Protein accession | YP_003021987 |
Protein GI | 253700798 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 126 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGGT TGTTGCTGTC CATGCTCGCA CTCTCACTCT GTCTTACTCC GGCTGCCTGG GCTAAAAGCC GCGGCGGCGT AGTCACCGTG GAAGTGGATC TTTCCAAACA GGAGCAGGGC AAGGAGACGA AACTCTGGAT ACCGTACGCG GTTTCCGGAA AGCACCAGTC AGTCACGGAC GTGAAGGTGG GCGGCGATTT CGCCGCCTCG GCGGTTTACA CCGACACGGC CAACGGCACC CCCATCCTCT TCGCCCAGTG GGGGAAGGAT GCGGCCAGCC GCAAGCTCAC CTACTCCTTT TCCGTGGAGC GCGAGGAGGT GCTGGTGCGC GACCTTTCCG CGAAGGAGAC GTCCTGGAGC AAGGCGGAGT TCGCCCCCTA CCTGCAGTCG ACCTCGATGG GCCCGGTCGA CGGCGAAGTG AAAAAACTCG CCGATTCCAT CACCAAGGGG AAACGCACGG TCCTGGAGAA GGCGAAGGCG ATCTATGACT GGACTTGCGA GAACATGTAC CGCGATCCGG CCACGGTCGG TTGCGGCAAG GGCGACGTCT GCGAACTGCT CAAAAAGCCC GGCGGCAAGT GTACCGACAT CTCGTCGGTC TACGTCGCCC TGGCGCGCGC TGCCGGCGTT CCCGCCCGCG AGGTCTTCGG GGTGAGGCTC GGCAAAAAAG CGACGGAGGA TATCACTTCC TGGCAGCACT GCTGGGCCGA ATTCTACCTG CCCGGCACCG GCTGGGTCCC GGTCGATCCG GCCGACGTGA GAAAGGCGAT GCTGGTCGAG AAGCTCGAAC TGAAGGATGC GAAGACACGC GAGTACCGGG ACTACTTCTG GGGCGGGATC GATCCGTACC GCTTCCAGGT CGCCGCCGGC CGCGACATCG TCCTCAACCC GCCTCAGGCA GGCGCTTCGC TCAACACCTT CGGCTACCCT TATGCCGAGG TAGGCGGCGC GGCGCTCGAT TCCTACGATC CCAAGAGCTT CAGCTACCGG ATCACCTACA AGGAACAGTA G
|
Protein sequence | MKRLLLSMLA LSLCLTPAAW AKSRGGVVTV EVDLSKQEQG KETKLWIPYA VSGKHQSVTD VKVGGDFAAS AVYTDTANGT PILFAQWGKD AASRKLTYSF SVEREEVLVR DLSAKETSWS KAEFAPYLQS TSMGPVDGEV KKLADSITKG KRTVLEKAKA IYDWTCENMY RDPATVGCGK GDVCELLKKP GGKCTDISSV YVALARAAGV PAREVFGVRL GKKATEDITS WQHCWAEFYL PGTGWVPVDP ADVRKAMLVE KLELKDAKTR EYRDYFWGGI DPYRFQVAAG RDIVLNPPQA GASLNTFGYP YAEVGGAALD SYDPKSFSYR ITYKEQ
|
| |