Gene GM21_2177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2177 
Symbol 
ID8137513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2546209 
End bp2547219 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content63% 
IMG OID644869792 
Producttransglutaminase domain protein 
Protein accessionYP_003021987 
Protein GI253700798 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones126 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGT TGTTGCTGTC CATGCTCGCA CTCTCACTCT GTCTTACTCC GGCTGCCTGG 
GCTAAAAGCC GCGGCGGCGT AGTCACCGTG GAAGTGGATC TTTCCAAACA GGAGCAGGGC
AAGGAGACGA AACTCTGGAT ACCGTACGCG GTTTCCGGAA AGCACCAGTC AGTCACGGAC
GTGAAGGTGG GCGGCGATTT CGCCGCCTCG GCGGTTTACA CCGACACGGC CAACGGCACC
CCCATCCTCT TCGCCCAGTG GGGGAAGGAT GCGGCCAGCC GCAAGCTCAC CTACTCCTTT
TCCGTGGAGC GCGAGGAGGT GCTGGTGCGC GACCTTTCCG CGAAGGAGAC GTCCTGGAGC
AAGGCGGAGT TCGCCCCCTA CCTGCAGTCG ACCTCGATGG GCCCGGTCGA CGGCGAAGTG
AAAAAACTCG CCGATTCCAT CACCAAGGGG AAACGCACGG TCCTGGAGAA GGCGAAGGCG
ATCTATGACT GGACTTGCGA GAACATGTAC CGCGATCCGG CCACGGTCGG TTGCGGCAAG
GGCGACGTCT GCGAACTGCT CAAAAAGCCC GGCGGCAAGT GTACCGACAT CTCGTCGGTC
TACGTCGCCC TGGCGCGCGC TGCCGGCGTT CCCGCCCGCG AGGTCTTCGG GGTGAGGCTC
GGCAAAAAAG CGACGGAGGA TATCACTTCC TGGCAGCACT GCTGGGCCGA ATTCTACCTG
CCCGGCACCG GCTGGGTCCC GGTCGATCCG GCCGACGTGA GAAAGGCGAT GCTGGTCGAG
AAGCTCGAAC TGAAGGATGC GAAGACACGC GAGTACCGGG ACTACTTCTG GGGCGGGATC
GATCCGTACC GCTTCCAGGT CGCCGCCGGC CGCGACATCG TCCTCAACCC GCCTCAGGCA
GGCGCTTCGC TCAACACCTT CGGCTACCCT TATGCCGAGG TAGGCGGCGC GGCGCTCGAT
TCCTACGATC CCAAGAGCTT CAGCTACCGG ATCACCTACA AGGAACAGTA G
 
Protein sequence
MKRLLLSMLA LSLCLTPAAW AKSRGGVVTV EVDLSKQEQG KETKLWIPYA VSGKHQSVTD 
VKVGGDFAAS AVYTDTANGT PILFAQWGKD AASRKLTYSF SVEREEVLVR DLSAKETSWS
KAEFAPYLQS TSMGPVDGEV KKLADSITKG KRTVLEKAKA IYDWTCENMY RDPATVGCGK
GDVCELLKKP GGKCTDISSV YVALARAAGV PAREVFGVRL GKKATEDITS WQHCWAEFYL
PGTGWVPVDP ADVRKAMLVE KLELKDAKTR EYRDYFWGGI DPYRFQVAAG RDIVLNPPQA
GASLNTFGYP YAEVGGAALD SYDPKSFSYR ITYKEQ