Gene GM21_0779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0779 
Symbol 
ID8136094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp927761 
End bp928831 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content64% 
IMG OID644868396 
Productpeptidase M42 family protein 
Protein accessionYP_003020611 
Protein GI253699422 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones109 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACG CATCTTTCGA ATTCCTGCAG CAATTGCTGG CGGCGCCCAG CCCCTCGGGG 
TACGAGCAGC CTGCCCAGCG GGTCTTCCGC TCCTACATAG AGCCGTTTTG CCAGGTGGCG
ACCGACGTCA TGGGTAACGT CTTCGGCATG ATTCAGGGCG CCGGAAATAA CCGCCCCCGC
GTCATGGTCG TGGGGCACTC CGACGAGATC GGGCTCCAGG TCCGCTACCT GGACGACAAC
GGTTTCATCT ATTTCTCCGC CATCGGCGGG GTCGATCCCC ACATAACGCC CGGGATGCGG
GTCCACGTCC ACACCGCGAA GGGGAAGCTG AACGGCGTCA TCGGCAAGCG CCCCATCCAC
CTGATCGAGC CCAAAGAGCG CGACACCGTC ATTAAGCTGG ACGCCCAGTA CATAGACATC
GGCGCCGCCA ACAAGAAAGA GGCCCTGGAG TGGGTGCGGG TAGGCGATCC CATCACCTTC
GACAGCAATC TGGAGCGGCT CTTCGGGGAC CGCGTCAGTT CGCGTGGTCT CGACGACAAG
GCAGGCAGCT TCGTTGTCGC CGAGGTGCTC CGTCGCGTCT CCGAGCTTCC GGACCAGCTC
CCCATCGACC TCTACGGCGT ATCCTCCGTC CAGGAGGAGG TTGGGCTTCG CGGCGGCACC
ACCAGCAGCT ACTCGGTGAA CCCCGACGTC GGCATCTGCG TCGAGGTGGA TTTCGCCACC
GACCAGCCCG ACGTGGACAA AAAGCACAAC GGCGAAGTAG GTCTAGGCAA AGGGCCGATC
CTTCCCCGCG GCGCCAACAT CAACCCCGTC CTCTTCGACC TCCTCTCCGA CACCGCGACC
GGCAACGGCA TCGCCGTGCA GTACACCGGC ATCGCCCGGG CGACCGGCAC CGACGCGAAC
GTAATGCAGA TTTCGCGCGG GGGCGTCGCC ACCGCTTTGG TGAAGATCCC GCTGCGCTAC
ATGCACACCC CGGTGGAGAC CCTGTCGCTT GCCGACCTGG ACGGGGCGGT CGAGCTGATT
GTCGCCTCGC TCTCCAAGAT GGGGCACAAG GACGCGTTCA TTCCGATGTG A
 
Protein sequence
MRDASFEFLQ QLLAAPSPSG YEQPAQRVFR SYIEPFCQVA TDVMGNVFGM IQGAGNNRPR 
VMVVGHSDEI GLQVRYLDDN GFIYFSAIGG VDPHITPGMR VHVHTAKGKL NGVIGKRPIH
LIEPKERDTV IKLDAQYIDI GAANKKEALE WVRVGDPITF DSNLERLFGD RVSSRGLDDK
AGSFVVAEVL RRVSELPDQL PIDLYGVSSV QEEVGLRGGT TSSYSVNPDV GICVEVDFAT
DQPDVDKKHN GEVGLGKGPI LPRGANINPV LFDLLSDTAT GNGIAVQYTG IARATGTDAN
VMQISRGGVA TALVKIPLRY MHTPVETLSL ADLDGAVELI VASLSKMGHK DAFIPM