Gene GM21_3648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3648 
Symbol 
ID8139022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4224474 
End bp4226348 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content67% 
IMG OID644871269 
Productglycosyltransferase 36 associated 
Protein accessionYP_003023427 
Protein GI253702238 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones177 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCACT TTATCACGCC GCGCTTGTGT TACCACCCGG AGTACGGCGA GCGGGTCGCC 
TTCGCGGCCA TGACCCCCGG CGCCGACGAC TACGGCGGCG ACCGCTCCGC CTTCATCGGG
CGCAACCGCT CTCTTGCCGC CCCTGCGGCC ATGGAAGCGA CCCGCCTGTC CCGAAGGACG
GGGGCCGGTC TCGATCCCTG CGCGGTCCAG CGGGTCACCC TGGAGCTGGC TCCGGGCGAG
CAGCGCGACA TCGTCTGCAT GCTGGGCCAG GAGGCTTCCG TCCCTGAGGC CCGGGAGCTG
GTCCTTCGCT ACCGGCAGGA CCGGTCCTTC GAAGACGCCT TCGATAGGAC CCGCGCCTGG
TGGGATGAAC TGCTGGGCAC GGTCCAGGTG CAGACCCCGG AGCCTGCCGC CGATCTCCTG
ATCAACCGCT GGCTTCAGTA CCAGTCCTTG AGCTGCCGTA TCTGGGGGCG CTCCGGCTTT
TACCAGTCGG GGGGCGCCTT CGGTTTTCGC GACCAGTTGC AGGACGTGAT GGCGTTTCTC
TATGCGAGGC CGGAGCTTGC CGCCGAGCAG ATCCTATTGG CCGCCAGCCG CCAGTTCAGC
GAGGGGGACG TTCAGCACTG GTGGCACGAG CCGGCCGGCG CCGGCATCCG GTCGCGCATC
TCCGACGACC TGCTCTGGCT CCCCTATGTG GTCGCGCAGT ACGTCCGGAC CACAGGGGAT
CTGGCGATCC TGGAGGCGGA GGTCCCCTTC CTGAACGCCC CGCCACTTTC GGACGATCAG
CACGAGGTGT TCTCCGTCCC GCAGGTAAGT CTCGAGCGGG CGACACTCTT CGAGCACTGC
AGGCGGGCGG TGGCCCGCGG CCTCACCACA GGCCCCCACG GCTTACCGCT GATGGGGACA
GGCGACTGGA ACGACGGCAT GAACCTGGTT GGCGCCGGCG GCAAGGGCGA GAGCGTATGG
CTTGCCTGGT TTCTTTGCGA CATCCTGAAG GGAATGGCGG AGATCTCCAC CCTCCTAAAG
CTGCCGGAAC TGGCCCGGGG GTATCTGGAG GAGAGGTCCG CCCTGGTGCT TCGTACCGAA
AAGGCCGGCT GGGACGGGGA GTGGTATCTA AGGGGGACCT TCGACGACGG CACACCGCTC
GGCTCCTCGT CGAACAGCGA GGCGAGGATC GATTCGCTCC CGCAATCGTG GGCTTGGCTT
TCCGGCGCAG CCGACCGTGA ACGCGCCGGG AAGGGACTGG AAGCGGCCTG GCAGCATCTG
GTGCGCGAGG ATGAGGGGCT CGTGCTTCTT TTCGAACCCC CCTTCGACAC CTCGGAACCG
TCTCCAGGGT ACATCAAGGG GTATCCGCCG GGGGTGCGCG AAAACGGCGG GCAGTACACC
CACGCGGCAC TCTGGATGGT CATGGCCCTG GCGAAAAAGG GAGAAGGGGA TCGCGCGGCG
CAGCTTTTGC GCGTGCTCAA TCCGATCGAG CACGCGCGGG ACGCTCAGGC CGCCTGGCTT
TACGGGGTCG AGCCCTACGT GGTAGCCGCC GACGTGTACC GTTTGCCGGG ACGTATCGGA
CAAGGAGGCT GGTCCTGGTA CACCGGCTCC GCCGCCTGGA TGTACCGGGC CTGGATCGAA
GAGGTGTTGG GACTTAAGGT GAGAGGAGAC GAGCTGCGGA TGAATCCGGT CATCCCCGCT
GCGTGGCCCG GCTTCAGCAT GAGCTATCGG CATGGAGAGG CGGTCTACGC GATTCGAGTG
GAGAACCCGG ACGGCTGCCA GTGCGGGGTC GCCCAGGTGG AAATGGATGG CCGCCGTGCC
GACGGCGGCG TGATCAAACT GGAACGGGGT CTGGTGAAGC ATCAGGTCGT GGTGCGGATG
GGTACCCGGA AGTAG
 
Protein sequence
MSHFITPRLC YHPEYGERVA FAAMTPGADD YGGDRSAFIG RNRSLAAPAA MEATRLSRRT 
GAGLDPCAVQ RVTLELAPGE QRDIVCMLGQ EASVPEAREL VLRYRQDRSF EDAFDRTRAW
WDELLGTVQV QTPEPAADLL INRWLQYQSL SCRIWGRSGF YQSGGAFGFR DQLQDVMAFL
YARPELAAEQ ILLAASRQFS EGDVQHWWHE PAGAGIRSRI SDDLLWLPYV VAQYVRTTGD
LAILEAEVPF LNAPPLSDDQ HEVFSVPQVS LERATLFEHC RRAVARGLTT GPHGLPLMGT
GDWNDGMNLV GAGGKGESVW LAWFLCDILK GMAEISTLLK LPELARGYLE ERSALVLRTE
KAGWDGEWYL RGTFDDGTPL GSSSNSEARI DSLPQSWAWL SGAADRERAG KGLEAAWQHL
VREDEGLVLL FEPPFDTSEP SPGYIKGYPP GVRENGGQYT HAALWMVMAL AKKGEGDRAA
QLLRVLNPIE HARDAQAAWL YGVEPYVVAA DVYRLPGRIG QGGWSWYTGS AAWMYRAWIE
EVLGLKVRGD ELRMNPVIPA AWPGFSMSYR HGEAVYAIRV ENPDGCQCGV AQVEMDGRRA
DGGVIKLERG LVKHQVVVRM GTRK