Gene GM21_4107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4107 
Symbol 
ID8139481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4689534 
End bp4691321 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content65% 
IMG OID644871722 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003023880 
Protein GI253702691 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones103 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTATC CCCGCCCGCA ATTGCAACGC CCCAACTGGC ATACGCTCGA CGGCCAGTGG 
CACTTCAGCT TCGACGACGA TCTCTCTTAC AGCATTCCGG CCGACGTGAA AGAGTGGCCC
CTCACCATTG AGGTACCGTA CGCGCCCGAG AGTGAGGCAA GCGGCATCGG CGACACCTCC
TTTCACCATG CCTGCTGGTA CCAGCGCGAG TTCTTCTACG ACGCGAGCGA CGGGAAGAAG
GTGCAGCTCC ATTTCGGCGC CGTCGATTAC CTGGCGCACG TCTGGGTCAA CGACATGCTG
GTGGCGCGGC ACCGAGGCGG CTTCACCCCG TTCAGCGCCG ACATCACCCA CGCCCTCGAT
CCTTCCGGGC GCCAGGTCGT GACCGTGCGG GTCGAGGACG ACCCTGAGGA TCTGGAAAAG
CCGCGCGGAA AGCAGGACTG GAAGCTGGAG CCGCACGCGA TCTGGTACTA CCGCACCACC
GGCATCTGGC AGACCGTCTG GCTGGAATCG GTCCCCGAGA CCTACCTCAA GAAGCTGCGC
TGGTCGCCGC ACCTGGAGCG CTGGGAGGTG GGATTCGAGG CCTTCATCGT GGGGCCTATC
GCGGACCAGA TGGAAGTGAA CGTGCGGCTT TCCTTCAGCG ACCAGCTCCT GGTCGAGGAC
CGCTACCGGG TGATCCACGG CGAGGTGCAC CGGCGCATCG CCCTCTCCGA CCCGGGGATC
GACGACTTCA GGAACGACCT CCTCTGGTCC CCGGAGCACC CGAGGCTGAT CCACGCCGAC
ATCAAGCTGA TGCAGGGGGG CGAGGTGATC GACGAGATCA CCTCCTACAC CGCGCTCCGT
TCCGCCAAGG TGCACCGGGA CCGGTTCCTC CTGAACGGGC GGCTCTATCC GCTCAGGCTG
GTGCTGGACC AGGGTTACTG GCCGGAAACG CTGATGACTC CCCCCTCCGA CGAGGCGCTC
AAGCGGGACG TCGAGCTCAC CAAGGCGATG GGGTTCAACG GGGTGAGAAA GCACCAAAAG
CTGGAGGACC CGCGCTACCT GTACTGGGCG GATCGGCTGG GGCTCGTGGT CTGGTCGGAG
ATGCCGAGCG CCTTCCGCTT CACCACGCGC GCGATAAAAA GGCTGATGCG GGAGTGGATC
GAGGCGATCG ATCGTGATTA CAGCCACCCC TGCGTCATCG TCTGGGTCCC CTTCAACGAG
TCGTGGGGGG TCCCTGACTT GACGGCCACC AAGGCGCACC GGGACGCGGT GCACGCCTTC
TACCACCTCA CCAAGACGCT CGACCCCGAG CGGCCGGTGA TCGGCAACGA CGGCTGGGAA
AGCTCCGCCA CCGACATCAT CGGCATCCAC GACTACGACA ACAACCCCGA GAGGCTCGCG
GAGCGCTACG GCCCGCAGGT GAAGCCGGAG GAGCTCTTCG ACCGGCGCCG CCCGGGCGGG
CGCATCCTCA CCCTCGACGG CTATCCGCAC CGGGGGCAGC CCATCATCCT CACCGAGTTC
GGCGGCGTCG CCTACGTAAA ACCCTCGGAC GCCCTGCACC AGAAGGCGTG GGGGTATTTC
CGCCACGACA AGATCGAGGA GTTCGAACGC CTCGCCCTCT CCCTCATCGA GACCGCCCGC
GGCGTCGCCA TGTTCAGCGG CTTTTGCTAC ACGCAATTCG CCGACACCTT CCAGGAGGCC
AACGGCCTCC TCTTCGCGGA CCGGACCCCG AAGATCCCGC TGGAGCGGAT CGCCGACGCG
GTGCGTGGGA AGGTCGAACA GGGCGCGCTC TTCTGGGAAA GCACCTGA
 
Protein sequence
MDYPRPQLQR PNWHTLDGQW HFSFDDDLSY SIPADVKEWP LTIEVPYAPE SEASGIGDTS 
FHHACWYQRE FFYDASDGKK VQLHFGAVDY LAHVWVNDML VARHRGGFTP FSADITHALD
PSGRQVVTVR VEDDPEDLEK PRGKQDWKLE PHAIWYYRTT GIWQTVWLES VPETYLKKLR
WSPHLERWEV GFEAFIVGPI ADQMEVNVRL SFSDQLLVED RYRVIHGEVH RRIALSDPGI
DDFRNDLLWS PEHPRLIHAD IKLMQGGEVI DEITSYTALR SAKVHRDRFL LNGRLYPLRL
VLDQGYWPET LMTPPSDEAL KRDVELTKAM GFNGVRKHQK LEDPRYLYWA DRLGLVVWSE
MPSAFRFTTR AIKRLMREWI EAIDRDYSHP CVIVWVPFNE SWGVPDLTAT KAHRDAVHAF
YHLTKTLDPE RPVIGNDGWE SSATDIIGIH DYDNNPERLA ERYGPQVKPE ELFDRRRPGG
RILTLDGYPH RGQPIILTEF GGVAYVKPSD ALHQKAWGYF RHDKIEEFER LALSLIETAR
GVAMFSGFCY TQFADTFQEA NGLLFADRTP KIPLERIADA VRGKVEQGAL FWEST