Gene GM21_3904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3904 
Symbol 
ID8139278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4490646 
End bp4493198 
Gene Length2553 bp 
Protein Length850 aa 
Translation table11 
GC content66% 
IMG OID644871521 
Productalpha-glucan phosphorylase 
Protein accessionYP_003023679 
Protein GI253702490 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02094] alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.0004418 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCTGA GCTCAACGCT GCACAGATTT ACCGTCGTCC CCTCGCTTCC CAAAGAGCTT 
GCTGGGCTGC AGCGCATCGC CTACAACCTC TGGTGGAGCT GGGAACCCGA GGCCATCGGC
CTTTTCAAAC GGCTGGACCC CGAACTCTGG CGCCTGACGC GCCACAACCC CCTCGAGGTG
CTGGGAAGCC TGCAGCAGGC GACCTTCGAG AGCCTGATGG CGGACGAGGG GTTCATGTCC
CACCTCTCGC AGGTAGAGGA GCGGCTCACC GAATACCTCT CCTCCCGCAC CTGGTACGAG
CGGCACGGCA ACCCCGGCGC CCGCATCGCC TACTTCTCCA TGGAATTCGG CCTGCACGAG
TCGCTCCCCA TCTACTCCGG GGGGCTCGGC ATCCTCGCGG GCGACCACCT CAAATCCGCC
AGCGATTTAG GACTCCCCCT GGTCGGCGTC GGGCTTTTGT ACCGTCAGGG GTACTTCCGC
CAGTACCTGA ACCTGGAAGG GTGGCAGCAG GAAATCTACC CGGAAAACGA CTTCTACAAC
CTCCCGCTGC ACCAGGAGAG GGACAGGGAA GGAAAGCCCC TGGCGTTCTC CCTGGAGTAC
CCGGGGAGGA ATGTGCGGGT GCAGGTCTGG CGGGTGCAGG TAGGGCGGGT CAACCTCTAC
CTCCTGGACA CGAACCTCGA GGAGAATTCC CCCGCCGACC GCGAGATCAC CACCAGGCTC
TACGGCGGCG ACCAGGAGAT GCGGATCAGG CAGGAGATCC TGCTCGGCAT CGGCGGCGTC
CGGGCGCTGA GGCTTTTGGG GGCCGAGCCC AACGTCTGCC ACATGAACGA GGGGCACGCC
GCCTTCCTGG CGCTGGAGCG GATCCGCTCC CTGATGGAGC AGCGCCAGCT CAACTTCCGG
GAGGCGATGG AGGCGGTGAG AGGCGGCAAC GTCTTCACCA CCCACACGCC GGTCGAGGCC
GGGATCGACC ATTTCCCACC CGACCTCCTC GACCAGTACA TGGGTCATTA CTACCGCAGC
CTCGGGCTCT CCCGCGAGCA GTTCATGGCC CTGGGGATGC AGAACCACGG CAGGAGCAAC
GAAAGTTTCT GCATGGCGGT GCTCGCCATG AAGCTCTCGC TCCACTCGAA CGGGGTGAGC
GAACTGCACG GGGAGGTGTC GCGCAGGATG TGGGCGGACG TCTGGCCGGA CCTCCCCGAG
GAACAGCTCC CCCTTACCTA CGTCACCAAC GGGGTGCACC AGAAGAGCTG GCTCTCGGAG
GAGATGACCG GCCTGTTGAT CCGGTTCCTC GGCACCAGAT GGCTGGAGCA AAGCGCCGAC
CAGCTCTGGC GCAGGGTCTC GCGCATCCCG GACGCGGAAC TTTGGCGCAC GCACCGGCGC
GGCACCGAGC AGCTGGTCGA CTACGCGCGC CGCAGCCTGA GGGCGCAGTT GGAGAAGCTC
GACGCCTCCG CCAAGGAGAT CGACGCGGCA GGCGACGTCC TCGACCCGGA GATACTCACC
ATCGGCTTCG CGCGCCGCTT CGCCACCTAC AAGCGAGGGA CGCTCCTGTT GCACGACAAG
GAGCGGCTTT TCAGGATACT GAACCATCCC GAGCGGCCGG TGCAGATCGT CTTCGCCGGG
AAGGCGCACC CCGCCGACCA CCAGGGTAAG GAGCTGATCC GCCAGATCGT GCAGCTGTCG
CAGCAGCCCG AATTCCGGCG CCGCATCGTC TTCCTGGAGG ATTACGACAT CTCGGTGGCC
CGGCGCCTGG TGCAGGGGGT GGACGTCTGG CTCAACACGC CGCTCAGGCC CCTGGAGGCC
AGCGGCACCA GCGGCATGAA GGTCGCCTTC AACGGCGGCC TCAACCTGAG CATCCTGGAC
GGGTGGTGGT GCGAGGGGTA CCGCGGGAAC AACGGCTGGG CCATAGGGCG CGGCGAGGTG
TACGACGACC TGGCCTACCA GAACCAGGTG GAGAGCCGCG CCATCTATGA CCTTCTGGAG
AAGGAGATCG TGCCCCTTTT CTACAACCGG GGGAGCGACG GCATCCCGCG CGGCTGGACC
GCCTTCATGA AGAGCTCGAT GCAGACGCTC TGCCCGGTGT TCAGCACGGA CCGGATGGTG
CAGGAGTACG CCAGGCGCTG CTACCTTCCC GCCTTCGAGC ACTGGGAGCG GCTGAACCGC
GACAACCTGA GGCTCGCGGT GGAGCTGGCG CGCTGGAAGG AGCGGCTGCA CGGCATGTGG
GGTGAGCTTT CCATCGTGGC GGTCCACGCC GAGATCCGGA AGGAAGTGAC GGTGGGGGAG
ATGCTCCCCA TCACGGTGCA GATCACGCCG GGACGGATCC CCCTTTCGGA GATCGCGGTC
GAGGTATATT TCGGAGTGCT CGATTCCCGC GGCGCCATCA TCGGGGGGGA GATCGTGCCG
CTTGAGCCTG CGGCCGACCC GGAAAAGGCG GGGCACTTCG GCGGGGAACT GGAGTGCCGC
TTCTGCGGCA GGCACGGCTT CATGCTGCGG GTAATGCCCA GGCACCCTGA GCTTGGGACC
GTGTACGATC CGGGGTTGAT ACTCTGGGGC TGA
 
Protein sequence
MNLSSTLHRF TVVPSLPKEL AGLQRIAYNL WWSWEPEAIG LFKRLDPELW RLTRHNPLEV 
LGSLQQATFE SLMADEGFMS HLSQVEERLT EYLSSRTWYE RHGNPGARIA YFSMEFGLHE
SLPIYSGGLG ILAGDHLKSA SDLGLPLVGV GLLYRQGYFR QYLNLEGWQQ EIYPENDFYN
LPLHQERDRE GKPLAFSLEY PGRNVRVQVW RVQVGRVNLY LLDTNLEENS PADREITTRL
YGGDQEMRIR QEILLGIGGV RALRLLGAEP NVCHMNEGHA AFLALERIRS LMEQRQLNFR
EAMEAVRGGN VFTTHTPVEA GIDHFPPDLL DQYMGHYYRS LGLSREQFMA LGMQNHGRSN
ESFCMAVLAM KLSLHSNGVS ELHGEVSRRM WADVWPDLPE EQLPLTYVTN GVHQKSWLSE
EMTGLLIRFL GTRWLEQSAD QLWRRVSRIP DAELWRTHRR GTEQLVDYAR RSLRAQLEKL
DASAKEIDAA GDVLDPEILT IGFARRFATY KRGTLLLHDK ERLFRILNHP ERPVQIVFAG
KAHPADHQGK ELIRQIVQLS QQPEFRRRIV FLEDYDISVA RRLVQGVDVW LNTPLRPLEA
SGTSGMKVAF NGGLNLSILD GWWCEGYRGN NGWAIGRGEV YDDLAYQNQV ESRAIYDLLE
KEIVPLFYNR GSDGIPRGWT AFMKSSMQTL CPVFSTDRMV QEYARRCYLP AFEHWERLNR
DNLRLAVELA RWKERLHGMW GELSIVAVHA EIRKEVTVGE MLPITVQITP GRIPLSEIAV
EVYFGVLDSR GAIIGGEIVP LEPAADPEKA GHFGGELECR FCGRHGFMLR VMPRHPELGT
VYDPGLILWG