Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3904 |
Symbol | |
ID | 8139278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4490646 |
End bp | 4493198 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871521 |
Product | alpha-glucan phosphorylase |
Protein accession | YP_003023679 |
Protein GI | 253702490 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0058] Glucan phosphorylase |
TIGRFAM ID | [TIGR02094] alpha-glucan phosphorylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.0004418 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCTGA GCTCAACGCT GCACAGATTT ACCGTCGTCC CCTCGCTTCC CAAAGAGCTT GCTGGGCTGC AGCGCATCGC CTACAACCTC TGGTGGAGCT GGGAACCCGA GGCCATCGGC CTTTTCAAAC GGCTGGACCC CGAACTCTGG CGCCTGACGC GCCACAACCC CCTCGAGGTG CTGGGAAGCC TGCAGCAGGC GACCTTCGAG AGCCTGATGG CGGACGAGGG GTTCATGTCC CACCTCTCGC AGGTAGAGGA GCGGCTCACC GAATACCTCT CCTCCCGCAC CTGGTACGAG CGGCACGGCA ACCCCGGCGC CCGCATCGCC TACTTCTCCA TGGAATTCGG CCTGCACGAG TCGCTCCCCA TCTACTCCGG GGGGCTCGGC ATCCTCGCGG GCGACCACCT CAAATCCGCC AGCGATTTAG GACTCCCCCT GGTCGGCGTC GGGCTTTTGT ACCGTCAGGG GTACTTCCGC CAGTACCTGA ACCTGGAAGG GTGGCAGCAG GAAATCTACC CGGAAAACGA CTTCTACAAC CTCCCGCTGC ACCAGGAGAG GGACAGGGAA GGAAAGCCCC TGGCGTTCTC CCTGGAGTAC CCGGGGAGGA ATGTGCGGGT GCAGGTCTGG CGGGTGCAGG TAGGGCGGGT CAACCTCTAC CTCCTGGACA CGAACCTCGA GGAGAATTCC CCCGCCGACC GCGAGATCAC CACCAGGCTC TACGGCGGCG ACCAGGAGAT GCGGATCAGG CAGGAGATCC TGCTCGGCAT CGGCGGCGTC CGGGCGCTGA GGCTTTTGGG GGCCGAGCCC AACGTCTGCC ACATGAACGA GGGGCACGCC GCCTTCCTGG CGCTGGAGCG GATCCGCTCC CTGATGGAGC AGCGCCAGCT CAACTTCCGG GAGGCGATGG AGGCGGTGAG AGGCGGCAAC GTCTTCACCA CCCACACGCC GGTCGAGGCC GGGATCGACC ATTTCCCACC CGACCTCCTC GACCAGTACA TGGGTCATTA CTACCGCAGC CTCGGGCTCT CCCGCGAGCA GTTCATGGCC CTGGGGATGC AGAACCACGG CAGGAGCAAC GAAAGTTTCT GCATGGCGGT GCTCGCCATG AAGCTCTCGC TCCACTCGAA CGGGGTGAGC GAACTGCACG GGGAGGTGTC GCGCAGGATG TGGGCGGACG TCTGGCCGGA CCTCCCCGAG GAACAGCTCC CCCTTACCTA CGTCACCAAC GGGGTGCACC AGAAGAGCTG GCTCTCGGAG GAGATGACCG GCCTGTTGAT CCGGTTCCTC GGCACCAGAT GGCTGGAGCA AAGCGCCGAC CAGCTCTGGC GCAGGGTCTC GCGCATCCCG GACGCGGAAC TTTGGCGCAC GCACCGGCGC GGCACCGAGC AGCTGGTCGA CTACGCGCGC CGCAGCCTGA GGGCGCAGTT GGAGAAGCTC GACGCCTCCG CCAAGGAGAT CGACGCGGCA GGCGACGTCC TCGACCCGGA GATACTCACC ATCGGCTTCG CGCGCCGCTT CGCCACCTAC AAGCGAGGGA CGCTCCTGTT GCACGACAAG GAGCGGCTTT TCAGGATACT GAACCATCCC GAGCGGCCGG TGCAGATCGT CTTCGCCGGG AAGGCGCACC CCGCCGACCA CCAGGGTAAG GAGCTGATCC GCCAGATCGT GCAGCTGTCG CAGCAGCCCG AATTCCGGCG CCGCATCGTC TTCCTGGAGG ATTACGACAT CTCGGTGGCC CGGCGCCTGG TGCAGGGGGT GGACGTCTGG CTCAACACGC CGCTCAGGCC CCTGGAGGCC AGCGGCACCA GCGGCATGAA GGTCGCCTTC AACGGCGGCC TCAACCTGAG CATCCTGGAC GGGTGGTGGT GCGAGGGGTA CCGCGGGAAC AACGGCTGGG CCATAGGGCG CGGCGAGGTG TACGACGACC TGGCCTACCA GAACCAGGTG GAGAGCCGCG CCATCTATGA CCTTCTGGAG AAGGAGATCG TGCCCCTTTT CTACAACCGG GGGAGCGACG GCATCCCGCG CGGCTGGACC GCCTTCATGA AGAGCTCGAT GCAGACGCTC TGCCCGGTGT TCAGCACGGA CCGGATGGTG CAGGAGTACG CCAGGCGCTG CTACCTTCCC GCCTTCGAGC ACTGGGAGCG GCTGAACCGC GACAACCTGA GGCTCGCGGT GGAGCTGGCG CGCTGGAAGG AGCGGCTGCA CGGCATGTGG GGTGAGCTTT CCATCGTGGC GGTCCACGCC GAGATCCGGA AGGAAGTGAC GGTGGGGGAG ATGCTCCCCA TCACGGTGCA GATCACGCCG GGACGGATCC CCCTTTCGGA GATCGCGGTC GAGGTATATT TCGGAGTGCT CGATTCCCGC GGCGCCATCA TCGGGGGGGA GATCGTGCCG CTTGAGCCTG CGGCCGACCC GGAAAAGGCG GGGCACTTCG GCGGGGAACT GGAGTGCCGC TTCTGCGGCA GGCACGGCTT CATGCTGCGG GTAATGCCCA GGCACCCTGA GCTTGGGACC GTGTACGATC CGGGGTTGAT ACTCTGGGGC TGA
|
Protein sequence | MNLSSTLHRF TVVPSLPKEL AGLQRIAYNL WWSWEPEAIG LFKRLDPELW RLTRHNPLEV LGSLQQATFE SLMADEGFMS HLSQVEERLT EYLSSRTWYE RHGNPGARIA YFSMEFGLHE SLPIYSGGLG ILAGDHLKSA SDLGLPLVGV GLLYRQGYFR QYLNLEGWQQ EIYPENDFYN LPLHQERDRE GKPLAFSLEY PGRNVRVQVW RVQVGRVNLY LLDTNLEENS PADREITTRL YGGDQEMRIR QEILLGIGGV RALRLLGAEP NVCHMNEGHA AFLALERIRS LMEQRQLNFR EAMEAVRGGN VFTTHTPVEA GIDHFPPDLL DQYMGHYYRS LGLSREQFMA LGMQNHGRSN ESFCMAVLAM KLSLHSNGVS ELHGEVSRRM WADVWPDLPE EQLPLTYVTN GVHQKSWLSE EMTGLLIRFL GTRWLEQSAD QLWRRVSRIP DAELWRTHRR GTEQLVDYAR RSLRAQLEKL DASAKEIDAA GDVLDPEILT IGFARRFATY KRGTLLLHDK ERLFRILNHP ERPVQIVFAG KAHPADHQGK ELIRQIVQLS QQPEFRRRIV FLEDYDISVA RRLVQGVDVW LNTPLRPLEA SGTSGMKVAF NGGLNLSILD GWWCEGYRGN NGWAIGRGEV YDDLAYQNQV ESRAIYDLLE KEIVPLFYNR GSDGIPRGWT AFMKSSMQTL CPVFSTDRMV QEYARRCYLP AFEHWERLNR DNLRLAVELA RWKERLHGMW GELSIVAVHA EIRKEVTVGE MLPITVQITP GRIPLSEIAV EVYFGVLDSR GAIIGGEIVP LEPAADPEKA GHFGGELECR FCGRHGFMLR VMPRHPELGT VYDPGLILWG
|
| |