Gene Information |
Locus tag | GM21_0695 |
Symbol | |
ID | 8136010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 831796 |
End bp | 835152 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868312 |
Product | alpha amylase catalytic region |
Protein accession | YP_003020527 |
Protein GI | 253699338 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
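The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, that the gene is an unspliced CDS (as the record states), and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(831796, 835152)
print(length)                     # 3357
print(protein_length_aa(length))  # 1118
```

Both results match the Gene Length (3357 bp) and Protein Length (1118 aa) rows.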
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00000000568993 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAGA GATTAATCAA CTCCGACTAT TTCCCCTTCC AGATTCAGAT CTCGGCGGAT TGCGGCTCGA AGATTGAGTT GCGCGACCTG CTCGAGGAGC TGGGAGAGGT GCCTGGCATC ATCTACGCCA GGCGGATAGC GGCCAAGCTG AACCGCCAGT TGGCCCCGCA GGAAACCCCG GTCCAGCCCG GCCTTTTGCA CCTATATTCC ATCTTGAGCC AGGTTTACCG CTACGTGCTC AGCCAGTACT GCTCCAAACA ACAGCCGAGC ATACTGGCGG CCTTGATGGC GCAGGCGGGC TACCCGGAGT TCTCCGGCGA CGCCGGCCGT GCGTTGTACC GCTTCATGGA GCTTTTCCCC TCACGGCAGA TGGTCCTCGG CAACGAAACC CCGGAAGGGC TGTTGGCGCA GGACGGCGAG GATCTGGCGC GCCGGCAGGC CTTGACCGCG GAGCTGCTCC TCATGCTCTT AAACGGGGAA AACCGCGCGC TGAACCAGTT CCGGCGCATC TTCGACGCTT CGGAACTGGC CTCCTCCTCC CCCTACCTCG CGGTGGCCCT GGAGCTCGAC CGCCGCCTGG CGAAGGCCCC CCCGTTCGAG CCGGTGGGTG TGTCGCTCAC GGAGCTTTTG CGCGCCCCGC TGAAGGCCTC CCCCGACTCC CTTTCCGGCC AGATCGCCTA CATACGTGAC AACTGGGCCT CGTTTCTCCC GCAGGAGCTT CTGGGCGAGC TCGTGACCGC GCTCGACATC GTCTCCCAGG AAGGGCGCGC CTTCTTCGGC GGCCCGGGCG AACCCAAGGT GCTTAAATTC GGCAGGAACG CCCAGCGAGG CGGCGACGAA TATCCCGAGT ATGAGCGCTT TTCCCGGGAC GCCGACTGGA TGGCCAACGT GGTGATGATC GCTAAGATGG TCTACGTCTG GCTGGGGCAG TTGGCGAAGA CCTACCAGAC CGAGGTGCAC ACCCTGGACC AGATCCCGGA CGCCGAGCTG GACCGGCTGG CGCGCTACGG CTTCTCGGCC CTGTGGCTGA TCGGCATATG GGAGCGCTCC CCTGCCTCTC AGACCATCAA GCGCATCTCC GGCAACCAGG AGGCGATCTC CTCGGCCTAT TCGCTCTACG ACTACGTGAT CGCGCACGAC CTTGGGGGGG AGTGGGCCTT GGAGAACCTG AGGCGGCGCT GCGCCGCGCG GGGCATCAGG CTTGCGAGCG ACATGGTCCC CAACCATACC GGCCTCTACT CCAAGTGGAC CGTGGAGCAC CCCGACTGGT TCATCCAGCT CGATTACCCC CCCTATCCCG ACTACCAGTT CAACGGACCC GACCTCTCCC CCGACGGCCG TGTGGGGCTC TTCATCGAGG ACGGCTACTG GGACAAGCGC GACGCGGCGG TGGTGTTCAA GCACCTGGAC CGCGACAACG GCAGGGTGCG CTACATCTAT CACGGCAACG ACGGCACGAG CACCCCCTGG AACGACACGG CGCAGCTGAA CTACCTGATC CCGGAGGTGC GCGAGGCGGT GATAGGGACC ATCCTCCACG TGGCGCGCCA GTTCCCGATC ATCCGTTTCG ACGCCGCCAT GACCCTTGCC AAGAAGCACT ACCAGCGCCT GTGGTTCCCG CTCCCCGGGC ACGGCTCGGG CGTCCCATCC AGGGCCGAGC ACGGCATGGA CCGCGCGAGT TTCGACGAGG TCTTCCCGGT CGAGTTCTGG CGCGAGGTGG TGGACCGGGT CGCGGTGGAG GCGCCCGACA CGCTACTCCT GGCGGAGGCG TTCTGGCTCA TGGAGGGGTA CTTCGTGCGC 
ACCCTCGGCA TGCACCGGGT CTACAACAGC GCCTTCATGA ACATGCTGAA GATGGAGGAG AATGCCAAGT ACCGGCAGAC GGTGAAAAAC GTGCTGGAGT TCGAACCGGA GATCCTGAAG CGGTTCGTCA ACTTCATGAA CAACCCCGAC GAACGCACCG CGGTCGAGCA GTTCGGCAAG GAAGGGAAGT ACTTCGGCGC GACGGTGCTC CTGGTCACCA TGCCGGGGCT CCCCATGGTC GGGCACGGAC AGATCGAGGG GTTCCACGAG AAGTACGGGA TGGAGTACAA GCGGGCCTAT TGGGACGAGC CGGTGGACGA GCACCTGGTG GCCCGGCACG AGAGCGACAT CTTCCCGCTC ATGCGCAGGC GCCACATCTT CTCGGGAAGC GAGCAGTTCA CCCTGTACGA CTTTTTCAGC GGCAACAGCG TGAACGAGAA CGTCTTCGCC TACTCCAACC GCAACAACGG CGAACGCGGG CTGATCCTTT TCCACAACGC CTTCGCCTCC ACCGCGGGGT GGATCAGGAG CTCGTGCGCG GTGCTCAGGA AGAACGCCGC AGGGGGGACC TCGCTGGTTC AGACGAACCT TGGGGAGGCG CTGGGCTTCA AGGGGGACGG GCGCCATTAC TACTCGTTCC GGGATTACGC CTCGGGGCTT AGCTACTTGA GAAACGGCCG CGACCTCTGC GACCAGGGGC TCTACGTGGA GATGGAGGGG TACGAGTACC ACGCCTTCCT CGACTTCGAG GAGATCTACG ACGACGATTT CGGGACCTGG GGCGCCCTCT GTTACCGCAT GAATGGGGCG GGGGTGGAGA GCATCGAGGA GGAGGTGAAG CAAGTTCGCT ACGCCTCGGC CCACGGCGCC CTGCAATCGC TGCTCTCCAA GGCGGCCGCC GCGGCCCGCG AGCCGGGAGC CACCGTGCAG TCGCTCCTAT CCCAGCTGGA GCCGCTTCTG GCTGCCTTCT TCAAGGCCGT CGCGCCGCAG GCCCCCGAAA AGGTGCAGCG CCTTTTGGTG AGCGCCTTCG GAACCGAAGC GGACGACGCG CTGCGGGGGG CATCGACCGG TCCGGAGCGC GCCCCCGGGG AATGGCTCCT TCTTTGCGCC TATCTGGCCC TGCACCGGAT CGGCGACCTC TCAGGCGAGG AATCGGCGGA GGTGGTCGAT GCGCTCGGCC TTGTGCGCCC GGTGGTGCAT GCCTTTCACG CACTTCCCCC CGCCGAGCCG GAAGAGGCGG CCGACCTCTC GCCGGACGCC TACGGCAAGC TGTTGCGGGT ACTTTTGCGC CAAGATTCCT TCTTCTCGCT CTGCCGCGAG CTCGGTGCGT TGAAAAGCTG TGCCGCGCTT TTCTCCGACC CGGCGGCTGC CGGTTTTGTC TACCTGCATG AAAGCGCCGG GGTGCAGTGG TTCAACAAGG AGCGCTTCGA GCTCCTGCTC GACTGGTTCC TGAAGCTAGA CGAGGACAGT GCGCTGCACG GGATGCTCAG GGAGAGCGCC GCGGCGGCCG GCTACCGGCT CGATGAGCTA ATTAAAATCT TGGGGTCTCC TTCCTGA
|
Protein sequence | MKKRLINSDY FPFQIQISAD CGSKIELRDL LEELGEVPGI IYARRIAAKL NRQLAPQETP VQPGLLHLYS ILSQVYRYVL SQYCSKQQPS ILAALMAQAG YPEFSGDAGR ALYRFMELFP SRQMVLGNET PEGLLAQDGE DLARRQALTA ELLLMLLNGE NRALNQFRRI FDASELASSS PYLAVALELD RRLAKAPPFE PVGVSLTELL RAPLKASPDS LSGQIAYIRD NWASFLPQEL LGELVTALDI VSQEGRAFFG GPGEPKVLKF GRNAQRGGDE YPEYERFSRD ADWMANVVMI AKMVYVWLGQ LAKTYQTEVH TLDQIPDAEL DRLARYGFSA LWLIGIWERS PASQTIKRIS GNQEAISSAY SLYDYVIAHD LGGEWALENL RRRCAARGIR LASDMVPNHT GLYSKWTVEH PDWFIQLDYP PYPDYQFNGP DLSPDGRVGL FIEDGYWDKR DAAVVFKHLD RDNGRVRYIY HGNDGTSTPW NDTAQLNYLI PEVREAVIGT ILHVARQFPI IRFDAAMTLA KKHYQRLWFP LPGHGSGVPS RAEHGMDRAS FDEVFPVEFW REVVDRVAVE APDTLLLAEA FWLMEGYFVR TLGMHRVYNS AFMNMLKMEE NAKYRQTVKN VLEFEPEILK RFVNFMNNPD ERTAVEQFGK EGKYFGATVL LVTMPGLPMV GHGQIEGFHE KYGMEYKRAY WDEPVDEHLV ARHESDIFPL MRRRHIFSGS EQFTLYDFFS GNSVNENVFA YSNRNNGERG LILFHNAFAS TAGWIRSSCA VLRKNAAGGT SLVQTNLGEA LGFKGDGRHY YSFRDYASGL SYLRNGRDLC DQGLYVEMEG YEYHAFLDFE EIYDDDFGTW GALCYRMNGA GVESIEEEVK QVRYASAHGA LQSLLSKAAA AAREPGATVQ SLLSQLEPLL AAFFKAVAPQ APEKVQRLLV SAFGTEADDA LRGASTGPER APGEWLLLCA YLALHRIGDL SGEESAEVVD ALGLVRPVVH AFHALPPAEP EEAADLSPDA YGKLLRVLLR QDSFFSLCRE LGALKSCAAL FSDPAAAGFV YLHESAGVQW FNKERFELLL DWFLKLDEDS ALHGMLRESA AAAGYRLDEL IKILGSPS
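The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the codon assignments of NCBI translation table 11 (the table listed in this record), which are identical to the standard code for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the gene begins with ATG. The function name `translate_cds` is hypothetical, and whitespace in the input is ignored so that the space-separated 10-base groups from the record can be pasted directly.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: translation ends here
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGAAGAAGA GATTAATCAA CTCCGACTAT"))  # MKKRLINSDY
```

The output matches the first ten residues of the protein sequence row. Passing the full gene sequence should yield the complete 1118-residue protein, since the final codon TGA is a stop.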
|
| |