Gene GM21_0695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0695 
Symbol 
ID8136010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp831796 
End bp835152 
Gene Length3357 bp 
Protein Length1118 aa 
Translation table11 
GC content65% 
IMG OID644868312 
Productalpha amylase catalytic region 
Protein accessionYP_003020527 
Protein GI253699338 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00000000568993 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAGA GATTAATCAA CTCCGACTAT TTCCCCTTCC AGATTCAGAT CTCGGCGGAT 
TGCGGCTCGA AGATTGAGTT GCGCGACCTG CTCGAGGAGC TGGGAGAGGT GCCTGGCATC
ATCTACGCCA GGCGGATAGC GGCCAAGCTG AACCGCCAGT TGGCCCCGCA GGAAACCCCG
GTCCAGCCCG GCCTTTTGCA CCTATATTCC ATCTTGAGCC AGGTTTACCG CTACGTGCTC
AGCCAGTACT GCTCCAAACA ACAGCCGAGC ATACTGGCGG CCTTGATGGC GCAGGCGGGC
TACCCGGAGT TCTCCGGCGA CGCCGGCCGT GCGTTGTACC GCTTCATGGA GCTTTTCCCC
TCACGGCAGA TGGTCCTCGG CAACGAAACC CCGGAAGGGC TGTTGGCGCA GGACGGCGAG
GATCTGGCGC GCCGGCAGGC CTTGACCGCG GAGCTGCTCC TCATGCTCTT AAACGGGGAA
AACCGCGCGC TGAACCAGTT CCGGCGCATC TTCGACGCTT CGGAACTGGC CTCCTCCTCC
CCCTACCTCG CGGTGGCCCT GGAGCTCGAC CGCCGCCTGG CGAAGGCCCC CCCGTTCGAG
CCGGTGGGTG TGTCGCTCAC GGAGCTTTTG CGCGCCCCGC TGAAGGCCTC CCCCGACTCC
CTTTCCGGCC AGATCGCCTA CATACGTGAC AACTGGGCCT CGTTTCTCCC GCAGGAGCTT
CTGGGCGAGC TCGTGACCGC GCTCGACATC GTCTCCCAGG AAGGGCGCGC CTTCTTCGGC
GGCCCGGGCG AACCCAAGGT GCTTAAATTC GGCAGGAACG CCCAGCGAGG CGGCGACGAA
TATCCCGAGT ATGAGCGCTT TTCCCGGGAC GCCGACTGGA TGGCCAACGT GGTGATGATC
GCTAAGATGG TCTACGTCTG GCTGGGGCAG TTGGCGAAGA CCTACCAGAC CGAGGTGCAC
ACCCTGGACC AGATCCCGGA CGCCGAGCTG GACCGGCTGG CGCGCTACGG CTTCTCGGCC
CTGTGGCTGA TCGGCATATG GGAGCGCTCC CCTGCCTCTC AGACCATCAA GCGCATCTCC
GGCAACCAGG AGGCGATCTC CTCGGCCTAT TCGCTCTACG ACTACGTGAT CGCGCACGAC
CTTGGGGGGG AGTGGGCCTT GGAGAACCTG AGGCGGCGCT GCGCCGCGCG GGGCATCAGG
CTTGCGAGCG ACATGGTCCC CAACCATACC GGCCTCTACT CCAAGTGGAC CGTGGAGCAC
CCCGACTGGT TCATCCAGCT CGATTACCCC CCCTATCCCG ACTACCAGTT CAACGGACCC
GACCTCTCCC CCGACGGCCG TGTGGGGCTC TTCATCGAGG ACGGCTACTG GGACAAGCGC
GACGCGGCGG TGGTGTTCAA GCACCTGGAC CGCGACAACG GCAGGGTGCG CTACATCTAT
CACGGCAACG ACGGCACGAG CACCCCCTGG AACGACACGG CGCAGCTGAA CTACCTGATC
CCGGAGGTGC GCGAGGCGGT GATAGGGACC ATCCTCCACG TGGCGCGCCA GTTCCCGATC
ATCCGTTTCG ACGCCGCCAT GACCCTTGCC AAGAAGCACT ACCAGCGCCT GTGGTTCCCG
CTCCCCGGGC ACGGCTCGGG CGTCCCATCC AGGGCCGAGC ACGGCATGGA CCGCGCGAGT
TTCGACGAGG TCTTCCCGGT CGAGTTCTGG CGCGAGGTGG TGGACCGGGT CGCGGTGGAG
GCGCCCGACA CGCTACTCCT GGCGGAGGCG TTCTGGCTCA TGGAGGGGTA CTTCGTGCGC
ACCCTCGGCA TGCACCGGGT CTACAACAGC GCCTTCATGA ACATGCTGAA GATGGAGGAG
AATGCCAAGT ACCGGCAGAC GGTGAAAAAC GTGCTGGAGT TCGAACCGGA GATCCTGAAG
CGGTTCGTCA ACTTCATGAA CAACCCCGAC GAACGCACCG CGGTCGAGCA GTTCGGCAAG
GAAGGGAAGT ACTTCGGCGC GACGGTGCTC CTGGTCACCA TGCCGGGGCT CCCCATGGTC
GGGCACGGAC AGATCGAGGG GTTCCACGAG AAGTACGGGA TGGAGTACAA GCGGGCCTAT
TGGGACGAGC CGGTGGACGA GCACCTGGTG GCCCGGCACG AGAGCGACAT CTTCCCGCTC
ATGCGCAGGC GCCACATCTT CTCGGGAAGC GAGCAGTTCA CCCTGTACGA CTTTTTCAGC
GGCAACAGCG TGAACGAGAA CGTCTTCGCC TACTCCAACC GCAACAACGG CGAACGCGGG
CTGATCCTTT TCCACAACGC CTTCGCCTCC ACCGCGGGGT GGATCAGGAG CTCGTGCGCG
GTGCTCAGGA AGAACGCCGC AGGGGGGACC TCGCTGGTTC AGACGAACCT TGGGGAGGCG
CTGGGCTTCA AGGGGGACGG GCGCCATTAC TACTCGTTCC GGGATTACGC CTCGGGGCTT
AGCTACTTGA GAAACGGCCG CGACCTCTGC GACCAGGGGC TCTACGTGGA GATGGAGGGG
TACGAGTACC ACGCCTTCCT CGACTTCGAG GAGATCTACG ACGACGATTT CGGGACCTGG
GGCGCCCTCT GTTACCGCAT GAATGGGGCG GGGGTGGAGA GCATCGAGGA GGAGGTGAAG
CAAGTTCGCT ACGCCTCGGC CCACGGCGCC CTGCAATCGC TGCTCTCCAA GGCGGCCGCC
GCGGCCCGCG AGCCGGGAGC CACCGTGCAG TCGCTCCTAT CCCAGCTGGA GCCGCTTCTG
GCTGCCTTCT TCAAGGCCGT CGCGCCGCAG GCCCCCGAAA AGGTGCAGCG CCTTTTGGTG
AGCGCCTTCG GAACCGAAGC GGACGACGCG CTGCGGGGGG CATCGACCGG TCCGGAGCGC
GCCCCCGGGG AATGGCTCCT TCTTTGCGCC TATCTGGCCC TGCACCGGAT CGGCGACCTC
TCAGGCGAGG AATCGGCGGA GGTGGTCGAT GCGCTCGGCC TTGTGCGCCC GGTGGTGCAT
GCCTTTCACG CACTTCCCCC CGCCGAGCCG GAAGAGGCGG CCGACCTCTC GCCGGACGCC
TACGGCAAGC TGTTGCGGGT ACTTTTGCGC CAAGATTCCT TCTTCTCGCT CTGCCGCGAG
CTCGGTGCGT TGAAAAGCTG TGCCGCGCTT TTCTCCGACC CGGCGGCTGC CGGTTTTGTC
TACCTGCATG AAAGCGCCGG GGTGCAGTGG TTCAACAAGG AGCGCTTCGA GCTCCTGCTC
GACTGGTTCC TGAAGCTAGA CGAGGACAGT GCGCTGCACG GGATGCTCAG GGAGAGCGCC
GCGGCGGCCG GCTACCGGCT CGATGAGCTA ATTAAAATCT TGGGGTCTCC TTCCTGA
 
Protein sequence
MKKRLINSDY FPFQIQISAD CGSKIELRDL LEELGEVPGI IYARRIAAKL NRQLAPQETP 
VQPGLLHLYS ILSQVYRYVL SQYCSKQQPS ILAALMAQAG YPEFSGDAGR ALYRFMELFP
SRQMVLGNET PEGLLAQDGE DLARRQALTA ELLLMLLNGE NRALNQFRRI FDASELASSS
PYLAVALELD RRLAKAPPFE PVGVSLTELL RAPLKASPDS LSGQIAYIRD NWASFLPQEL
LGELVTALDI VSQEGRAFFG GPGEPKVLKF GRNAQRGGDE YPEYERFSRD ADWMANVVMI
AKMVYVWLGQ LAKTYQTEVH TLDQIPDAEL DRLARYGFSA LWLIGIWERS PASQTIKRIS
GNQEAISSAY SLYDYVIAHD LGGEWALENL RRRCAARGIR LASDMVPNHT GLYSKWTVEH
PDWFIQLDYP PYPDYQFNGP DLSPDGRVGL FIEDGYWDKR DAAVVFKHLD RDNGRVRYIY
HGNDGTSTPW NDTAQLNYLI PEVREAVIGT ILHVARQFPI IRFDAAMTLA KKHYQRLWFP
LPGHGSGVPS RAEHGMDRAS FDEVFPVEFW REVVDRVAVE APDTLLLAEA FWLMEGYFVR
TLGMHRVYNS AFMNMLKMEE NAKYRQTVKN VLEFEPEILK RFVNFMNNPD ERTAVEQFGK
EGKYFGATVL LVTMPGLPMV GHGQIEGFHE KYGMEYKRAY WDEPVDEHLV ARHESDIFPL
MRRRHIFSGS EQFTLYDFFS GNSVNENVFA YSNRNNGERG LILFHNAFAS TAGWIRSSCA
VLRKNAAGGT SLVQTNLGEA LGFKGDGRHY YSFRDYASGL SYLRNGRDLC DQGLYVEMEG
YEYHAFLDFE EIYDDDFGTW GALCYRMNGA GVESIEEEVK QVRYASAHGA LQSLLSKAAA
AAREPGATVQ SLLSQLEPLL AAFFKAVAPQ APEKVQRLLV SAFGTEADDA LRGASTGPER
APGEWLLLCA YLALHRIGDL SGEESAEVVD ALGLVRPVVH AFHALPPAEP EEAADLSPDA
YGKLLRVLLR QDSFFSLCRE LGALKSCAAL FSDPAAAGFV YLHESAGVQW FNKERFELLL
DWFLKLDEDS ALHGMLRESA AAAGYRLDEL IKILGSPS