Gene GM21_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0114 
Symbol 
ID8135417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp138830 
End bp141274 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content62% 
IMG OID644867734 
Productglycoside hydrolase family 57 
Protein accessionYP_003019958 
Protein GI253698769 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.0789579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAG CTAAAGAGAG GTTCGTCTGC ATTCACGGCC ATTTCTACCA GCCGCCGCGC 
GAGAACCCGT GGCTTGAGGC GGTCGAGATC CAGGATTCCG CCTTCCCCTA CCACGACTGG
AACGAGCGGA TCACGGCAGA GTGCTACGCC GCAAACTCCG CGTCCCGCAT ACTCGACGGC
GACCAGCGGG TCATGGATAT CACCAGCAAC TACGCCAAGA TCAACTTCAA TTTCGGCCCA
ACGGTACTCT CCTGGATGGC CTTCGCCGCT CCGAAAATCT ACCAGGCGAT CCTGGACGCG
GACAAGCTGA GCATGAAGTG GCGCTCCGGG CACGGTTCGG CCATCGCCCA GGTCTTCAAC
CACATGATCA TGCCGCTGGC CAACTCCAGG GACAAGCGGA CCCAGATCGT CTGGGGGATC
AAGGACTTCG AGCAGAGGTT CCAGCGCTTC CCGGAAGGGA TGTGGCTTGC GGAGACCGCA
GTGGACCTGG AGAGCCTGGA CCTCCTGGCC GAGTATGGGA TCAAGTACAC CATCCTCGCC
CCGCACCAGG CTGCGGGGTA CCGCGAGCTG GGCGCCGAGG AGTGGACCGA GACGGAGATC
GATCCCACCA GGGCCTACCT TTGCAGGCTT CCCTCCGGGC GCGAGATCAG CCTCTTTTTT
TACGACGGCC CCATATCCCG CGCGGTCGCC TTCGAAAATC TCCTGGACAG CGGCGAGGCG
TTAGCGAACC GGCTGGTGGG AGGATTCACC GAGGACCGCG ACTGGGAACA GCTCATGCAC
ATCGCCACCG ACGGCGAGAC CTACGGCCAC CACCAGAAAT TCGGCGACAT GGCGCTCGCC
GCCGCGCTGA ACCACATCGA GCAGAACAAC CTGGCGCGGC TCACCAACTA CGGCGAATAC
CTGGAGCTTT GCCCCCCGAC CATGGAGGCG AAGATCCACG AGCGGACCTC CTGGAGCTGC
GCCCATGGCG TTGAGCGCTG GAACAGCGAC TGCGGCTGCT CGGGCGGAAC GCCTGGATGG
AACCAGCAGT GGCGCGGCCC CTTGCGAGCC TCTCTCGACT GGCTGCGGGA CCGCCTGGCT
CAGGGGTTCT CCAGAAAGGG GGCGGAGCTT TTGAAGGACC CGTGGCAGGC TAGGGATGCC
TACATCGAGG TGATCCTGAA CCGGGAAATG GAGCAGGCCG AAAGCTTCCT GGCCCAGCAC
GCGAAGAAGG ATCTCGACGC CGACGAGAAA ATAGCCGCGC TGAAGCTCCT GGAGATGCAG
CGCCACGCCA TGCTGATGTA CACGAGCTGC GGCTGGTTCT TCGACGAACT CTCGGGGCTG
GAGACGGTGC AGGTGATCGA TTACGCCAGC CGCGCGTTGC AGCTTTCCGA TGGCATCGTG
GAACACGGCG TGGAGAAGGC ATTTCTGGAT CGTCTCAAGG AGGCGAAGAG CAACATCCCC
GCGCACCAGG ACGGCCTTTG GATCTACCAG AACTTCGTGC TCCCCATCCG GCTGGACCTG
GTCAAGGTCG GCGCCCACTA CGCCTTCAGT TCGCTCTACG AGGAGTACGA GGACCATTCC
CAGATCTACT GCTACGCCAT AGCGAAAGAG GAGTACGGCA AGATCTCCAC CCCGGACGCG
GTGATAGCCA TGGGGCGCAT CCACGTCGCC AGCGAGATCA CCGAGGAGAA CACCTGCCTC
ACCTTCTGCG TCATGCGCCT GGGGAGCCAC GACTTCAAGG GGGGGGTGAT CGAAAGCTGC
GACGGGGAGG CGTATGCGGC CATGCGGGAG GAGATGAGCG CCAGCTTCGA CAAAGGGCTC
TACACCGAGC TGGTCACCCT GATGGACAAG CACTTCGGCA CCCACAGCTT CTCGCTTTTG
AACCTCTTCT CCGACGAGCA GCGCAAGATC ATCAACATCA TCATCAACCA GAACATGGAG
GAGAGCATCT CCAGCTACCA GGATATGTTC GAGCGCAGCC GTCCGCTGAT GGAGTTCGTC
AAGGATACCC GGGTCCCGGT GCCGCACATA TTCCTGGCCG CGGCCGAGCC TGCTCTGAAC
CAGGCGCTGA AAAAGGCGAT GAGCGAGGAG GAGATCGACG AGGACGCGGT GCGCCGCATC
ATCGGGCAGA TCAAGAAGTG GCAGGTGGGG ATCGACGGCG GTGACACCGA GTACTTCATG
CGGCGGCACA TGGAGAGCAT GTCGGCGCAA CTGATGGAGG ACCCGGGTGA CGCGAAGCTC
ATGGGGAGGA TGCTGAAGTA CATGAACCTC CTGAACGAGA TCCCCATCAA CCTGGTGCTC
TGGCAGATGC AGAACGACTA CTACATCCTG GCCAAGACCG TCTACCCCGA TTACGCCGCA
AAGGCGGCCA AGGGGGAGGA GGGGGCGGCC GCATGGACCG AGGCGTTCCA GAAGCTGGGG
GAGACCTTCC GCTTCAATCT CGGCGCAGTG CTGCCGCAGG GGTAG
 
Protein sequence
MEQAKERFVC IHGHFYQPPR ENPWLEAVEI QDSAFPYHDW NERITAECYA ANSASRILDG 
DQRVMDITSN YAKINFNFGP TVLSWMAFAA PKIYQAILDA DKLSMKWRSG HGSAIAQVFN
HMIMPLANSR DKRTQIVWGI KDFEQRFQRF PEGMWLAETA VDLESLDLLA EYGIKYTILA
PHQAAGYREL GAEEWTETEI DPTRAYLCRL PSGREISLFF YDGPISRAVA FENLLDSGEA
LANRLVGGFT EDRDWEQLMH IATDGETYGH HQKFGDMALA AALNHIEQNN LARLTNYGEY
LELCPPTMEA KIHERTSWSC AHGVERWNSD CGCSGGTPGW NQQWRGPLRA SLDWLRDRLA
QGFSRKGAEL LKDPWQARDA YIEVILNREM EQAESFLAQH AKKDLDADEK IAALKLLEMQ
RHAMLMYTSC GWFFDELSGL ETVQVIDYAS RALQLSDGIV EHGVEKAFLD RLKEAKSNIP
AHQDGLWIYQ NFVLPIRLDL VKVGAHYAFS SLYEEYEDHS QIYCYAIAKE EYGKISTPDA
VIAMGRIHVA SEITEENTCL TFCVMRLGSH DFKGGVIESC DGEAYAAMRE EMSASFDKGL
YTELVTLMDK HFGTHSFSLL NLFSDEQRKI INIIINQNME ESISSYQDMF ERSRPLMEFV
KDTRVPVPHI FLAAAEPALN QALKKAMSEE EIDEDAVRRI IGQIKKWQVG IDGGDTEYFM
RRHMESMSAQ LMEDPGDAKL MGRMLKYMNL LNEIPINLVL WQMQNDYYIL AKTVYPDYAA
KAAKGEEGAA AWTEAFQKLG ETFRFNLGAV LPQG