Gene Acid345_0387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0387 
Symbol 
ID4069209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp444061 
End bp446481 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content58% 
IMG OID637982390 
ProductAlpha-glucosidase 
Protein accessionYP_589466 
Protein GI94967418 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.44264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGCA GACAGCTACT GAGATTTCTG AAGAGCGCCA TGTTGAGCTT GCTCCTCGCA 
GGGGTTACGC TGGCCCAAGG CGGCCCGCTG GAATTGAAGC GGGAAGGCCG AGTAATCTCG
CTGGTGCCGT ATGCCCCGAA CGTTTTGCGC GTGACGATGA GCATTGACAA CTCGGCGGCG
TCGGCCGCAC CGGGCTTTGG GATCGTTGGT GCACCGTCGC CAACCGGATG GACTCACGAG
CATGACGCCG ATGGCAGCGA GGTGTACCGG TCAGACAACA TGATCGTGCG TCTTGCGCCG
GGCGATCTGC CGAAGGAGAA GCTGCCACGG CCGATGCCGC TCGACGAATT GAACCAGCAG
CTTCGCGATG TGTACTTCGG AGGCGGAGGA GATCACGGTC CAAACCACGA TGCTCTGCTG
GTGACAACGC CACAAGGCAA GATGCTGCTC CATATGCGCA CCTGGATCAT GACGCCGCAG
CAAGAGGGAG CGCAGGCGAA ATCGGGCGGG AAGACGTACC GGGTTTCAGC GAGTTTTGAT
TCCCCCTCGG ACGAGCACTA CTACGGATTA GGGCAGCAAC AGAAGGGCTG GATGGATCTG
CGCGATCACC AGATCCACTG CTGGCATGAT TACGGCGCGA TTGGTGGAGA GAACGTCTGT
GTGCCGTTTA TGGTTTCGAG CCGGGGCTAT GGGCTGGTCT GGGACAATCC ATCGAAGACA
ACTGTGGACT TGGGATTCAA TGGACAAAAC CGTTGGACCT CGGACGTTGG CGACCGGGTT
TCGTACTTCG TGATTGCCGG CGATTCCAGC GATCAAATCT ACGCGGGCTA TCGGTTGTTG
ACGGGAGTGA CCCATCTGCT GCCGAGGGGC TCGTATGGGT ACATCCAGAG CAAGGCAATT
TATCCGACGC AAGGGCAGAT CCTGGACCTT GCAAGAACTT ATCGCGAGAA GAAGGTTCCA
CTCGACACGG TGGTGGTTGA TTTTCTGAAC ATGACCAGGC AAGGAGAAAT GGACCTCGAT
CCAAAACGAT GGCCTGATCC CGCTGGGATG AATCGTCAAC TGCACGATAT GAATGTGCGG
ACGCTACTGA GCGTATGGCC GCACTTTGCG CCGGGAACTC AGTTTTACGA CATGCTGCTG
AAGAAAGGCT GGCTGATCCA CACGCCGGAT GGAAAGCCTG ACCACGGTTG GTACAAAGAG
ATTATTGGAC CGAACCTCGA TACGACCAAC CCAGATGCTG CGAAGTGGTG GTGGGAACAG
ATTCGCGACC GTTATGTAAA GCCGTATGGT TTCGACGATC TGTGGCTGGA TGAAACGGAG
CCAGACGTCG ATCCGGCGAA TGACGTGTTC TGGGTCGGGC CGGGGACGAG CTTCTACAAC
GTCTATCCGC TGTTTCATAC TGCGTCGGTG TACGAAGGGT TTCGCCGCGA TTTTGGCGAC
AGCAAGAGGC TGATGATCCT GGCGCGAGCG TCTTATCTCG GCGCGCAGAG GAACGGCACT
GTTTTCTGGT CGAGCGATAT TGTTTCGACA TGGGACATGC TGAGGCGCTC GATTCCGGCA
GGGCTGAACT TCACCGCTAG CGGTATGCCG TATTGGGATA CGGACATCGC AGGCTTCTTC
TCGCCTGCAA TTCCGGCGGA CCACCACGCC GAGCACAAAC CCCTGATCGA CGGGTCGGAC
GCACGCGACG TGATCGACCG CTACGAGGAT TATCCCGAGC TCTTCGTGCG CTGGTTCGAA
TGGGGCGCAT TTCATCCGGT GATGCGGGCC CACGGCGAGA GAAAGCACAA CGAGGTCTGG
GCCTACGGGA AACAGGCCGA GCCGATTCTG ACGAAGTACC TGAAGCTTCG CTACGAGCTT
CTGCCGTACA CGTACTCGGT TGCGTATCGC AGCTATGAAA CGGGTGCACC TTACATGCGC
GCGCTGTTCA TGGATTTTCC CAACGACCCC AAGGCGTTGG ACATTCCGGA CGAGTACATG
TACGGACCCG CTTTCCTGGT CGCCCCAGTA ACCGAACAGG GCGCGACGCA ACGAACGGTG
TATTTGCCGG CCGGCTCCGA TTGGTACAAC TACTGGACCA ATGAGAAGCT GCACGGCGGG
CAGACGGTCG TGGTGCAAGC TCCCATCGAT ACGCTGCCCT TGTTCGTGAG GGCAGGAAGC
ATCGTGCCGT TTGGGTCAGA GGTGCAGAGC GCGCAGCAAG AGCAGAAGAT CGCGTCGGTA
CGGATTTATC CGGGAGCGAA TGGGAGCTTC ACGTTGTTCC AGGATGACGG CAAGACGTAT
GCGTACGAGA AAGGCGCGGG CTCCGTCACC AAGCTCATGT GGAATGACGC GGAGGGCCGA
CTGAAACACG AAGGCGTGCC AGCATGGAAC GGATCGGATG AGTCCATTGT GGTGGTCGTA
GGCAAGAAAC CGATGCAGTG A
 
Protein sequence
MNSRQLLRFL KSAMLSLLLA GVTLAQGGPL ELKREGRVIS LVPYAPNVLR VTMSIDNSAA 
SAAPGFGIVG APSPTGWTHE HDADGSEVYR SDNMIVRLAP GDLPKEKLPR PMPLDELNQQ
LRDVYFGGGG DHGPNHDALL VTTPQGKMLL HMRTWIMTPQ QEGAQAKSGG KTYRVSASFD
SPSDEHYYGL GQQQKGWMDL RDHQIHCWHD YGAIGGENVC VPFMVSSRGY GLVWDNPSKT
TVDLGFNGQN RWTSDVGDRV SYFVIAGDSS DQIYAGYRLL TGVTHLLPRG SYGYIQSKAI
YPTQGQILDL ARTYREKKVP LDTVVVDFLN MTRQGEMDLD PKRWPDPAGM NRQLHDMNVR
TLLSVWPHFA PGTQFYDMLL KKGWLIHTPD GKPDHGWYKE IIGPNLDTTN PDAAKWWWEQ
IRDRYVKPYG FDDLWLDETE PDVDPANDVF WVGPGTSFYN VYPLFHTASV YEGFRRDFGD
SKRLMILARA SYLGAQRNGT VFWSSDIVST WDMLRRSIPA GLNFTASGMP YWDTDIAGFF
SPAIPADHHA EHKPLIDGSD ARDVIDRYED YPELFVRWFE WGAFHPVMRA HGERKHNEVW
AYGKQAEPIL TKYLKLRYEL LPYTYSVAYR SYETGAPYMR ALFMDFPNDP KALDIPDEYM
YGPAFLVAPV TEQGATQRTV YLPAGSDWYN YWTNEKLHGG QTVVVQAPID TLPLFVRAGS
IVPFGSEVQS AQQEQKIASV RIYPGANGSF TLFQDDGKTY AYEKGAGSVT KLMWNDAEGR
LKHEGVPAWN GSDESIVVVV GKKPMQ