Gene Information |
Locus tag | Acid345_0387 |
Symbol | |
ID | 4069209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 444061 |
End bp | 446481 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982390 |
Product | Alpha-glucosidase |
Protein accession | YP_589466 |
Protein GI | 94967418 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
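The coordinate and length fields above are consistent with each other if Start bp and End bp are read as 1-based inclusive positions and the protein length excludes a single terminal stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon
    that is not counted as an amino acid."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(444061, 446481)   # Start bp, End bp from the record
    print(length, protein_length(length))  # compare with Gene Length / Protein Length
```

Under these assumptions the computed values match the Gene Length (2421 bp) and Protein Length (806 aa) fields.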
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.44264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCA GACAGCTACT GAGATTTCTG AAGAGCGCCA TGTTGAGCTT GCTCCTCGCA GGGGTTACGC TGGCCCAAGG CGGCCCGCTG GAATTGAAGC GGGAAGGCCG AGTAATCTCG CTGGTGCCGT ATGCCCCGAA CGTTTTGCGC GTGACGATGA GCATTGACAA CTCGGCGGCG TCGGCCGCAC CGGGCTTTGG GATCGTTGGT GCACCGTCGC CAACCGGATG GACTCACGAG CATGACGCCG ATGGCAGCGA GGTGTACCGG TCAGACAACA TGATCGTGCG TCTTGCGCCG GGCGATCTGC CGAAGGAGAA GCTGCCACGG CCGATGCCGC TCGACGAATT GAACCAGCAG CTTCGCGATG TGTACTTCGG AGGCGGAGGA GATCACGGTC CAAACCACGA TGCTCTGCTG GTGACAACGC CACAAGGCAA GATGCTGCTC CATATGCGCA CCTGGATCAT GACGCCGCAG CAAGAGGGAG CGCAGGCGAA ATCGGGCGGG AAGACGTACC GGGTTTCAGC GAGTTTTGAT TCCCCCTCGG ACGAGCACTA CTACGGATTA GGGCAGCAAC AGAAGGGCTG GATGGATCTG CGCGATCACC AGATCCACTG CTGGCATGAT TACGGCGCGA TTGGTGGAGA GAACGTCTGT GTGCCGTTTA TGGTTTCGAG CCGGGGCTAT GGGCTGGTCT GGGACAATCC ATCGAAGACA ACTGTGGACT TGGGATTCAA TGGACAAAAC CGTTGGACCT CGGACGTTGG CGACCGGGTT TCGTACTTCG TGATTGCCGG CGATTCCAGC GATCAAATCT ACGCGGGCTA TCGGTTGTTG ACGGGAGTGA CCCATCTGCT GCCGAGGGGC TCGTATGGGT ACATCCAGAG CAAGGCAATT TATCCGACGC AAGGGCAGAT CCTGGACCTT GCAAGAACTT ATCGCGAGAA GAAGGTTCCA CTCGACACGG TGGTGGTTGA TTTTCTGAAC ATGACCAGGC AAGGAGAAAT GGACCTCGAT CCAAAACGAT GGCCTGATCC CGCTGGGATG AATCGTCAAC TGCACGATAT GAATGTGCGG ACGCTACTGA GCGTATGGCC GCACTTTGCG CCGGGAACTC AGTTTTACGA CATGCTGCTG AAGAAAGGCT GGCTGATCCA CACGCCGGAT GGAAAGCCTG ACCACGGTTG GTACAAAGAG ATTATTGGAC CGAACCTCGA TACGACCAAC CCAGATGCTG CGAAGTGGTG GTGGGAACAG ATTCGCGACC GTTATGTAAA GCCGTATGGT TTCGACGATC TGTGGCTGGA TGAAACGGAG CCAGACGTCG ATCCGGCGAA TGACGTGTTC TGGGTCGGGC CGGGGACGAG CTTCTACAAC GTCTATCCGC TGTTTCATAC TGCGTCGGTG TACGAAGGGT TTCGCCGCGA TTTTGGCGAC AGCAAGAGGC TGATGATCCT GGCGCGAGCG TCTTATCTCG GCGCGCAGAG GAACGGCACT GTTTTCTGGT CGAGCGATAT TGTTTCGACA TGGGACATGC TGAGGCGCTC GATTCCGGCA GGGCTGAACT TCACCGCTAG CGGTATGCCG TATTGGGATA CGGACATCGC AGGCTTCTTC TCGCCTGCAA TTCCGGCGGA CCACCACGCC GAGCACAAAC CCCTGATCGA CGGGTCGGAC GCACGCGACG TGATCGACCG CTACGAGGAT TATCCCGAGC TCTTCGTGCG CTGGTTCGAA TGGGGCGCAT TTCATCCGGT GATGCGGGCC CACGGCGAGA GAAAGCACAA CGAGGTCTGG 
GCCTACGGGA AACAGGCCGA GCCGATTCTG ACGAAGTACC TGAAGCTTCG CTACGAGCTT CTGCCGTACA CGTACTCGGT TGCGTATCGC AGCTATGAAA CGGGTGCACC TTACATGCGC GCGCTGTTCA TGGATTTTCC CAACGACCCC AAGGCGTTGG ACATTCCGGA CGAGTACATG TACGGACCCG CTTTCCTGGT CGCCCCAGTA ACCGAACAGG GCGCGACGCA ACGAACGGTG TATTTGCCGG CCGGCTCCGA TTGGTACAAC TACTGGACCA ATGAGAAGCT GCACGGCGGG CAGACGGTCG TGGTGCAAGC TCCCATCGAT ACGCTGCCCT TGTTCGTGAG GGCAGGAAGC ATCGTGCCGT TTGGGTCAGA GGTGCAGAGC GCGCAGCAAG AGCAGAAGAT CGCGTCGGTA CGGATTTATC CGGGAGCGAA TGGGAGCTTC ACGTTGTTCC AGGATGACGG CAAGACGTAT GCGTACGAGA AAGGCGCGGG CTCCGTCACC AAGCTCATGT GGAATGACGC GGAGGGCCGA CTGAAACACG AAGGCGTGCC AGCATGGAAC GGATCGGATG AGTCCATTGT GGTGGTCGTA GGCAAGAAAC CGATGCAGTG A
|
Protein sequence | MNSRQLLRFL KSAMLSLLLA GVTLAQGGPL ELKREGRVIS LVPYAPNVLR VTMSIDNSAA SAAPGFGIVG APSPTGWTHE HDADGSEVYR SDNMIVRLAP GDLPKEKLPR PMPLDELNQQ LRDVYFGGGG DHGPNHDALL VTTPQGKMLL HMRTWIMTPQ QEGAQAKSGG KTYRVSASFD SPSDEHYYGL GQQQKGWMDL RDHQIHCWHD YGAIGGENVC VPFMVSSRGY GLVWDNPSKT TVDLGFNGQN RWTSDVGDRV SYFVIAGDSS DQIYAGYRLL TGVTHLLPRG SYGYIQSKAI YPTQGQILDL ARTYREKKVP LDTVVVDFLN MTRQGEMDLD PKRWPDPAGM NRQLHDMNVR TLLSVWPHFA PGTQFYDMLL KKGWLIHTPD GKPDHGWYKE IIGPNLDTTN PDAAKWWWEQ IRDRYVKPYG FDDLWLDETE PDVDPANDVF WVGPGTSFYN VYPLFHTASV YEGFRRDFGD SKRLMILARA SYLGAQRNGT VFWSSDIVST WDMLRRSIPA GLNFTASGMP YWDTDIAGFF SPAIPADHHA EHKPLIDGSD ARDVIDRYED YPELFVRWFE WGAFHPVMRA HGERKHNEVW AYGKQAEPIL TKYLKLRYEL LPYTYSVAYR SYETGAPYMR ALFMDFPNDP KALDIPDEYM YGPAFLVAPV TEQGATQRTV YLPAGSDWYN YWTNEKLHGG QTVVVQAPID TLPLFVRAGS IVPFGSEVQS AQQEQKIASV RIYPGANGSF TLFQDDGKTY AYEKGAGSVT KLMWNDAEGR LKHEGVPAWN GSDESIVVVV GKKPMQ
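The sequences above are displayed in space-separated blocks of ten characters. Before the GC content field can be recomputed from the gene sequence, the grouping whitespace has to be removed. The sketch below shows one way to do that; the helper names are hypothetical, and it assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(grouped: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence, for illustration;
    # paste the full Gene sequence field to check the reported GC content.
    fragment = "ATGAACAGCA GACAGCTACT"
    seq = clean_sequence(fragment)
    print(len(seq), round(gc_content(seq), 2))
```

The same `clean_sequence` helper also works on the protein sequence field, for example to confirm its residue count.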