Gene Information |
Locus tag | Acid345_1956 |
Symbol | |
ID | 4073216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2348905 |
End bp | 2351688 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983968 |
Product | glycoside hydrolase family protein |
Protein accession | YP_591031 |
Protein GI | 94968983 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
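The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 2348905
end_bp = 2351688


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: the stop codon is part of the gene length but is not
    counted as a residue.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2784 927
```

Under these assumptions the computed values agree with the Gene Length (2784 bp) and Protein Length (927 aa) fields of the record.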
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.289429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATAAACC CCCGGCTCTG TCTCGCACTC CTTCTAAGCG CATTCGCTAT CGTCCCAGTT CACGCACAGA CTCCTACTGC CAAGCCCGCG GCGGCCACGG TTCAGCCGCC GGAAGCCTCT TACAATCCGG TTGCCGATCC GAAGGCCGTG GTGGTGGTGG GCCACGCGCG CTTCACGGTG CTTACGCCGC AGATGATCCG GATGGAGTGG GCGGCGGATG GCAAGTTCGA AGACCGCCCG TCGTTTGTGT TTCTGAACCG GCTCCTGCCC GTGCCTGAGT TCACGCAGAA GAAAGACGGG CAGCGCGTCA CTCTGAAGAC TGCCGATGTG GAACTCACGT ATAACGCCGA CTCCGCCGGC GACGGCAAGT TCACCGCAGA GAACCTCACC GCGACGATTC AATTGAACGG CAAGGCGGTG ACGTGGCATC CCGGCATGGA TGACTCTGGC AATCTACAGG GCACAACGCG CACGCTCGAC GGCGCCAAGG GCAATCAGAC GAAAGAACCG ATCGACCCCG GCCTGGTTTC ACGGGATGGC TGGGTGGTTG TGGACGACTC CAAGCGTCCG CTTTTTGATA GCGACAACTT CACCTTCGCA CAGGGTGAAA AGAGCGAGTG GCCGTGGATT GTGCTGCGCA CGGACACTGA TCGCCAGGAT TGGTATTTCT TCGGATACGG CCACGAGTAC AAAAATGCTC TCGGCGATTT TGTGAAGGTC GCCGGCCGGA TTCCGATTCC GCCGCGGTTT GCGTTCGGCA TCTGGTGGTC GCGCTACTGG GCTTATAGCG ACCAGGAGTT AGACAACCTC GTCAGCGGTT TTCATGAAAA CGATTTGCCG CTCGATGTGC TCGTGATTGA CATGGACTGG CACCTGAACA AAGAGCAGCT GCTCGCGATG GGCGAGAAAG ACCAGTCTGG CCACGAACTG GGATGGTCGG GCTATACGTG GAACCCGCTG CTGTTCCCGG ACCCGAAGGC ATTTCTCGAT GGCATTCACG CGCAAGGGAT CAAGGCCACG CTGAACGTGC ATCCGGCGTC GGGGGTGCAT CCGTGGGAGA AGGCGTATCC CGAGATGGCG AAGGCGATGG GCATTGACCC TGCGACGCGC AAGTGGGTGG CCTTCGACAT TACGAACAAG AAATTCGCGC GCAATTACAT GGACCTGCTG CACCATCCGC TGGAGCGGCA GGGGATCAAC TTCTGGTGGC TCGACTGGCA ACAGGAGATG AAGACGGGCA CACCGGGCGT GAGCCCGACG TGGTGGTTGA ACTACGTGCA CTTTACCGAC CAGCAGATGG AAGGGAAGCG TCCGCTGTTG TTCCATCGCT GGGGCGGGTT GGGGAACCAT CGTTACCAGA TAGGTTTTTC GGGCGACACG ATTTCGGTGT GGGATTCGCT GGCGTTCCAG CCGTGGTTTA CGGCGACGGC AGCAAACGTT GGGTATGCGT ATTGGAGCCA CGACATCGGC GGGCACATGC CGGGCGTGGT GGACCCGGAA ATTATCACGC GCTGGATTGA GTTTGGCGCG TTCAGCCCGA TCCTGCGCAC GCATACCACG AAGAATCCGG ACTCCGAGCG TCGCGTGTGG GCGTATCCCG AGCCGTACGC GGACATCATG CGCGAGACCA TGCAGCACCG TGAACAGATG CAGCCGTACA TTTACACCGA AGCGCGGCGC ACCTACGACA CCGGTGTGGC GTTCCTGCAT CCGCTGTATT ACGACTGGCC GGAGGCGGAG CAGGCGTACA ACGTGAAGGA CGAGTATGTC TTTGGGAGTC AAATGCTGGT GGCGCCAATT 
ACGTCGCCGG TGGATCCGGT GACGCAACTG TCAACGCGCA AGGTTTGGAT TCCGCAAGGC GAATGGATCG AGCGATCGAG CGGAAAGCAC TTCGCCGGTC CCGCAGATGC GACGCGCAGT TTCGACATTC GCGAGACGCC GGTGTATGTG AAGGCGGGGG CGATCGTTCC GGGCCAGCCG CCGATGCAGC ATGCGGACCA GCGGCCGGTG GATCCGCTTA TCCTGAATGT ATTTCCGCTG GCCGATAAGC AGACTAGCGA ATACAAACTC TACTCCGATG GCAGCGACTC AGAGGCGTAC AAGCGCGGCG CGTTTTCGTG GACGAAGATC AGCGCTGCGC AGAGCGGCGA TGAACTCACG CTGACGATTG CGCCGGTGGA GGGAAGCTAT CCGGGGATGA CGACGGAGCG GGCCTACGAA TTGCGCCTGC CATACGATTG GCCGCCGGAA ACGGTGACGG CCAACGGGCA ATCACTCACT TTCACCGCAA AGGGCTCACC GAAAATCGGC TGGCGCTATG AAGGGAACAC GCTGAGCACG GTGATCACGA CGGTGCGCTA CTCGGTGCAC ACGCCGGTGA AGATCGTGGT GAAACGGGCT GGCGGATCGC TGGCGTCGCG CTCGCAACTG GATGGCTTCG CCGGTGCGAT TGCGCGTTTG CAGCTCGCTT ACGACACGCT GAACGCGCTG GAAAACTTCC GCACACGCGC GACTGATCCG GTGATTGATG CCTGGGAGAC CGGGGATCGT TTGAGCTATC GACCGGAGAA GGCCAAAGCC GAGATCGCGC GATTCCCGAA GGTCTACGCA GATGCGCTGG CGTCGGTGAA AGCACTGGTG GCGAAGGCTG ATACGGATGC AGGCGATCTC GATAAGAAGC GCGATGAGTA TCATCGCTCG GCGCAGGATG AGGAGCGGAT GAAGAACTTC CGCAACTATT TGAAGCGCGC TGAGAACGCG ATCGAAGATG GCGGCGTCAA GTAG
|
Protein sequence | MINPRLCLAL LLSAFAIVPV HAQTPTAKPA AATVQPPEAS YNPVADPKAV VVVGHARFTV LTPQMIRMEW AADGKFEDRP SFVFLNRLLP VPEFTQKKDG QRVTLKTADV ELTYNADSAG DGKFTAENLT ATIQLNGKAV TWHPGMDDSG NLQGTTRTLD GAKGNQTKEP IDPGLVSRDG WVVVDDSKRP LFDSDNFTFA QGEKSEWPWI VLRTDTDRQD WYFFGYGHEY KNALGDFVKV AGRIPIPPRF AFGIWWSRYW AYSDQELDNL VSGFHENDLP LDVLVIDMDW HLNKEQLLAM GEKDQSGHEL GWSGYTWNPL LFPDPKAFLD GIHAQGIKAT LNVHPASGVH PWEKAYPEMA KAMGIDPATR KWVAFDITNK KFARNYMDLL HHPLERQGIN FWWLDWQQEM KTGTPGVSPT WWLNYVHFTD QQMEGKRPLL FHRWGGLGNH RYQIGFSGDT ISVWDSLAFQ PWFTATAANV GYAYWSHDIG GHMPGVVDPE IITRWIEFGA FSPILRTHTT KNPDSERRVW AYPEPYADIM RETMQHREQM QPYIYTEARR TYDTGVAFLH PLYYDWPEAE QAYNVKDEYV FGSQMLVAPI TSPVDPVTQL STRKVWIPQG EWIERSSGKH FAGPADATRS FDIRETPVYV KAGAIVPGQP PMQHADQRPV DPLILNVFPL ADKQTSEYKL YSDGSDSEAY KRGAFSWTKI SAAQSGDELT LTIAPVEGSY PGMTTERAYE LRLPYDWPPE TVTANGQSLT FTAKGSPKIG WRYEGNTLST VITTVRYSVH TPVKIVVKRA GGSLASRSQL DGFAGAIARL QLAYDTLNAL ENFRTRATDP VIDAWETGDR LSYRPEKAKA EIARFPKVYA DALASVKALV AKADTDAGDL DKKRDEYHRS AQDEERMKNF RNYLKRAENA IEDGGVK
|
| |
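The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below translates only the first 30 bases of the gene sequence to show this. The codon dictionary is deliberately partial (it covers just the codons in that prefix), and the helper name and the set of start codons are illustrative assumptions, not a full implementation of table 11.

```python
# First three 10-base groups of the gene sequence, copied from the record.
prefix = "TTGATAAACC CCCGGCTCTG TCTCGCACTC".replace(" ", "")

# Partial codon table: only the codons that occur in this prefix.
CODONS = {
    "TTG": "L", "ATA": "I", "AAC": "N", "CCC": "P", "CGG": "R",
    "CTC": "L", "TGT": "C", "GCA": "A",
}

# Assumption: a commonly used subset of table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading an initiation codon as Met."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    # The first codon is read as M when it is a recognized start codon,
    # even if it would encode another residue elsewhere (TTG is Leu).
    first = "M" if codons[0] in START_CODONS else CODONS[codons[0]]
    return first + "".join(CODONS[c] for c in codons[1:])


print(translate_prefix(prefix))  # MINPRLCLAL
```

The result matches the first ten residues of the protein sequence in the record. Translating the full gene would need a complete codon table, for example from an established bioinformatics library.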