Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4152 |
Symbol | |
ID | 4072343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4911948 |
End bp | 4914443 |
Gene Length | 2496 bp |
Protein Length | 831 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637986183 |
Product | glycoside hydrolase family protein |
Protein accession | YP_593226 |
Protein GI | 94971178 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.198899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.318604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTCA CATTCCTCCG CAAGTTCTCG GCAACGCTTC TCCTCAGCGT TGCCGTTGTC GCTTCAGCGC AGAAGCTTCC GACGAAGCAG GAAGCCGCCG CCCGCGCCGA AAAAATCCTC ACGCAAATGA CGCTCGAAGA AAAAGTCGCC TACATCGGCG GTGACCGCGA CTTCTACATC CGCGCCATCC CGCGCCTCAA CGTTCCCGAA ATCAAAATGT CCGACGGCCC GCTCGGCACC CGTAACGATG GCAACTCCAC CGCCTATCCT GCCGGTATCG CCCTCGCTGC CTCGTGGGAC ATCAAGCTCG CTCACGAGAT GGGAGCCGCT CTCGGTTCGG ACTCCCGCGC CCGTGGCGTG AACATCCTCC TCGGCCCCGG ACTCAACATC TATCGTGCGC CGATGTGCGG CCGCAACTTC GAGTACTTTG GCGAAGATCC CTACCTCGCC TCGCGCATGG CCGTCGCCGA CGTCCAGGGC ATCCAGAGCA TGGGCGTCAT CGCCACCGCC AAGCACTACG CCGCCAACAA CCAGGAGTGG GACCGCAACC GCGTCTCCTC CGATGTCGAC GAGCGCACAT TGCGCGAAAT CTACCTCCCC TCCTTCGAGT ACGCCGTGAA GGAAGGCCAC GCCGGCGCCA TCATGGACTC CTACAACCTC GTCAACGGCG TCCACTCCAC GCAGAACACC TTCCTCAACA TTGACGTTGC CCGCAAAGAC TGGAACTTCA CCGGCATCAT CATGTCCGAC TGGGAAGCTA CCTACGACGG CGTCGCCGCC GCCAACGGTG GCCTCGATCT CGAAATGCCC AGCGGCAAAT TCATGAGCCC CACCACGCTC CTGGCCGCCG TCAAAGATGG CTCCGTCAAA GAATCCGTCA TCGACGAAAA AGTTCGCCGC ATCCTGCGCA CTTCGATCGA GTTCGGCTTC TTCGATCGTC CGCAGAAAAC CGCCACCCCA TGGAACGATC CAGCCTCGCG TGCCGTCGCC CTCAAAGTCG CGCAGGAAGG CTTCGTCCTC CTCAAAAATC AAGGCGGTGT GCTTCCGCTC GATCGCACGA AATTCAAGAA CATCGCTCTC ATCGGCCCCA ACGCCGGCAT TCCCGCCACC GGCGGTGGTG GCAGCTCCAA GATCGATCCT TTCTCCGCTG TCTCTCCGGT TGACGCCGTG AAGAACCTCG TCGGCGATTC CGCTAAGATC GCTTACTATC CCGGCCTCCA ACTCATCTCC GACGTTTTCA AGACCACCAG CTTCACCACC ACCGCCGACG GCGATACCCA CGGCTTAGTT ACAGAGTTCT TTAACAACAA AGACCTCACC GGTCCGCCCG CGCTCACCCG TACCGACGAG CACATTGCCT TCAACTGGAG CGGCGGCCCC TACGCGCCCA ACGGCCAGCA GGAAAACTTC TCCGCGCGAT TCACCGGCTA CTACACTCCC GCCGCCGACG GCACCTACAC CTTCGCCGTC TCCGGCGATG ACGGCTTCCG CCTCTTCGTC GACGACAAAC CCGTCATCGA ACAATGGGTC TATCAAGGCG AGACCATCGT CACCAAGGCG CTCGATCTCA AAGCTGGCCA GCACTACAAG CTCCGCCTCG AGTACTTCCA GGGCGGCGGC GGTGCCGCTC TCGGCTTCGG CGTCACTGAC GGCAAGTCTT CCGCTCTCAC CGATGCCGTC AACGCCGCGA CAAACGCCGA CCTCGTCATC CTCTGCGTCG GCTTCGACGA CAAGTCCGAA GGCGAAGGCG CCGACCGTAC TTTCGCGCTC CCGCAGCCCC AATACGAACT CATCAAGCAA GTTGAGGCCG CCAACAAGAA CACCGTCATG GTCCTCACCG CCGGCGGCAA CGTGGACATG GTGCCGTTCA TCGACAACAC GCCTGCGCTC CTGCACGTCT GGTATCCCGG ACAGGAAGGC GCCACCGCCA TGGCCCAGGT CCTCTTCGGC GACATCAACC CGAGCGGCAA ACTCCCCGCC TCGTTCGAGC GCCGTTGGGA AGACAACGCC ACCTACAACA GCTACTACGA CCCCGATAAG ACGCTCCACG TGAAGTACAC CGAAGGCATC TTCGTCGGCT ACCGCCACTT CGACAAAGAC AACGTCAAGC CGATGTTCCC CTTCGGCTAC GGCCTCAGCT ACACCACCTT CCAATACGGC GGCCTCAAGA TCGGCGCACC TTCCGCCGAC AGCACCGTCC CCGTCACCTT TACCGTGAAG AACACCGGCA AGCGCGCCGG CGCCGAGATC GCCGAAGTCT ACGTCGGCGA GAAAAATCCC AAAGTTCCGC GCCCCGTGAA AGAACTCAAA GGCTTCGCCC GCGTCGAACT CAAACCCGGC GAATCCCGCA GCATCACCGT CAACCTCGAC CGCCGCGCCT TCTCCTGGTA CGACGCCAAC TCGCACCAGT GGACCGCCGA TACCGGCAAC TACGACATCC TCATAGGCAG CAGCAGCGCC AAGATCGAAC TAACCGGCAA CGTCGCCCTG CGATAA
|
Protein sequence | MNLTFLRKFS ATLLLSVAVV ASAQKLPTKQ EAAARAEKIL TQMTLEEKVA YIGGDRDFYI RAIPRLNVPE IKMSDGPLGT RNDGNSTAYP AGIALAASWD IKLAHEMGAA LGSDSRARGV NILLGPGLNI YRAPMCGRNF EYFGEDPYLA SRMAVADVQG IQSMGVIATA KHYAANNQEW DRNRVSSDVD ERTLREIYLP SFEYAVKEGH AGAIMDSYNL VNGVHSTQNT FLNIDVARKD WNFTGIIMSD WEATYDGVAA ANGGLDLEMP SGKFMSPTTL LAAVKDGSVK ESVIDEKVRR ILRTSIEFGF FDRPQKTATP WNDPASRAVA LKVAQEGFVL LKNQGGVLPL DRTKFKNIAL IGPNAGIPAT GGGGSSKIDP FSAVSPVDAV KNLVGDSAKI AYYPGLQLIS DVFKTTSFTT TADGDTHGLV TEFFNNKDLT GPPALTRTDE HIAFNWSGGP YAPNGQQENF SARFTGYYTP AADGTYTFAV SGDDGFRLFV DDKPVIEQWV YQGETIVTKA LDLKAGQHYK LRLEYFQGGG GAALGFGVTD GKSSALTDAV NAATNADLVI LCVGFDDKSE GEGADRTFAL PQPQYELIKQ VEAANKNTVM VLTAGGNVDM VPFIDNTPAL LHVWYPGQEG ATAMAQVLFG DINPSGKLPA SFERRWEDNA TYNSYYDPDK TLHVKYTEGI FVGYRHFDKD NVKPMFPFGY GLSYTTFQYG GLKIGAPSAD STVPVTFTVK NTGKRAGAEI AEVYVGEKNP KVPRPVKELK GFARVELKPG ESRSITVNLD RRAFSWYDAN SHQWTADTGN YDILIGSSSA KIELTGNVAL R
|
| |