Gene Information |
Locus tag | Acid345_4061 |
Symbol | |
ID | 4072483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4805289 |
End bp | 4807484 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986092 |
Product | Beta-glucosidase |
Protein accession | YP_593135 |
Protein GI | 94971087 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| |
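The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a stop codon that contributes no residue to the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4805289
end_bp = 4807484
gene_length_bp = 2196
protein_length_aa = 731

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final stop codon
# (assumed present) does not encode an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 4807484 - 4805289 + 1 = 2196 bp, and 2196 / 3 - 1 = 731 aa.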
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.905446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0160169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTT CTCGCGTATT TCTTGCCACC ACTTTAGCGC TGTCAATGTT TGGTTCTGCG CAGCGCCCGG ATCCCAGGGC CGAACTCATG AAGAAGCCCT GGATGGACAA AAACAAATCC GCCGATGAGC GCGCCGAACT CGTGCTGAAG GAAATGACTG TAGACGAGAA GATCAACCTC ATCCACGGCC AGGGCATGGA AGAAATGGCA GAGTTCGGGA TTCCGATGCC GAACAAGGCG CTCGGAAATG GCGGCGCCGG ATTCGTCCTC GGTGTGGAAC GGCTTGGAAT CCCGCTCATC CAGATGAGCG ATGCGGCGTA TGGCGTTCGT AGCAGCGCAA AGAATGGCCG GTATTCCACA GCGTTACCGT CGAACCTGGG CTCAGCCGCG AGTTGGGACC CGCAGGCCGC GTGCGAGTAT GGCGCACTCA TCGGTCGTGA ACTGCGCGCG CAGGGCTACA ACATGACACT CGGCGGAGGG GTGAACATCA CCCGCGAACC GCGCAACGGC CGCACCTTCG AATACATGGG CGAAGACCCG ATTCTCGCTG GAACTTTGGT TGGCAATCGG ATCAAGTGTG AGCAGGCGCA GCACGTCATC GGCGACACGA AGCACTATGC CGTCAACGAC CAGGAGAGCG GTCGAACGGA AGTTGACGTC GTGATCAGCA AGCGCGCGAT GCGCGAGACC GACCTTCTCG CGTTTGAAAT CGGTATCGAG ATCGGACAGC CGGGCGCAGT GATGTGCTCG TACAACCTGG TGAACGGCAC GTACGCTTGC CAGAACAAGT ATCTGCTTAC CGATGTTTTG AAGAAGGACT GGAACTTTAA GGGCTTCGTA GTTTCCGATT GGGGGGCCAC CCACAGCACG ATCGAATCGT CCGCTGCCGG CCTAGACAAC GAACAGCCAT TCGGCATTTT CTATTCCGAC AAATTCAAGG CCGGTCTCGA CTCGGGCAAA ATCCCAATGT CGGAACTCGA CGACCATGTT CGGCGCATTC TGCGCACCGA GTTTGCGTCA GGCATCGTCG ACAATCCGGT TGTGAAAGCT GTCGTTGATG TTGAAGCAGG TCTCGAAACC GCGCGCAAGA TCGAAGAGGG GAGCATCGTT CTTCTAAAGA ATGGCAACAA CATTCTTCCG CTCGACAAAG ACTCGATCAA GTCTGTCGCA ATTATCGGAG AGAAGGCCGA TTTCGGAATG ATCTCTGGTG GTGGCTCCGC GCAGGTGGAT CCTCCGGGTC CGTCGCATGA GTGGCAAGCC CATGTCTGGT TCCCAACCTC GCCGCTGAAG GCTGTGCAAG CCAAGCTACC GAATGCGAAA GTTGACTGGC GCTCCGGCTC GGACAAGAGC AAAGCCGCCG CGCTTGCCAA GCAGTCCGAC ATCGCGATCG TGTTTGTACA TCAATGGATC AGTGAAGGCA TGGACCTGCC TGATCTGACA CTGCCTTTCG GCCAGGATGA ACTGATCGAG CGCGTTGCCG CTGCAAATCC GCGGACGATT GTTGTTCTAG AATCCGGTAC TGCGGTCACC ATGCCGTGGC TCGATAAAGT CAGCGGCGTA GTGGAAGCCT GGTACGGAGG CAGCAAGGGT GCCGACGCCG TCGCGAACAT TTTGTTCGGC GATGTGAATC CCTCCGGAAA GCTGCCGATG ACTTTCCCGC GCAGCGTAGA AGACGTGCCG CATCCGAAGC TCGCGGTGCC TCCCCCGAAC CCGAATCCAA TGGAAACCTA TATGCACCCC GAACTCGCGA AGGCTACGGT CACAGCGACA TACGACGAGG GCGTAAAAGT CGGGTACAAG 
TGGTATGACG CCGAGAAGAA GCCGGTCCTG TTCCCGTTTG GCTTCGGGCT CTCCTACACC ACGTATCAGT ACAGCGGATT GAAGGTTTCC AGCGGGAAGA CGACAACTGT GAGCTTCACG GTGAAAAATA CCGGAAAGCG AGGGGGCGAC GAGATCGCGC AAGTCTACGC ATCGCTTCCG GCGGAAGCGC AGGAGCCTCC GAAGCGTTTG GTGGGCTTCA CCAAAGTCCA TCTTAATGCG GGTGAATCGA AGGAAGTGAG CGTAACCGTG CCAGCGAAAT ATCTATCGGT CTTCGATGAA GCGTCGAACG GATGGAAGCT GGTTCCGGGC AGCTATAGCT TTATGGTCGG CGGATCGTCG CAAGACCTTC CGCAGAAACA ATCGGCTACG TTCTAA
|
Protein sequence | MKISRVFLAT TLALSMFGSA QRPDPRAELM KKPWMDKNKS ADERAELVLK EMTVDEKINL IHGQGMEEMA EFGIPMPNKA LGNGGAGFVL GVERLGIPLI QMSDAAYGVR SSAKNGRYST ALPSNLGSAA SWDPQAACEY GALIGRELRA QGYNMTLGGG VNITREPRNG RTFEYMGEDP ILAGTLVGNR IKCEQAQHVI GDTKHYAVND QESGRTEVDV VISKRAMRET DLLAFEIGIE IGQPGAVMCS YNLVNGTYAC QNKYLLTDVL KKDWNFKGFV VSDWGATHST IESSAAGLDN EQPFGIFYSD KFKAGLDSGK IPMSELDDHV RRILRTEFAS GIVDNPVVKA VVDVEAGLET ARKIEEGSIV LLKNGNNILP LDKDSIKSVA IIGEKADFGM ISGGGSAQVD PPGPSHEWQA HVWFPTSPLK AVQAKLPNAK VDWRSGSDKS KAAALAKQSD IAIVFVHQWI SEGMDLPDLT LPFGQDELIE RVAAANPRTI VVLESGTAVT MPWLDKVSGV VEAWYGGSKG ADAVANILFG DVNPSGKLPM TFPRSVEDVP HPKLAVPPPN PNPMETYMHP ELAKATVTAT YDEGVKVGYK WYDAEKKPVL FPFGFGLSYT TYQYSGLKVS SGKTTTVSFT VKNTGKRGGD EIAQVYASLP AEAQEPPKRL VGFTKVHLNA GESKEVSVTV PAKYLSVFDE ASNGWKLVPG SYSFMVGGSS QDLPQKQSAT F
|
| |