Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0907 |
Symbol | |
ID | 4069118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1136677 |
End bp | 1137735 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982914 |
Product | ABC sugar transporter, periplasmic binding protein |
Protein accession | YP_589984 |
Protein GI | 94967936 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.40288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0322478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAACT TCATCCTCTT ACTCTGTATA GTGGCTCTGC TCGAATTCTC AACATCCTGC CACCGTGGGC ATGAAGAGGC GCGGCAATCT CGCGGCAAGG GCCCGGTGAA GATTGGATTG TCGCTCGACT CGCTGCAATT GGAACGCTGG CAACATGATC GGGATGCCTT CGTGGCGAAA GCCAGCCAAC TCGGAGCCGA AGTATTCATA CAATCCGCCA ACGGAGTGGA TGCTGTCCAG ATCCGACAGT GCGAAAACTT GCTGACGATG GGCGTCGATG TTCTTGTCAT AGTGCCGCAT AACGGCGAAG TCATGGCATC GGCGGTGCGC AGTGCTGAGG CGCAGGGCGT GCCCGTGATC TCGTATGACC GACTGATTCG CGATTCGAAT GTGAGTCTGT ATGTCTCATT CGACAACAAG CTGATTGGTG AGTTACAGGC GAAATACCTC TACGCCCGCG CACCAGCGGG CAATTACATT CTTATCGGCG GTTCACCAAC GGATAACAAC GCTCACTTAA TTCGCGAAGG CCAAATGCAG GTTCTTAGTC CCGCCATCAA GCGCGGCGAC ATTCGCATCA TCGCGGACCA ATGGGCCAAA GACTGGCTGC CTAGTGAAGC TCTGCGTCAC ACCGAAAACG CTCTCACGCA GGCAAATAAT CATGTCGCCG CGGTGGTGAC TTCAAATGAC AGCACCGCCG GTGGCGCCAT TCAGGCGCTT GGAGAACAGG GCTTGGCGGG GAGAGTGCTT GTATCGGGTC AAGACACCGA CCTCGCGGCC GCGCAGCGCG TCGTCGAGGG CACGCAGTCG ATGACCGTGT ACAAGCCGAT CAAGCCCCTC GCAGAAAATG CAGCAGCGGC GGCGGTAGCG CTTGCGCGTG GCGAGAAGGT CCAGTCAAAT TCGAACGTTA ACAACGGCGC GAAAGAAGTT CCTTCAATTC TTCTCGCGCC GATTGTTGTT GATCGCACGA ACATCGATTC TACTGTCATT AAAGACGGCT TCTTGAAGCG CGAAGACATA TACAAAAATG TCTCGCGCAC GCAGTGGCCG AAGGACTAG
|
Protein sequence | MRNFILLLCI VALLEFSTSC HRGHEEARQS RGKGPVKIGL SLDSLQLERW QHDRDAFVAK ASQLGAEVFI QSANGVDAVQ IRQCENLLTM GVDVLVIVPH NGEVMASAVR SAEAQGVPVI SYDRLIRDSN VSLYVSFDNK LIGELQAKYL YARAPAGNYI LIGGSPTDNN AHLIREGQMQ VLSPAIKRGD IRIIADQWAK DWLPSEALRH TENALTQANN HVAAVVTSND STAGGAIQAL GEQGLAGRVL VSGQDTDLAA AQRVVEGTQS MTVYKPIKPL AENAAAAAVA LARGEKVQSN SNVNNGAKEV PSILLAPIVV DRTNIDSTVI KDGFLKREDI YKNVSRTQWP KD
|
| |