Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2339 |
Symbol | |
ID | 3849709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2634434 |
End bp | 2635462 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637842008 |
Product | D-xylose ABC transporter, periplasmic-D xylose binding protein |
Protein accession | YP_442860 |
Protein GI | 83720424 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.975392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCCG TCACGCGTCG TACCGTATTG AGTTCGCTTG CCGGCGCCGC GGCGCTGGCG GCGCTGGCGC TCGCCGCGCC GCTCGCGCAC GCGAGCAAGG ACAAGCCGGA GATCGGCTTT TGCATCGACG ACCTTCGCGT GGAACGCTGG TCGCGCGACC GCGATTATTT CGTCGCGGCC GCGACGAAGC TCGGCGCGAA GGTGTCGGTG CAGTCGGCGG ACGCGAGCGA GGAGCGGCAG ATCTCGCAGA TCGAAAACCT GATCTCGCGA GGCGTCGACG TGATCGTGAT CGTGCCGTTC AATTCGAAGA CGCTCGGCAA CGTCGTCGCC GAAGCGAAGA GGGCGGGCAT CAAGATCGTG TCGTACGACC GGCTGATCCT CGACGCCGAC GTCGACGCGT ACATCTCGTT CGACAACGTG AAGGTCGGCG AGCTGCAGGC GCGGGGCGTC TACGACGCGA AGCCGAAGGG CAACTACTTC CTGCTCGGCG GCGCGCCCAC CGACAACAAC GCGAAGATGC TGCGCGAAGG ACAGTTGAAG GTGCTCAAGC CCGCGATCGA CCGCGGCGAC ATCCGGATCG TCGGCCAGCA GTGGGTGCCC GAATGGAGCG CGTCGACCGC GCTGCGCATC GTCGAGGATG CGCTGACCGC GAACGACAAC AGGATCGACG CGATCGTCGC ATCGAACGAC GGCACCGCGG GCGGCGCGAT CCAGGCGCTG GCCGCGCAGC ATCTCGCGGG CAAGGTGCCG GTGTCGGGGC AGGACGCGGA TCTCGCCGCG CTCAGGCGCG TGATCGCCGG CACGCAGACG ATGACTGTCT ATAAACCGCT GAAGCTGATC GCAAGCGAAG CCGCGAGGCT CGCCGTGGAT CTCGCGAAAG GCACGAAGCC CGCGTACAAC GCGCAATACG ACAACGGCAA GAAGAAGGTC GATACGGTGC TGCTGCAGCC GACGCTGCTG ACCAAGCGCA ACGTCGACGT CGTCGTGAAG GACGGCTTCT ACACGCAGGC GCAACTGGCG GGCCAGTAA
|
Protein sequence | MRSVTRRTVL SSLAGAAALA ALALAAPLAH ASKDKPEIGF CIDDLRVERW SRDRDYFVAA ATKLGAKVSV QSADASEERQ ISQIENLISR GVDVIVIVPF NSKTLGNVVA EAKRAGIKIV SYDRLILDAD VDAYISFDNV KVGELQARGV YDAKPKGNYF LLGGAPTDNN AKMLREGQLK VLKPAIDRGD IRIVGQQWVP EWSASTALRI VEDALTANDN RIDAIVASND GTAGGAIQAL AAQHLAGKVP VSGQDADLAA LRRVIAGTQT MTVYKPLKLI ASEAARLAVD LAKGTKPAYN AQYDNGKKKV DTVLLQPTLL TKRNVDVVVK DGFYTQAQLA GQ
|
| |