Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1179 |
Symbol | |
ID | 3847199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1324913 |
End bp | 1325920 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637840851 |
Product | L-arabinose ABC transporter, periplasmic L-arabinose-binding protein |
Protein accession | YP_441726 |
Protein GI | 83718663 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCA GAACGTTCAT CACGCTGGCG GCAGCGGCAG CGGCAGCGGC GGCGGTCGCG GCGGCGGGCC TGTCCGCGCA CGCGGCCGAG CCCGTGAAGA TCGGCTTCCT CGTCAAGCAG CCCGAGGAGC CGTGGTTCCA GGACGAATGG AAATTCGCCG AGATCGCCGC GAAGGACAAG GGCTTCACGC TCGTGAAGAT CGGCGCGCCG TCCGGCGAGA AGGTGATGAG CGCGATCGAC AATCTCGCCG CGCAGAAGGC GCAGGGCTTC ATCATCTGCA CGCCGGACGT GAAGCTCGGG CCGGGCATCG TCGCGAAGGC GAAGTCGCAC GGCCTGAAGA TGATGACGGT CGACGACCGG CTCGTCGACG GCGCGGGCAA GCCGATCGAA TCGGTCCCGC ACATGGGCAT TTCCGCGTAC GACATCGGCA AGCAGGTCGG CGGCGGGATC GCGGCCGAGA TCAAGAAGCG CGGCTGGAAC ATGAACGAAG TCGGCGCGAT CGACATCACG TACGAGCAGT TGCCGACCGC GCACGACCGC ACGGCGGGCG CGACCGACGC GCTCGTCGCC GCGGGCTTTC CGAAGGCGAA CGTGATCGCC GCGCCGCAGG CGAAGACCGA TACCGAAAAC GCGTTCAACG CGGCGAACAT CGCGCTCACG AAGAATCCGA AGTTCAAGCA TTGGGTCGCC TACGGCCTGA ACGACGAGGC GGTGCTCGGC GCTGTGCGCG CGGCCGAAGG GCGCGGCTTC AAGGCGGCGG ACATGATCGG CATCGGCATC GGCGGGTCGG ACTCGGCGCT CAGCGAGTTC AAGAAGCCGC AGCCGACGGG CTTCTTCGGC ACCGTGATCA TCAGCCCGAA GCGACACGGC GAAGAGACGT CGGAGCTGAT GTATGCGTGG ATCACGCGAG GCAAGGCGCC GCCGCCGCTC ACGCTCACGA CGGGCATGCT CGCGACGCGC GAGAACGTCG CGCAGGTGCG CGAGACGATG GGGCTCGCGG CGAAGTAG
|
Protein sequence | MKRRTFITLA AAAAAAAAVA AAGLSAHAAE PVKIGFLVKQ PEEPWFQDEW KFAEIAAKDK GFTLVKIGAP SGEKVMSAID NLAAQKAQGF IICTPDVKLG PGIVAKAKSH GLKMMTVDDR LVDGAGKPIE SVPHMGISAY DIGKQVGGGI AAEIKKRGWN MNEVGAIDIT YEQLPTAHDR TAGATDALVA AGFPKANVIA APQAKTDTEN AFNAANIALT KNPKFKHWVA YGLNDEAVLG AVRAAEGRGF KAADMIGIGI GGSDSALSEF KKPQPTGFFG TVIISPKRHG EETSELMYAW ITRGKAPPPL TLTTGMLATR ENVAQVRETM GLAAK
|
| |