Gene Information |
Locus tag | BTH_I3120 |
Symbol | |
ID | 3849288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 3558522 |
End bp | 3559790 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637842786 |
Product | major facilitator family transporter |
Protein accession | YP_443615 |
Protein GI | 83721345 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
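The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the gene span includes the terminating stop codon; the function name `gene_lengths` is hypothetical and not part of the database.

```python
def gene_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon in the span is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the BTH_I3120 record above.
print(gene_lengths(3558522, 3559790))  # (1269, 422)
```

The result matches the Gene Length (1269 bp) and Protein Length (422 aa) fields of this record.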
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00127728 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGCC CTGCCGATCT TCGCGCCGGC CCGGCCGCCG GAGAACCGCG CGCCGCTCCC GCTCCCACAC TCACGCCTGC GCTCACCGTG TTCTTTTCGG CGACGGTCGG TGTGATCGTG CTCGACCTGT TCGCCGCGCA GCCGTTGACG GGACCGATCG CGGCCGAACT GCGGCTGCCT GCCGGCCTGA CCGGGCTCAT TGCGATGCTG CCGCAACTCG GCTATGCGGC GGGCCTCGTG CTGCTCGTGC CGCTCATCGA CCTGCTCGAG AACCGCCGGC TCATCGTGAC GACGCTCACC GTCTGCGCGG CGACGCTCGC GCTGCCTGCC GTCACGCACT CCGGCGCCCT GTACCTCGCT GCGGTGTTCG CCGCGGGCGC GGCATCGAGC GTGATCCAGA TGCTCGTGCC GATGGCGGCG TCGATGGCGC CGGACGAACA GCGCGGCCGC GCGGTCGGCA ACGTGATGAG CGGCCTGATG CTCGGCATCC TGCTGTCGCG GCCGCTCGCG AGCCTGATCG CCGGCACGGC CGGCTGGCGC GCGTTCTACG GCACGGCCGC CGCCGCCGAC ATCGCGATCG CCGCGGTGCT GGCCGCGAAG CTGCCGTTGC GCGCGCCGTC GCTGTCGACT CGTTATGCGG CGCTGCTCCG GTCGCTCTGG GTGCTCGTCG CGACCGAGCG CGTGCTGCAG CGGCGCGCGC TGTCCGCGGC GCTATCGATG GGCGCATTCA GCGCGTTCTG GACCGCGATC GGTCTGCGCC TCGCCGCCGC GCCATTCGAT CTCGGCTTGC ATGGAATCGC GATGTTCGCG TTCGCCGGCG CGACGGGCGC GATCGTCACG CCGTTCGCGG GGCTGGCCGG CGACCGCGGC TGGGAGCGCC GCGCATTGCG CGGCGCGCAC GTGGCCATGC TCGCCGCGCT GGTCGCGCTC GGCGTCGCAG GCGCGGGCTG GGCCCGGTTC GATCCGGCCG CGCATCCGAC GCTGGCGCTC ACGCTGCTCG TCGCCGGCGC GGCGGCGCTC GATGCGGGCG TCGTCGCCGA CCAGACGCTC GGCCGGCGCG CGATCAACCT GCTGAACCCC GCCGCGCGCG GACGGCTCAA CGGGCTGTTC GTCGGGCTGT TCTTCGTCGG CGGCTCGCTC GGCGCCGCGC TCGCCGGCGC GGCGTGGGCA TGGGCCGGCT GGAGCGCGGT GTGCGCGGTG GGTCTCGCGT TCGCGGGGGC CGCATTCGCG CTCGACTGGA TCGGCGCGCG CCGGCAAGCC GTGCGCTGA
|
Protein sequence | MNCPADLRAG PAAGEPRAAP APTLTPALTV FFSATVGVIV LDLFAAQPLT GPIAAELRLP AGLTGLIAML PQLGYAAGLV LLVPLIDLLE NRRLIVTTLT VCAATLALPA VTHSGALYLA AVFAAGAASS VIQMLVPMAA SMAPDEQRGR AVGNVMSGLM LGILLSRPLA SLIAGTAGWR AFYGTAAAAD IAIAAVLAAK LPLRAPSLST RYAALLRSLW VLVATERVLQ RRALSAALSM GAFSAFWTAI GLRLAAAPFD LGLHGIAMFA FAGATGAIVT PFAGLAGDRG WERRALRGAH VAMLAALVAL GVAGAGWARF DPAAHPTLAL TLLVAGAAAL DAGVVADQTL GRRAINLLNP AARGRLNGLF VGLFFVGGSL GAALAGAAWA WAGWSAVCAV GLAFAGAAFA LDWIGARRQA VR
|
| |
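The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for ten-character grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAACTGCC CTGCCGATCT TCGCGCCGGC"))  # MNCPADLRAG
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after `clean`) is a quick consistency check on the record; `gc_content` on the same input can be compared with the GC content field.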