Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0100 |
Symbol | |
ID | 4887232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 85459 |
End bp | 86811 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640130041 |
Product | MFS transporter, metabolite:H+ symporter (MHS) family protein |
Protein accession | YP_001061106 |
Protein GI | 126444904 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000762356 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCCGA CCTTTACCGA AACCGGCGCG CGGGCGAGCG AGCCCCACGC AACAACGCTT GCCGGCCGAG CCGCATCGTC CGGCGATAAG CTGCAACGCC TGAAGTCGAT CTTCTCCGCA TCGGCGGGCA ATCTGATCGA GTGGTACGAC TGGTACGTGT ATTCGGCCTT TGCGCTGTAC TTCGCGCATT CGTTTTTTCC GGCCGGCAAC CAGACCGCGC AACTGCTCAA CACGGCCGCC GTCTTCGCGG TGGGCTTTCT GGCGCGCCCC GTCGGCGGCT GGCTGATGGG GATCTACGCG GATCGCTACG GCCGCCGTCC GGCCTTGCTC GTATCGGTCG TGCTGATGTG CGTGGGCTCG CTCATCATCG CCCTCTGCCC GGGGTATGAC GTGATCGGCA CGGCCGCGCC TGTCCTGCTC GTCGTCGCCC GCCTGCTGCA GGGCTTGAGC GTCGGGGGCG AATACGGCAC GTCGGCGACC TATCTCAGCG AAGTCGCGAC CGCCCGCGAT CGCGGCTTCT ACTCCAGCTT TCAGTACGTC ACGCTGGTTG CCGGCCAACT CGTCGCGCTG GCGCTGCTGA TCGTCCTGCA GCAATTCGTG CTGACGACAC AGCAGCTCGA AAGCTGGGGG TGGAGAATTC CGTTCTTCAT CGGCGCGGCG TGCGCGCTGG CCGCCATGCG CCTGCGCTCG TCGATGGAGG AAACGGGCGA ATTCAAGCGG ACCTTAAACG CCCGGGACAA GCGCGGCACG CTCGCGGAAC TGAGCAAGCA TCCGCGCGCG GTCCTGACCG TGGTCGGCCT GACGATGGGC GGCACGATCG CGTTCTACAC CTACTCGATC TACATGCAGA AATTCCTCGT GAACACCGTC GGCATGAGCA AACATGATGC GACCCTGGTA TCCGCCGCGT CGCTCGCCCT GTTCGCGATC TTGCAGCCGA TCGTCGGCTC GATCTCCGAT CGCATCGGCC GTCGCCCCGT GCTGATCGCG TTCGGCGTGC TCGGCACGTT GTTCACCGTG CCGATCATGA CGGCCATCAG CCGGACTCAT GACGTTTGGA CCGCGTTCTT CCTCAACATG GCGGCGCTGG TGATCGTGTC GGGGTATACC TCGATCAACG CAGTCGTGAA GGCGGAATTG TTCCCTGCGA AAATCCGTGC ACTGGGCGTC GGCTTTCCGT ATGCACTCAC CGTGTCGATC TTCGGCGGCA CGGCGGAGTA TCTGGCGCTC TGGCTCAAGC AGGCTGGCCA CGAATCGCTG TTCTATTGGT ACGTGACAGC CGCCATATTC TGTTCGTTGC TCGTCTACGT ATGCATGCGG GATACCGGGA AGCACTCGCT GATCAAGGAT TGA
|
Protein sequence | MEPTFTETGA RASEPHATTL AGRAASSGDK LQRLKSIFSA SAGNLIEWYD WYVYSAFALY FAHSFFPAGN QTAQLLNTAA VFAVGFLARP VGGWLMGIYA DRYGRRPALL VSVVLMCVGS LIIALCPGYD VIGTAAPVLL VVARLLQGLS VGGEYGTSAT YLSEVATARD RGFYSSFQYV TLVAGQLVAL ALLIVLQQFV LTTQQLESWG WRIPFFIGAA CALAAMRLRS SMEETGEFKR TLNARDKRGT LAELSKHPRA VLTVVGLTMG GTIAFYTYSI YMQKFLVNTV GMSKHDATLV SAASLALFAI LQPIVGSISD RIGRRPVLIA FGVLGTLFTV PIMTAISRTH DVWTAFFLNM AALVIVSGYT SINAVVKAEL FPAKIRALGV GFPYALTVSI FGGTAEYLAL WLKQAGHESL FYWYVTAAIF CSLLVYVCMR DTGKHSLIKD
|
| |