Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2250 |
Symbol | |
ID | 4900333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 2237065 |
End bp | 2238621 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640135479 |
Product | major facilitator transporter |
Protein accession | YP_001066514 |
Protein GI | 126452028 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.239801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC TGGAGCGCCG CGAACCTGCG CGGCGATTTC TTTCCATGGG TGCTCGCGAT CGTCACCGGC CTCGATTACT TCGACAACGC CGCGTTCTCG TTCTTCGCGA GCTACATCGC GGGTGGAATC AACGCGTCGC CGGACGAGCT CGTGTGGGCG TCGAGCGCTT ACGCGGTGAC GGCCGTGCTC GGCATCCTGC AGCAGCAATG GTGGGTCGAC CGGCTCGGTC ACCGGCGTTA CGTCGCCGGC TGCATGCTGA TGTTCTCGCT TGGCGCGATG GCCGCGGCGG CGGCCGACAC GTCGCTGCAG CTCGCGTTCG CGCGCGGCTT TCAGGGCTAT TTCATCGGCC CCATGATGGG CGCGTGCCGG ATCCTGATCC AGGTCAGCTT CGCGCCGAAG GATCGCCCGC CCGCGACGCG CGCGTTCCTC ATCATGCTGC TGCTCGGCAG CGCGCTCGCG CCGATCGCGG GCGGCCTGCT CGTCGCGCAC TTCACATGGC GCGCGCTGTT CGCCTGCACG GCGCCGGCCG GCATCCTGTT CGCGGCGCTC GCGTTCGTCG CGCTGCCCGA TTCCGGCCAC ACGCCGCCCG ACGAACGCGG CGGCGCGCAT TTCTGGCCGT ACGTGATCTT CGCGCTCGCG CAAGGCGCGC TGCAGATCGT CATGCAGCAG GTGCGCTTCC AGCTCTTCGC CGGCTCGCCG CTGCTCGTGC TGCTCGCCGC CGGCGGCCTC GCGGCGCTCG CGTGGTTCGG CCATCATCAG TGGCATCATC CGGCGCCGCT CGTGCGGCTG CACGCGTTTC GCGAGCGCAC GTTCCGGGTC GGCCTGCTGC TCTACCTGTT CTATTACTAC GAGACGACGG GCTACAGCTA TCTGATCTCC CGCTTCCTCG AAACCGGGCT CGGCTATCCG ATCGAGAACG CCGGGCGGCT CGTCGGCACG ATGTCGCTGA TCTCCGCGAG CGCGCTCTTC GTCTACCTGC GCTACGCGAA GCTTCTCACG CACAAGAAAT GGATCATCGT GCCCGGCTTC GCGCTCGCCG CGTTCGCCGC GCTATGGATG ACGCGGATGT CGCCCGAGGT CGGCGAAGCG GCGCTCGTCG CGCCGCTCCT GATGCGCGGG CTGCTGCTCC TGTTCATCGT GCTGCCCGTC GCGAACCTGA CGTTTCGCGT GTTCGCGATC GACGAGTATT CGCACGGCTA CCGGCTGAAG AACATCGTCC GGCAACTGAC GATTTCGTTT GCGACCGCCT CCGTCATCAT CGTCGAGCAG CATCGGCTCG CCGTACATCA GACGCGGCTT GTCGAGCAGG CGAACGTCTA CAATCCGCTG TTCCGGCAAA CCCTCGACAC GCTCACGCGC GGCTTCGCGG CCGCGGGCCA CGCGTTCTCC GACGCGCACG CGCTCGCGCT CGTCACCGTG AGCCGCACCA TCGCGCAACA GGCGTCGTTC CTGGCGTCGC TCGACGGCTT CTACTTCCTC GCGGGCGTCG CGCTCTGCGG CGGCCTGTTC GCCGCCTGGC AAAAAGAGAT CGATTGA
|
Protein sequence | MSAAPGRPPL WSAANLRGDF FPWVLAIVTG LDYFDNAAFS FFASYIAGGI NASPDELVWA SSAYAVTAVL GILQQQWWVD RLGHRRYVAG CMLMFSLGAM AAAAADTSLQ LAFARGFQGY FIGPMMGACR ILIQVSFAPK DRPPATRAFL IMLLLGSALA PIAGGLLVAH FTWRALFACT APAGILFAAL AFVALPDSGH TPPDERGGAH FWPYVIFALA QGALQIVMQQ VRFQLFAGSP LLVLLAAGGL AALAWFGHHQ WHHPAPLVRL HAFRERTFRV GLLLYLFYYY ETTGYSYLIS RFLETGLGYP IENAGRLVGT MSLISASALF VYLRYAKLLT HKKWIIVPGF ALAAFAALWM TRMSPEVGEA ALVAPLLMRG LLLLFIVLPV ANLTFRVFAI DEYSHGYRLK NIVRQLTISF ATASVIIVEQ HRLAVHQTRL VEQANVYNPL FRQTLDTLTR GFAAAGHAFS DAHALALVTV SRTIAQQASF LASLDGFYFL AGVALCGGLF AAWQKEID
|
| |