Gene BURPS1106A_2250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2250 
Symbol 
ID4900333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2237065 
End bp2238621 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID640135479 
Productmajor facilitator transporter 
Protein accessionYP_001066514 
Protein GI126452028 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.239801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC TGGAGCGCCG CGAACCTGCG CGGCGATTTC 
TTTCCATGGG TGCTCGCGAT CGTCACCGGC CTCGATTACT TCGACAACGC CGCGTTCTCG
TTCTTCGCGA GCTACATCGC GGGTGGAATC AACGCGTCGC CGGACGAGCT CGTGTGGGCG
TCGAGCGCTT ACGCGGTGAC GGCCGTGCTC GGCATCCTGC AGCAGCAATG GTGGGTCGAC
CGGCTCGGTC ACCGGCGTTA CGTCGCCGGC TGCATGCTGA TGTTCTCGCT TGGCGCGATG
GCCGCGGCGG CGGCCGACAC GTCGCTGCAG CTCGCGTTCG CGCGCGGCTT TCAGGGCTAT
TTCATCGGCC CCATGATGGG CGCGTGCCGG ATCCTGATCC AGGTCAGCTT CGCGCCGAAG
GATCGCCCGC CCGCGACGCG CGCGTTCCTC ATCATGCTGC TGCTCGGCAG CGCGCTCGCG
CCGATCGCGG GCGGCCTGCT CGTCGCGCAC TTCACATGGC GCGCGCTGTT CGCCTGCACG
GCGCCGGCCG GCATCCTGTT CGCGGCGCTC GCGTTCGTCG CGCTGCCCGA TTCCGGCCAC
ACGCCGCCCG ACGAACGCGG CGGCGCGCAT TTCTGGCCGT ACGTGATCTT CGCGCTCGCG
CAAGGCGCGC TGCAGATCGT CATGCAGCAG GTGCGCTTCC AGCTCTTCGC CGGCTCGCCG
CTGCTCGTGC TGCTCGCCGC CGGCGGCCTC GCGGCGCTCG CGTGGTTCGG CCATCATCAG
TGGCATCATC CGGCGCCGCT CGTGCGGCTG CACGCGTTTC GCGAGCGCAC GTTCCGGGTC
GGCCTGCTGC TCTACCTGTT CTATTACTAC GAGACGACGG GCTACAGCTA TCTGATCTCC
CGCTTCCTCG AAACCGGGCT CGGCTATCCG ATCGAGAACG CCGGGCGGCT CGTCGGCACG
ATGTCGCTGA TCTCCGCGAG CGCGCTCTTC GTCTACCTGC GCTACGCGAA GCTTCTCACG
CACAAGAAAT GGATCATCGT GCCCGGCTTC GCGCTCGCCG CGTTCGCCGC GCTATGGATG
ACGCGGATGT CGCCCGAGGT CGGCGAAGCG GCGCTCGTCG CGCCGCTCCT GATGCGCGGG
CTGCTGCTCC TGTTCATCGT GCTGCCCGTC GCGAACCTGA CGTTTCGCGT GTTCGCGATC
GACGAGTATT CGCACGGCTA CCGGCTGAAG AACATCGTCC GGCAACTGAC GATTTCGTTT
GCGACCGCCT CCGTCATCAT CGTCGAGCAG CATCGGCTCG CCGTACATCA GACGCGGCTT
GTCGAGCAGG CGAACGTCTA CAATCCGCTG TTCCGGCAAA CCCTCGACAC GCTCACGCGC
GGCTTCGCGG CCGCGGGCCA CGCGTTCTCC GACGCGCACG CGCTCGCGCT CGTCACCGTG
AGCCGCACCA TCGCGCAACA GGCGTCGTTC CTGGCGTCGC TCGACGGCTT CTACTTCCTC
GCGGGCGTCG CGCTCTGCGG CGGCCTGTTC GCCGCCTGGC AAAAAGAGAT CGATTGA
 
Protein sequence
MSAAPGRPPL WSAANLRGDF FPWVLAIVTG LDYFDNAAFS FFASYIAGGI NASPDELVWA 
SSAYAVTAVL GILQQQWWVD RLGHRRYVAG CMLMFSLGAM AAAAADTSLQ LAFARGFQGY
FIGPMMGACR ILIQVSFAPK DRPPATRAFL IMLLLGSALA PIAGGLLVAH FTWRALFACT
APAGILFAAL AFVALPDSGH TPPDERGGAH FWPYVIFALA QGALQIVMQQ VRFQLFAGSP
LLVLLAAGGL AALAWFGHHQ WHHPAPLVRL HAFRERTFRV GLLLYLFYYY ETTGYSYLIS
RFLETGLGYP IENAGRLVGT MSLISASALF VYLRYAKLLT HKKWIIVPGF ALAAFAALWM
TRMSPEVGEA ALVAPLLMRG LLLLFIVLPV ANLTFRVFAI DEYSHGYRLK NIVRQLTISF
ATASVIIVEQ HRLAVHQTRL VEQANVYNPL FRQTLDTLTR GFAAAGHAFS DAHALALVTV
SRTIAQQASF LASLDGFYFL AGVALCGGLF AAWQKEID