Gene BURPS668_2212 details


Gene Information

Locus tag: BURPS668_2212
Symbol:
ID: 4883929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2203525
End bp: 2205081
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 68%
IMG OID: 640128140
Product: major facilitator transporter
Protein accession: YP_001059247
Protein GI: 126440688
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
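
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive (an assumption, though it agrees with the stated 1557 bp) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2203525, 2205081)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 1557 bp and 518 aa, matching the Gene Length and Protein Length fields.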


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.508956
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC TGGAGCGCCG CGAACCTGCG CGGCGATTTC 
TTTCCATGGG TGCTCGCGAT CGTCACCGGC CTCGATTACT TCGACAACGC CGCGTTCTCG
TTCTTCGCGA GCTACATCGC GGGCGGAATC AACGCGTCGC CGGACGAGCT CGTGTGGGCG
TCGAGCGCTT ACGCGGTGAC GGCCGTGCTC GGCATCCTGC AGCAGCAATG GTGGGTCGAC
CGGCTCGGTC ACCGGCGTTA CGTCGCCGGC TGCATGCTGA TGTTCTCGCT TGGCGCGATG
GCCGCGGCGG CGGCCGACAC GTCGCTGCAG CTCGCGTTCG CGCGCGGCTT TCAGGGCTAT
TTCATCGGTC CCATGATGGG CGCGTGCCGG ATCCTGATCC AGGTCAGCTT CGCGCCGAAG
GATCGCCCGC CCGCGACGCG CGCATTCCTC ATCATGCTGC TGCTCGGCAG CGCGCTCGCG
CCGATCGCGG GCGGCCTGCT CGTCGCGCAC TTCACATGGC GCGCGCTGTT CGCCTGCACG
GCGCCGGCCG GCATCCTGTT CGCGGCGCTC GCGTTCGTCG CGCTGCCCGA TTCCGGCCAC
ACGCCGCCCG ACGAACGCGG CGGCGCGCAT TTCTGGCCGT ACGTGATCTT CGCGCTCGCG
CAAGGCGCGC TGCAGATCGT CATGCAGCAG GTGCGCTTCC AGCTCTTCGC CGGCTCGCCG
CTGCTCGTGC TGCTCGCCGT CGGCGGCCTC GCGGCGCTCG CGTGGTTCGG CCATCATCAG
TGGCATCATC CGGCGCCGCT CGTGCGGCTG CACGCGCTTC GCGAGCGCAC GTTCCGGGTC
GGCCTGCTGC TCTACCTGTT CTATTACTAC GAGACGACGG GCTACAGCTA TCTGATCTCC
CGCTTCCTCG AAACCGGGCT CGGCTATCCG ATCGAGAACG CCGGGCGGCT CGTCGGCACG
ATGTCGCTGA TCTCCGCGAG CGCGCTCTTC GTCTACCTGC GCTACGCGAA GCTTCTCACG
CACAAGAAAT GGATCATCGT GCCCGGCTTC GCGCTCGCCG CGTTCGCCGC GCTATGGATG
ACGCGGATGT CGCCCGAGGT CGGCGAAGCG GCGCTCGTCG CGCCGCTCCT GATGCGCGGG
CTGCTGCTCC TGTTCATCGT GCTGCCCGTC GCGAACCTGA CGTTTCGCGT GTTCGCGATC
GACGAGTATT CGCACGGCTA CCGGCTGAAG AACATCGTCC GGCAATTGAC GATTTCGTTT
GCGACCGCCT CCGTCATCAT CGTCGAGCAG CATCGGCTCG CCGTACATCA GACGCGGCTT
GTCGAGCAGG CGAACGTCTA CAATCCGCTG TTCCGGCAAA CCCTCGACAC GCTCACGCGC
GGCTTCGCGG CCGCGGGCCA CGCGTTCTCC GACGCGCACG CGCTCGCGCT CGTCACCGTG
AGCCGCACCA TCGCGCAACA GGCGTCGTTC CTGGCGTCGC TCGACGGCTT CTACTTCCTC
GCGGGCGTCG CGATCTGCGG CGGCCTGTTC GCCGCCTGGC AAAAAGAGAT CGATTGA
 
Protein sequence
MSAAPGRPPL WSAANLRGDF FPWVLAIVTG LDYFDNAAFS FFASYIAGGI NASPDELVWA 
SSAYAVTAVL GILQQQWWVD RLGHRRYVAG CMLMFSLGAM AAAAADTSLQ LAFARGFQGY
FIGPMMGACR ILIQVSFAPK DRPPATRAFL IMLLLGSALA PIAGGLLVAH FTWRALFACT
APAGILFAAL AFVALPDSGH TPPDERGGAH FWPYVIFALA QGALQIVMQQ VRFQLFAGSP
LLVLLAVGGL AALAWFGHHQ WHHPAPLVRL HALRERTFRV GLLLYLFYYY ETTGYSYLIS
RFLETGLGYP IENAGRLVGT MSLISASALF VYLRYAKLLT HKKWIIVPGF ALAAFAALWM
TRMSPEVGEA ALVAPLLMRG LLLLFIVLPV ANLTFRVFAI DEYSHGYRLK NIVRQLTISF
ATASVIIVEQ HRLAVHQTRL VEQANVYNPL FRQTLDTLTR GFAAAGHAFS DAHALALVTV
SRTIAQQASF LASLDGFYFL AGVAICGGLF AAWQKEID
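
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that strips the 10-character grouping and translates codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard codon table is used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first line of the gene sequence is used as input, so the GC value it produces is for that fragment and not for the whole gene.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC TGGAGCGCCG CGAACCTGCG CGGCGATTTC"
print(translate(first_line))
print(gc_fraction(first_line))
```

The first 60 bases translate to MSAAPGRPPLWSAANLRGDF, which matches the first 20 residues of the protein sequence above.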