Gene BURPS668_0023 details


Gene Information

Locus tag: BURPS668_0023
Symbol:
ID: 4882216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 23389
End bp: 24951
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 67%
IMG OID: 640125951
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001057078
Protein GI: 126439577
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
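
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 23389
end_bp = 24951
protein_length_aa = 520

# Assumption: coordinates are 1-based and inclusive, so the span is
# end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1563
print(remainder)                # 0
print(expected_protein_length)  # 520
```

Under these assumptions the listed gene length (1563 bp) and protein length (520 aa) are mutually consistent.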


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCGA CGGCCCCCGC TTCCCCTTCC CGCTCCGCCG AGCCGGCGCC GCTGTCGGGC 
GGCACGCTCG CGCTGCTGAC GATCGGGCTC GCGCTCGGCA CGTTCATGGA GGTGCTCGAC
ACGTCGATCG CGAACGTCGC GGTGCCGACG ATCTCCGGCA GCCTCGGCGT CGCGACGAGC
GAAGGCACGT GGGTGATCTC GTCGTATTCG GTCGCCTCCG CGATCGCGGT GCCGCTGACC
GGCTGGCTCG CGCGGCGGGT CGGCGAGGTG CGGCTGTTCA CGCTGTCGGT GCTCGCGTTC
ACGATCGCGT CCGCGCTCTG CGGCCTCGCG GAGAACTTCG AGACGCTGAT CGCGTTCCGG
CTGTTGCAGG GGCTCGTGTC GGGGCCGATG GTGCCGCTGT CGCAGACGAT CCTGATGCGC
AGCTATCCGC CCGCGAGGCG CGGGCTCGCG CTCGGCCTAT GGGCGATGAC GGTGATCGTC
GCGCCGATCT TCGGCCCGCT GCTCGGCGGC TGGATCAGCG ACAACTACAC GTGGCCGTGG
ATCTTCTACA TCAACCTGCC GATCGGCGTG TTCTCCGCCG CGTGCGCGTT CTTCCTGTTG
CGCGGCCGCG AGACGAAGAC GACGAAGCAG CGGATCGACG CGATCGGGCT CGCGCTGCTC
GTGATCGGCG TGTCGTGCCT GCAGATGATG CTCGACCTCG GCAAGGACCG CGACTGGTTC
AACTCGACGT TCATCACCTC GCTCGCGCTG ATCGCCGTCG TGTCGCTCGC GTTCATGCTC
GTGTGGGAAT CCACCGAGAA GGAGCCGGTC GTCGACCTGT CGCTCTTCAA GGACCGCAAC
TTCGCGCTCG GCGCGATGAT CATCTCGTTC GGCTTCATGG CGTTCTTCGG CTCGGTCGTG
ATCTTTCCGC TGTGGCTGCA GACCGTGATG GGCTACACGG CGGGCCTCGC CGGCCTCGCC
ACCGCGCCCG TCGGCATCCT CGCGCTCGTG CTCTCGCCGA TGATCGGCCG CAACATGCAC
CGGCTCGATC TGCGGATGGT CGCGAGCTTC GCGTTCGTCG TGTTCGCCGT CGTGTCGATC
TGGAATTCGA TGTTTACGCT CGACGTGCCG TTCAACCATG TGATCCTGCC GCGGCTCGTG
CAGGGCATCG GCGTCGCGTG CTTTTTCGTG CCGATGACGA CGATCACGCT CTCCAGCATT
CCCGACGAGC GGCTCGCGAG CGCGTCGGGG CTGTCGAACT TCCTGCGTAC GCTGTCGGGC
GCGATCGGCA CCGCGGTGAG CTCGACGTTC TGGGAAAACG ACGCGATCTA TCACCACGCG
CGGCTCGCCG AATCGGTGAA CGTGTATGCG CAGAGCACGC TCGACTATCA AGGCGCGCTC
GCGCGGCTCG GCGTGATGGG CGACGTGTCG ACCGCGCAGA TCAACCAGAT CGTCACGCAG
CAGGGCTTCA TGATGGCGAC CAACGACTTT TTCCACATTT CGGCGCTCGC GTTCGTCGCG
CTCGCGGCGC TCGTGTGGGT GACGAAGCCG AAGAAAGGGG CCGGGCCCGC GATCGGGCAC
TGA
 
Protein sequence
MAATAPASPS RSAEPAPLSG GTLALLTIGL ALGTFMEVLD TSIANVAVPT ISGSLGVATS 
EGTWVISSYS VASAIAVPLT GWLARRVGEV RLFTLSVLAF TIASALCGLA ENFETLIAFR
LLQGLVSGPM VPLSQTILMR SYPPARRGLA LGLWAMTVIV APIFGPLLGG WISDNYTWPW
IFYINLPIGV FSAACAFFLL RGRETKTTKQ RIDAIGLALL VIGVSCLQMM LDLGKDRDWF
NSTFITSLAL IAVVSLAFML VWESTEKEPV VDLSLFKDRN FALGAMIISF GFMAFFGSVV
IFPLWLQTVM GYTAGLAGLA TAPVGILALV LSPMIGRNMH RLDLRMVASF AFVVFAVVSI
WNSMFTLDVP FNHVILPRLV QGIGVACFFV PMTTITLSSI PDERLASASG LSNFLRTLSG
AIGTAVSSTF WENDAIYHHA RLAESVNVYA QSTLDYQGAL ARLGVMGDVS TAQINQIVTQ
QGFMMATNDF FHISALAFVA LAALVWVTKP KKGAGPAIGH
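
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons, and this gene begins with ATG). The helper names `clean` and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used here, to keep the example short.

```python
# Standard codon assignments in the conventional TCAG order. These amino
# acid assignments are the same for NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 63 bases of the gene sequence, copied from the
# record. Each display line holds 60 bases, so both excerpts start in frame.
first_block = "ATGGCTGCGA CGGCCCCCGC TTCCCCTTCC"
last_block = ("CTCGCGGCGC TCGTGTGGGT GACGAAGCCG AAGAAAGGGG CCGGGCCCGC GATCGGGCAC "
              "TGA")

print(translate(first_block))  # MAATAPASPS
print(translate(last_block))   # LAALVWVTKPKKGAGPAIGH
```

Both outputs match the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison over the whole record.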