Gene BURPS1106A_0023 details


Gene Information

Locus tag: BURPS1106A_0023
Symbol:
ID: 4902041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 23442
End bp: 25004
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 67%
IMG OID: 640133253
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001064308
Protein GI: 126455044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
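
The length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(23442, 25004)
print(length, protein_length(length))  # 1563 520
```

Both values agree with the Gene Length and Protein Length fields of the record.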


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.636069
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCGA CGGCCCCCGC TTCCCCTTCC CGCTCCGCCG AGCCGGCGCC GCTGTCGGGC 
GGCACGCTCG CGCTGCTGAC GATCGGGCTC GCGCTCGGCA CGTTCATGGA GGTGCTCGAC
ACGTCGATCG CGAATGTCGC GGTGCCGACG ATCTCCGGCA GCCTCGGCGT CGCGACGAGC
GAAGGCACGT GGGTGATCTC GTCGTATTCG GTCGCCTCCG CGATCGCGGT GCCGCTGACC
GGCTGGCTCG CGCGGCGGGT CGGCGAGGTG CGGCTGTTCA CGCTGTCGGT GCTCGCGTTC
ACGATCGCGT CCGCGCTCTG CGGCCTCGCG GAGAACTTCG AGACGCTGAT CGCGTTCCGG
CTGTTGCAGG GGCTCGTGTC GGGGCCGATG GTGCCGCTGT CGCAGACGAT CCTGATGCGC
AGCTATCCGC CCGCGAAGCG CGGGCTCGCG CTCGGCCTGT GGGCGATGAC GGTGATCGTC
GCGCCGATCT TCGGCCCGCT GCTCGGCGGC TGGATCAGCG ACAACTACAC GTGGCCGTGG
ATCTTCTATA TCAACCTGCC GATCGGCGTG TTCTCCGCCG CGTGCGCGTT CTTCCTGTTG
CGCGGCCGCG AGACGAAGAC GACGAAGCAG CGGATCGACG CGATCGGGCT CGCGCTGCTC
GTGATCGGCG TGTCGTGCCT GCAGATGATG CTCGACCTCG GCAAGGATCG CGACTGGTTC
AACTCGACGT TCATCACCTC GCTCGCGCTG ATCGCCGTCG TGTCGCTCGC GTTCATGCTC
GTGTGGGAAT CCACCGAGAA GGAGCCGGTC GTCGACCTGT CGCTCTTCAA GGACCGCAAC
TTCGCGCTCG GCGCGATGAT CATCTCGTTC GGCTTCATGG CGTTCTTCGG CTCGGTCGTG
ATCTTTCCGC TGTGGCTGCA GACCGTGATG GGCTACACGG CGGGCCTCGC CGGCCTCGCC
ACGGCGCCCG TCGGCATCCT CGCGCTCGTG CTCTCGCCGA TGATCGGCCG CAACATGCAC
CGGCTCGATC TGCGGATGGT CGCGAGCTTC GCGTTCGTCG TGTTCGCCGT CGTGTCGATC
TGGAATTCGA TGTTTACGCT CGACGTGCCG TTCAACCATG TGATCCTGCC GCGGCTCGTG
CAGGGCATCG GCGTCGCGTG CTTTTTCGTG CCGATGACGA CGATCACGCT CTCCAGCATT
CCCGACGAGC GGCTCGCGAG CGCGTCGGGG CTGTCGAACT TCCTGCGTAC GCTGTCGGGC
GCGATCGGCA CCGCGGTGAG CTCGACGTTC TGGGAAAACG ACGCGATCTA TCACCACGCG
CGGCTCGCCG AATCGGTGAA CGTGTATGCG CAGAGCACGC TCGACTATCA AGGCGCGCTC
GCGCGGCTCG GCGTGATGGG CGACGTGTCG ACCGCGCAGA TCAACCAGAT CGTCACGCAG
CAGGGCTTCA TGATGGCGAC CAACGACTTC TTCCACATTT CGGCGCTCGC GTTCGTCGCG
CTCGCGGCGC TCGTGTGGGT GACGAAGCCG AAGAAAGGGG CCGGGCCCGC GATCGGGCAC
TGA
 
Protein sequence
MAATAPASPS RSAEPAPLSG GTLALLTIGL ALGTFMEVLD TSIANVAVPT ISGSLGVATS 
EGTWVISSYS VASAIAVPLT GWLARRVGEV RLFTLSVLAF TIASALCGLA ENFETLIAFR
LLQGLVSGPM VPLSQTILMR SYPPAKRGLA LGLWAMTVIV APIFGPLLGG WISDNYTWPW
IFYINLPIGV FSAACAFFLL RGRETKTTKQ RIDAIGLALL VIGVSCLQMM LDLGKDRDWF
NSTFITSLAL IAVVSLAFML VWESTEKEPV VDLSLFKDRN FALGAMIISF GFMAFFGSVV
IFPLWLQTVM GYTAGLAGLA TAPVGILALV LSPMIGRNMH RLDLRMVASF AFVVFAVVSI
WNSMFTLDVP FNHVILPRLV QGIGVACFFV PMTTITLSSI PDERLASASG LSNFLRTLSG
AIGTAVSSTF WENDAIYHHA RLAESVNVYA QSTLDYQGAL ARLGVMGDVS TAQINQIVTQ
QGFMMATNDF FHISALAFVA LAALVWVTKP KKGAGPAIGH
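
The gene and protein sequences can be cross-checked against each other. The following is a minimal sketch, not the pipeline used to produce this record: it translates a DNA string with the codon assignments of the standard genetic code, which translation table 11 shares for internal codons, and computes a GC percentage. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, with its display spacing kept.
first_line = "ATGGCTGCGA CGGCCCCCGC TTCCCCTTCC CGCTCCGCCG AGCCGGCGCC GCTGTCGGGC"
print(translate(first_line))  # MAATAPASPSRSAEPAPLSG
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.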