Gene BURPS1106A_A1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1497 
Symbol 
ID4904571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1448188 
End bp1451298 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content69% 
IMG OID640144603 
ProductRND HAE1/HME family protein 
Protein accessionYP_001075531 
Protein GI126456706 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCG GCGCGCCCCA CGATTCCGGC GCGGCCCGCG CGCGCTTCAC GGACGTGTTC 
GTCCGCCGCC CGGTGCTGTC GCTCGTGTTC AGCCTGCTGA TCCTGCTCGT CGGCCTGCGC
GCGCTGATGT CGCTGCCGGT GCGCCAGTAT CCGGCGCTGG AGAGCGCGAC GATCAGCGTC
GCGACCGATT ATCCCGGCGC GTCGCAAGAG CTGATGCAGG GCTTCGTGAC GACGCCGATC
GCGCAATCGA TCGCGACCGC CGACGGCATC GAGTACATCA CGTCGACGAC GACGCAGGGC
AAGAGCGTCG TGAAGGCGCG GCTGCGCCTG AACGCGAACG CGGACCGCGC GATGACCGAG
GTGATGGCCA AGGTCCAGCA GGTGAAGTAC AAGCTGCCCG CCGACGCGTA CGATTCGGTG
ATCACGAAGC TCACCGACGC GCCGACCGCG GTGATGTATC TCGGCTTCGC GAGCGATGCG
CTGTCCATTG CGCAGATCAC CGATTATGTG GTGCGCGTCG CGCAGCCGCT CGTGACGACG
GTGCCGGGCG TCGCCTCCGC CGAGATTCTC GGCGGCCAGA ACCTCGCGAT GCGCGTCTGG
CTCGACGCGA CGCGGCTCGC CGCGCACGGG CTGTCCGCGG GCGACGTCGC CGCGGCGATC
CGCGCGAACA ACGTGCAGGC GGCGCCGGGT CAGGTCAAGG GCTCGCTCAC GATCGCCGAC
ATCTCGGCGA ACACCGATCT GACCGACGTC GGCGCATTCA ACGAGATGGT CGTGAAAAGC
GCGCCGAACG GTGGCGGCCT CGTGCGGCTG AAGGATGTCG CGACCGTGGA GATCGGCGGC
GAGAACTACA ACTCGAGCGC GCTGATGAAC GGCCGCCGCG CGGTGTACAT CGCGGTCAAC
GCGACGCCCG CGGGCAACCC GCTCGAGATC GTGCGCGGCG TCGGCGCGGT GCTGCCCGCG
ATGGAGCGCA ACAAGCCCGC GAGCGTGCAG ATCGCGAACA TGTTCGACGG CGCGCGCTTC
GTCAACGCGT CGATCGCCGA AGTCCGCTCG ACGCTCACCG AGGCGATCGC GATCGTCGTC
GTCGTGATTT TCCTGTTCCT CGGCTCGTTT CGCGCGGTGA TCATCCCGGT GCTGACGATT
CCGCTGTCGC TCGTCGGCTC GGCGGCGCTG ATGCTGGCCG CGGGCTTCTC GCTGAACCTG
CTGACGCTGC TCGCGATGGT GCTCGCGATC GGGCTCGTCG TCGACGACGC GATCGTCGTC
GTCGAGAACA TCCACCGCCA CATCGAGCAG GGCGAGCCGC CCGTGCGCGC GGCGCTGATC
GGCGCGCGCG AGATCGCGTC GCCCGTGCTC GTGATGATGG CGACGCTGAT CTCCGTCTAC
GCGCCGATCG GGCTGATGGG CGGCCTCACC GGCTCGCTGT TCAAGGAGTT CGCGTTCACG
CTCGCGGGCG CGGTGCTCGT GTCGGGCATC GTCGCGCTCA CGCTCTCGCC GATGCTCGGC
TCGCTGCTGC TCACGAGCAG GATGTCCGAG GGCCGCGTCG CGAAGGCGAT CGAGCGCACG
CTCGAGCGCG TGACGCACGC GTACCGGCGC GGCCTGCACG CAAGCCTCGC CGCGCGGCCC
GCGATTCTGC TGATCGGCGC CGGCGTGCTC GCGGGCATCG TGCTGCTCTT CTCGGGCGTG
AAGCGCGAGC TCGCGCCGCA GGAGGACCAG GGCTCGATCA TGGTCGCGGT GAAGGCGCCG
CAGTATGCGA ACCTCGATTA CATGGAGCGC TACGCGCCCG ACATCGAGCG CGTGTTCCGC
ACGCTGCCCG AGGCCGACAC GAGCTTCATC CTGAACAGCT ACGGCGGCAG CAATCTCGGC
TTCGCGGGCG TGAACCTCGT CGATTGGGAC AAGCGCACGC GCAGCGCGGC GGCGCTGCAG
GCGCTGATCC AGGCGCGCGG CGACGGGATC CGCGGCGAGC GCGTGTTCGC GTTCCAGTTG
CCGGCGCTGC CCGCGTCGAG CGGCGGGCTG CCGATCCAGA TGGTGCTGCG CACGCCGTCG
GGTTTCGCCG ATCTCTACGC GCAGATGGAG CGGATCAAGG CGGCCGCGCA AAAGAGCGGC
CTCTTCGCGG TGCTCGACAG CGATCTGACG TTCGACAGCC GCGCGGTGGG CGTGACGATC
GACCGCAATC AGGCGAACAC GCTCGGCGTG ACGATGAAGG ACATCGCCGA CACGCTCGCC
GTGCTCGTCG GCGAGAACTA CGTGAACCGC TTCAACTTCA AGGGCCGCTC ATACGACGTG
ATCCCGCAGG TGTCGCGCAG CGAGCGGCTG TCCTCCGAGA TGCTCAGCCG CTATTACGTG
AAGACGGGCT CGGGGCAGAT GATTCCGCTG TCGACCGTGA TTCAAGTGAC CAACGGCAGC
CAGGCGAACG CGCTCAGCCA GTTCAACCAG ATGAACGCGG CGACGTTCTC CGCGGTGCCC
GCGCCCGGCG TGACGATGGG CGACGCGGTG GCGTTCCTCG AGGCGCAGAA GCTGCCCGCC
GGCTTCTCGA TCGACTGGCT CGGCGAATCG CGGCAATACG TGCAGGAAGG CAACCGGCTC
GCCGTCACGT TCGGCTTCGC GCTGATCGTG ATCTTCCTCG TGCTCGCCGC GCAGTTCGAG
AGCCTGCGCG ATCCGCTCGT GATCCTGGTG TCGGTGCCGC TGTCGATCTG CGGCGCGCTC
GCCCCGCTCT ACCTCGGTTT CGCGACGCTC AACATCTATA CGCAGATCGG GCTCGTCACG
CTGATCGGGC TGATCTCGAA GCACGGGATC CTGATGGTCA GCTTCGCGAA CGATCTGCAG
CGCCACGAAG GGCTCGACCG GCGCGCGGCG ATCGAGCGCG CGGCGGCCGT GCGGCTGCGT
CCGATCCTGA TGACGACCGC GGCGATGGTC GCGGGCCTCG TGCCGCTCGT GTTCGCCGCG
GGCGCCGGCG CGGCGAGCCG CTTCGCGATC GGCATCACGG TGTCGACCGG CATGCTGGTC
GGCACGCTGT TCACGCTGTT CGTGCTGCCG ACCGTCTATA CGTTCGTCGC GAAGGATCAC
GGCGCGGCGA TGCGCAGCGA GCGCGCGCAA CTGCTCGCCG CCACTCGGTA A
 
Protein sequence
MNAGAPHDSG AARARFTDVF VRRPVLSLVF SLLILLVGLR ALMSLPVRQY PALESATISV 
ATDYPGASQE LMQGFVTTPI AQSIATADGI EYITSTTTQG KSVVKARLRL NANADRAMTE
VMAKVQQVKY KLPADAYDSV ITKLTDAPTA VMYLGFASDA LSIAQITDYV VRVAQPLVTT
VPGVASAEIL GGQNLAMRVW LDATRLAAHG LSAGDVAAAI RANNVQAAPG QVKGSLTIAD
ISANTDLTDV GAFNEMVVKS APNGGGLVRL KDVATVEIGG ENYNSSALMN GRRAVYIAVN
ATPAGNPLEI VRGVGAVLPA MERNKPASVQ IANMFDGARF VNASIAEVRS TLTEAIAIVV
VVIFLFLGSF RAVIIPVLTI PLSLVGSAAL MLAAGFSLNL LTLLAMVLAI GLVVDDAIVV
VENIHRHIEQ GEPPVRAALI GAREIASPVL VMMATLISVY APIGLMGGLT GSLFKEFAFT
LAGAVLVSGI VALTLSPMLG SLLLTSRMSE GRVAKAIERT LERVTHAYRR GLHASLAARP
AILLIGAGVL AGIVLLFSGV KRELAPQEDQ GSIMVAVKAP QYANLDYMER YAPDIERVFR
TLPEADTSFI LNSYGGSNLG FAGVNLVDWD KRTRSAAALQ ALIQARGDGI RGERVFAFQL
PALPASSGGL PIQMVLRTPS GFADLYAQME RIKAAAQKSG LFAVLDSDLT FDSRAVGVTI
DRNQANTLGV TMKDIADTLA VLVGENYVNR FNFKGRSYDV IPQVSRSERL SSEMLSRYYV
KTGSGQMIPL STVIQVTNGS QANALSQFNQ MNAATFSAVP APGVTMGDAV AFLEAQKLPA
GFSIDWLGES RQYVQEGNRL AVTFGFALIV IFLVLAAQFE SLRDPLVILV SVPLSICGAL
APLYLGFATL NIYTQIGLVT LIGLISKHGI LMVSFANDLQ RHEGLDRRAA IERAAAVRLR
PILMTTAAMV AGLVPLVFAA GAGAASRFAI GITVSTGMLV GTLFTLFVLP TVYTFVAKDH
GAAMRSERAQ LLAATR