Gene BURPS668_A1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1084 
Symbol 
ID4888648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1044622 
End bp1046187 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content72% 
IMG OID640131024 
Producthypothetical protein 
Protein accessionYP_001062083 
Protein GI126443596 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAGG GCAACATGTC CGACCCGTCA CGGGAGAATC CCCCGACCGA CGCGCACGCT 
TCGTCGCTCG TCACGCACGA CGAACCGGTG CCGCGCGGCC GCCGCGCGCC GCGTCCGCTC
TGGACGCTCG TGCCGCGCGC CGCCGGCTAC GGCGTCGTGT TCTTCGTCAT CTGGATGGTG
CTGACCATCA TGTTTCCGAA CGTCTTCACG CGCTCGTCCG AGCGCGCGGT CGTCAACAAC
GAAGTGACGC TCGTCACGTC GCCGGTCGAA GGCGTCGTGA CCGAGCAGCA CGTGACGGCC
GGCAAGCCGT TCGACGCGAA CCAGCCGCTC GCGACCGTGC AGAACCCGAA CGTCGATCGC
GCGCTGCTGA TCGACCTGAC GGGCAAGAAG CTCGACAACC AGCAGCGCGA GGACGCGGCG
CGCGCCGAGC TCGCGGGCGA CGAGAGCCAG CTCGCGTCGA CCGAGCACGA TCTGCAGCGC
TATCAGTCCG TGGCACAGAA GGAGCACGCG GCGACCATCC GCGCGCTCGA GGCGCGGCTC
GCGGTCGCGC GCGCGCAGGT GGACCAGCAG GAAGACATCG TCAACCGCAA TCAGGCGATG
CAGTGGGCAG GCGCCGTGAG CGAGGCGTAC ACCAGCGCGT CGCGCTATCA GCTATCGATC
CTGTCGAACG CGAAAGCGGC CGCGGCCGCC GAGCTCGAGC ACGCGGTCGC GAACGGCGAC
GCGTCGCGCA GCAAGGTCTA CGCGTCCGCC ACCGACGGGC CGGCCGCGTC GCTGTCGCAG
CGCGGCCGGC TGCTCGGCGC GGACATCGCG CAGCGCAAGG CCGAGATCGC GCAGTTCGAC
GCTTACGGCC AGTCGGTCGA CAAGCTGATC GCCGCCGAGC AGCAGCGGCT CGACAGGCTG
AGCCGCATCG AGATCCGCTC GGGCGAGCCG GGCGTCGCGG AGGACGTGCT CGCCCCGCCC
GGCACGCGCG TCGCCGCCGG CGCGACGCTG ATCCGTGCGA GCAACTGCGC GCGCTCGCGT
GTGGTCGCCG TGTTCCCGCG CAGCCTGAGC GACGACCTGC TGCCCGGCAC GCATCTGAAC
GTGCGGATGG ACGGTGTGCC GGCCGTGCTG CCCGCGTCGA TCGCCGAAGT GCTGCCGCGC
GCGTCCGAAG GCGAGCAGGC GCGCTACTTC GTGCCCTTCC CGCCGATCGA AAAGAACGAG
ATCTATGTGA TCGCGAAGCT CGACGAGCCG CTCGCGCCGC TGTCGCGCCG CGCGAGCGCC
CGGCCGGACG CGCGCTGCGC GATGGGGCGC TGGGCGAGAG TCAGCCTCGA TCGGGGCTGG
CTCGCGAGCA ACGTGTCCGG TCTCGCGAAC GTCGATCCGA ACTGGACCGC CGGCGCGCGT
TCGGCGCTCG CGCGCGGCGG GCAATGGCTG CGCGACGCCG GCGTGCAAGC CCGGCGCCGG
CTCGAGGACT TCGCGACGAA CGCCGCGCGC CGGCTCGGCG AGCGGGCCGC GGACGGGCGG
CGCTGGCTGA ACGAGCGGGC GAGCGCGGCC CGACGCTGGC TCGACGATCT GAAATCGGCG
GCCTGA
 
Protein sequence
MQEGNMSDPS RENPPTDAHA SSLVTHDEPV PRGRRAPRPL WTLVPRAAGY GVVFFVIWMV 
LTIMFPNVFT RSSERAVVNN EVTLVTSPVE GVVTEQHVTA GKPFDANQPL ATVQNPNVDR
ALLIDLTGKK LDNQQREDAA RAELAGDESQ LASTEHDLQR YQSVAQKEHA ATIRALEARL
AVARAQVDQQ EDIVNRNQAM QWAGAVSEAY TSASRYQLSI LSNAKAAAAA ELEHAVANGD
ASRSKVYASA TDGPAASLSQ RGRLLGADIA QRKAEIAQFD AYGQSVDKLI AAEQQRLDRL
SRIEIRSGEP GVAEDVLAPP GTRVAAGATL IRASNCARSR VVAVFPRSLS DDLLPGTHLN
VRMDGVPAVL PASIAEVLPR ASEGEQARYF VPFPPIEKNE IYVIAKLDEP LAPLSRRASA
RPDARCAMGR WARVSLDRGW LASNVSGLAN VDPNWTAGAR SALARGGQWL RDAGVQARRR
LEDFATNAAR RLGERAADGR RWLNERASAA RRWLDDLKSA A