Gene BURPS1106A_0794 details

Gene Information

Locus tag: BURPS1106A_0794
Symbol:
ID: 4900248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 780841
End bp: 782649
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 66%
IMG OID: 640134024
Product: sensory box histidine kinase/response regulator
Protein accession: YP_001065076
Protein GI: 126452141
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
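
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
start_bp = 780841
end_bp = 782649
gene_length_bp = 1809
protein_length_aa = 602


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.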


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.214236
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGGG TCGAATGGCG CAATGAAAAA ATCATCGTCG CCCTGGGTTC CCTGTGGATC 
CTGGGCTTCG CCGCGTGGGC GTTCCTGCTG TACGACCTGC TCGGCACGTC CGTCAAGGAA
GGCATTCTGG AAGGCCCGCG CGAAGGCGTG TTCTGGACTG CCGCGCAATA CCGGAACTCG
TTCAGCCGGT TCGATCGGCA ACTGATTCTC TATGCGGCCG GCGAGAACCG CGACTTCGAC
GCCGTGCTGC TCCAGCTCGA CAGCCTCGAA GCGTCGTTCG GCTTTCTCGA GCGGCCGTCC
GAAGTCTCCG CGTACTGGCT GAGCATTCCG AAGGCCCGTG GCGACATCGC CGAGCTGTCG
CGCTTCATGG CGAGCCTGCG CCGCGACGTT CCGGCGCTGC GCGCGCGCCC CGACGATACG
AAGCGCGTGC TGGGCGAGCT CGCCCGGCAA TGGCCCAAGG TGAACGCGCT CGCGAATTAC
TTTCGCGCGA TCGAAATGGA GCAGCGCGAT TTCACGTTCC ATCAGTTGAA GGAAAAGCGG
CGCGCGATCG TCATGCTGGG CGGCGTGCTC GGCATCATTC TCGGCGCGCT GTTCCTGCTG
CTGTTCTACA CGGTCCGCAC GCGCGGCAGC CTGCTCGAGC AGCAGCAGGC GGCGCTCGAC
GCGGAACGCA AGGCGTCCGA TCGCGCGTTC GAGATGATCG CCGCGAAAAA CGCGTTCCTC
GGCATGGTGA GCCATGAGCT GCGCACGCCG CTGCAGGCGA TCTGCGGCTC GATCGAAGTG
CTGCTCGCGC GGCCGCAGTC CGAAGCGAAC ATGAAGACGA TCAAGCGGCT GCAGAACTCG
GCCGCGTCGC TCGAAGCGCA GGTGAAGGAC CTGACCGACT ACATCAAGCT GCGGTCCACG
AATCGATCGG TGCAGTCGGA GATCGTCGAG GTCGCGCCGC TGCTCGCCGA CGTGCTCGAT
CCGTTGCGCG GCCGCATCCG CGACAAGCAT CTGAGCACGT CGCTGCGCGT GGAGCCGCCC
GCGCTCGTCG TCCGGTCCGA TCGCAAGCTG ATCCAGCAGA TCGTGTCGAA CCTCGTCGAG
AACGCGATCA AATACACGAA CAGCGGAACG ATCGAGATCT CGGCCGCGCT CGGCGGCACG
CCCGCCAATC GGACCATGGC GATCACGGTG CGCGACACCG GCGTCGGCAT CGCGCGGCAT
CTGCTCGCGA AGATCTTCGA GCCGTTCTTT CGCGTGAACG ATCCGGGCGT GCGGCACGTC
GACGGCATCG GCATGGGGCT CGCGGTCGTC CAGGAGCTCG TCGTCGCGCT GCGCGGGCAC
GTCGATGTGC GCAGCGCCGT CGGCGAAGGC AGCGAATTCG CCGTCACGCT GCCCGTCGAG
CTGCCCGACC GCGCCGATGC GCTCGACGAC GACGCGCAGC CGTCACCGCG GGCGGCGCAT
CGCGATCTGC GCGCGCTCGT CGTCGACGAC AACGAGAACG CGCGCGAAAC GCTGGGCGCG
ATGCTCGCGA CGCTCGGCAT CCGGGTGGAT CTGCGCGGCA CCGGCAAGGA AGGCTTGCGC
TGCTTCGGCG AATGTCAGCA CGACATCGTC GTGCTGGATC TGGAGTTGCC CGACATCAGC
GGTTTCGAGG TGGCCGAGCA GATACGCTGG GCGACGTCGT CCGACGCGGC CAGGAAGACG
ACGATACTCG GCGTGAGCGC GTACGAATCG GCGCTGCTCA AGGGCGATCA CGCGATCTTC
GACGCCTTCA TCCCCAAGCC GATTCATCTG GACACGCTCG GCGGCATCGT GAGCCGGCTG
CGAAGCTGA
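
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before the sequence is measured or analysed. A minimal sketch of that clean-up, with a GC-fraction calculation, is given here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample, so the printed fraction describes that sample and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short sample.
sample = "ATGAGCCGGG TCGAATGGCG CAATGAAAAA ATCATCGTCG CCCTGGGTTC CCTGTGGATC"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full block-formatted gene sequence to the same two functions would give the complete length and a GC fraction to compare with the values listed under Gene Information.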
 
Protein sequence
MSRVEWRNEK IIVALGSLWI LGFAAWAFLL YDLLGTSVKE GILEGPREGV FWTAAQYRNS 
FSRFDRQLIL YAAGENRDFD AVLLQLDSLE ASFGFLERPS EVSAYWLSIP KARGDIAELS
RFMASLRRDV PALRARPDDT KRVLGELARQ WPKVNALANY FRAIEMEQRD FTFHQLKEKR
RAIVMLGGVL GIILGALFLL LFYTVRTRGS LLEQQQAALD AERKASDRAF EMIAAKNAFL
GMVSHELRTP LQAICGSIEV LLARPQSEAN MKTIKRLQNS AASLEAQVKD LTDYIKLRST
NRSVQSEIVE VAPLLADVLD PLRGRIRDKH LSTSLRVEPP ALVVRSDRKL IQQIVSNLVE
NAIKYTNSGT IEISAALGGT PANRTMAITV RDTGVGIARH LLAKIFEPFF RVNDPGVRHV
DGIGMGLAVV QELVVALRGH VDVRSAVGEG SEFAVTLPVE LPDRADALDD DAQPSPRAAH
RDLRALVVDD NENARETLGA MLATLGIRVD LRGTGKEGLR CFGECQHDIV VLDLELPDIS
GFEVAEQIRW ATSSDAARKT TILGVSAYES ALLKGDHAIF DAFIPKPIHL DTLGGIVSRL
RS
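
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the database's own method: it builds a codon table from the amino acid assignments that translation table 11 shares with the standard genetic code, and it ignores the table's alternative start codons, which is harmless here because this gene starts with ATG. The name `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same assignments as the standard code and differs in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
first_codons = "ATGAGCCGGG TCGAATGGCG CAATGAAAAA"
last_codons = "CGAAGCTGA"
print(translate(first_codons))
print(translate(last_codons))
```

The two fragments correspond to the beginning and end of the protein sequence listed above. Translating the full gene sequence in the same way gives a string that can be compared with the whole protein.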