Gene BURPS1710b_2747 details


Gene Information

Locus tag: BURPS1710b_2747
Symbol: csx050
ID: 3689683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 3043446
End bp: 3046085
Gene Length: 2640 bp
Protein Length: 879 aa
Translation table: 11
GC content: 66%
IMG OID: 637729203
Product: sensory box histidine kinase
Protein accession: YP_334133
Protein GI: 76808726
COG category: [T] Signal transduction mechanisms
COG ID: [COG2202] FOG: PAS/PAC domain
        [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
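
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3043446
END_BP = 3046085


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2640
print(protein_length(length_bp))  # 879
```

Both results agree with the Gene Length (2640 bp) and Protein Length (879 aa) fields listed above.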


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGAATA TTGCTGCGCC GCACCGACTC GACCCGCGTT TGCCATGCGT CGTGTATCGA 
CAATTCCGCT TTCGGTTCAA CCTTTCTTGT ACGCCTTTTG TGCAGGACGC GCTGAGCTAC
AATCCTGCCA TGTTGACCGA TCGGCTTTTC GCTCGCTCGG CGCGACCGTC GGGCTCGCCT
GCGGAGTCGC AGCCGTCCCG CTGGCACCAC GGACCGTGGT GGTCCAACTC TTACCTGCTC
ACCCCCCTGC TGTCGATCCT CGTCTTTCTC GTGGTGATGA GTCTCATTCT GTGGAGCCTC
AATCGCCGCG AGGAGCAGCA GCAGGAAGAC ACGCTGTTTC GCAACGTCGC GTGGGCGCAG
CAGCAGATCC GCCTGTCGAT GACGAGCGCG CAGGAACAGC TCCAGGCGTT CTCGCGCGAC
ATCGCCGCGG GCCGCATCGA CGAGCATGCG TTCCAGGCGA CGGTGGGCGA CGTGATGCAG
GCGCACCCCG AGATCCTCTA TCTGAACTGG TACACGTCGC CCGGCACGAA ACGCTGGCCG
ACGATGCAAT TGCCGCTCCT CGGCCAGCGG CTCGCGAAGC CGAACGACGC GCAGATGGAC
GAAGTCGTGC GCGGCGCGTA CGCGCAGGCG CGCGGCACGC GCCGCCAGTC GTACTCGCCG
CTCGTCTACG ACGACTTCGG CAACGGCTAC CTGACGCTGC AAACGCCCGT GATCCGCGAG
CGCGAGTATC TCGGCTCGAT CGCCGCGGTG TTCTCGGTCG AAGGCATCCT GAAGCACGAC
ATCCCGCCCG AGCTGTCCGC GAAATACAAG ATCTCGATCA CCGACGCGAA CAACCGCGAG
CTCGCATCGA CGTCGTCGCG CCCGCGGCTG CCGCGCGACG CGCATTACGA CCTGCCGCTC
GACCCGCCCG GCCAGGGCCT CACGGTACGC GTCTACGCGT ACCCGCAAAC GACGAACCTG
ACCAACAACA CGCTCGTATG GCTCGTCGCG GGCCTGTCGT GCTTCGTGCT GTGGAGCCTC
TGGAGCTTGT GGAAGCACAC GCGGCAGCGC TTCGAGGCGC AGCAGGCGCT GTACGCCGAG
GCGTTCTTCC GCCGCGCGAT GGAGAATTCG GTGCTGATCG GCATGCGCGT GCTCGACATG
CACGGCCGGA TCACGCACGT GAACCCGGCG TTCTGCCGGA TGACGGGCTG GGACGAAAGC
GACCTCGTCG GCAAGACCGC GCCGTTCCCG TACTGGCCGC GCGACGCTTA CCCGGAAATG
CAGCGCCAGC TCGACATGAC GCTGCGCGGC AAGGCGCCTT CGTCCGGCTT CGAGCTGCGC
GTGCGCCGCA AGGACGGCTC GCTCTTTCAC GCACGCCTGT ACGTATCGCC GCTCATCGAC
AGCGCCGGCC GGCAGACGGG CTGGATGTCG TCGATGACCG ACATCACCGA GCCCAAGCGC
GCGCGCGAGG AGCTCGCGGC CGCGCACGAG CGCTTCACGA CAGTGCTCGA GAGCCTCGAC
GCCGCGGTGT CGGTGCTCGC CGCGGACGAA GCCGAGCTGC TGTTCGCGAA CCGCTACTAC
CGGCACCTGT TCGGCATCCG CCCGGACGGC CACCTCGAAC TGTCGGGCGG CGGCTTCGAC
ACCGCGCAGG CGTCGTCCGA TTCGATCGAC ATGGTCGACG CCTATGCCGG CCTGCCCGCC
GCGGCGCTCA CCGAGAGCAC GGCGGACGCG CAGGAGGTGT ACGTCGAGAG CATCCAGAAG
TGGTTCGAGG TGCGCCGCCA GTACATCCAG TGGGTCGACG GCCACCTCGC GCAGATGCAG
ATCGCGACCG ACATCACGAC GCGCAAGAAG GCGCAGGAGC TCGCGCACCA GCAGGAAGAA
AAGCTGCAGT TCACGAGCCG GCTGATGACG ATGGGTGAAA TGGCGTCGTC GATTGCACAC
GAACTGAACC AGCCGCTCGC GGCGATCAAC AACTACTGCT CGGGCACGCT CGCGCTCGTG
AAGAGCGGCC GCGCGTCGCC GGAGACGCTC GCGCCCGCGC TCGAGAAGAC CGCGCAGCAG
GCGCTGCGCG CCGGGATGAT CGTCAAGCGG ATCCGCGAGT TCGTCAAGCG CAGCGAGCCG
AAGCGGCAGC CGTCGCGGGT CGCCGACATC GTCGCCGACG CGGTCGGGCT CGCCGAAATC
GAGGCGAGAA AGCGCCGGAT TCGGATCGTC ACCGAAATCC GCGCAAGAAT GCCTATTATT
TATGTCGACC CCGTGTTGAT CGAGCAGGTG CTCGTGAACC TGATGAAGAA CGCGGCCGAG
GCGATGCAGG AGGCGCGGCC GCAGGCGGAG AACGGCGTGA TCCGCGTCGT CGCCGACCTC
GAGGCGGGTT TCGTCGACAT TCGCGTGATC GACCAGGGTC CGGGCGTCGA CGAGGCGACG
GCCGAACGCC TGTTCGAACC GTTCTACAGC ACCAAGTCGG ACGGGATGGG CATGGGGCTC
AATATCTGCC GCTCGATCAT CGAATCGCAT CGCGGGCGTC TGTGGGTGGT CAACAACGTC
GAGCCGGACG GCCTCGTGTC GGGCGCGACG TTTCACTGCA GCCTGCCCAT TGGGGAACCG
GAGGATCTCG GTCGCGGATC CGAGACATCG CCATCACAAA CCGTAACGGG AGAGATATGA
 
Protein sequence
MRNIAAPHRL DPRLPCVVYR QFRFRFNLSC TPFVQDALSY NPAMLTDRLF ARSARPSGSP 
AESQPSRWHH GPWWSNSYLL TPLLSILVFL VVMSLILWSL NRREEQQQED TLFRNVAWAQ
QQIRLSMTSA QEQLQAFSRD IAAGRIDEHA FQATVGDVMQ AHPEILYLNW YTSPGTKRWP
TMQLPLLGQR LAKPNDAQMD EVVRGAYAQA RGTRRQSYSP LVYDDFGNGY LTLQTPVIRE
REYLGSIAAV FSVEGILKHD IPPELSAKYK ISITDANNRE LASTSSRPRL PRDAHYDLPL
DPPGQGLTVR VYAYPQTTNL TNNTLVWLVA GLSCFVLWSL WSLWKHTRQR FEAQQALYAE
AFFRRAMENS VLIGMRVLDM HGRITHVNPA FCRMTGWDES DLVGKTAPFP YWPRDAYPEM
QRQLDMTLRG KAPSSGFELR VRRKDGSLFH ARLYVSPLID SAGRQTGWMS SMTDITEPKR
AREELAAAHE RFTTVLESLD AAVSVLAADE AELLFANRYY RHLFGIRPDG HLELSGGGFD
TAQASSDSID MVDAYAGLPA AALTESTADA QEVYVESIQK WFEVRRQYIQ WVDGHLAQMQ
IATDITTRKK AQELAHQQEE KLQFTSRLMT MGEMASSIAH ELNQPLAAIN NYCSGTLALV
KSGRASPETL APALEKTAQQ ALRAGMIVKR IREFVKRSEP KRQPSRVADI VADAVGLAEI
EARKRRIRIV TEIRARMPII YVDPVLIEQV LVNLMKNAAE AMQEARPQAE NGVIRVVADL
EAGFVDIRVI DQGPGVDEAT AERLFEPFYS TKSDGMGMGL NICRSIIESH RGRLWVVNNV
EPDGLVSGAT FHCSLPIGEP EDLGRGSETS PSQTVTGEI
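
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation routine. This is a minimal sketch, not the pipeline that produced this record. It assumes the sequence is given in the coding orientation and uses the codon assignments of translation table 11, which are the same as the standard code (table 11 differs only in which codons may serve as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCGGAATA TTGCTGCGCC GCACCGACTC"))  # MRNIAAPHRL
print(translate("ACGGGAGAGATATGA"))                   # TGEI
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.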