Gene BURPS668_A1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1038 
Symbol 
ID4888920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1003286 
End bp1004791 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content73% 
IMG OID640130978 
ProductSignal transduction histidine kinase 
Protein accessionYP_001062037 
Protein GI126442825 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGCGGC TCAATCTGCG CGCGCAGGTC GCGTTATGGT TGCTGCTGCC GTTCCTCGGG 
CTGCTCGCGC TCGATTCGTG GCTCACGTAC CAGCGCGCGA TGAACGCCGC GCACGTCGCG
TTCGATCGCA CGCTCGCGTC GTCGCTGAAG TCGATCCGCG AGGGCGTGCG GCTCGTCGGC
GGCGAGGTCG AGGTCGATCT GCCGTATCTC GCGCTCGAGA TGTTCGAGTC GAACGACGGC
GGCAAGATCT ACTACCTGAT TCGCGGCGAC GACGGCCGCG CGGTCACCGG CTACCGCGAT
CTGCCGATGC CGGGCGCGGG CGCGCCGCTC TATGCGACGT CGTTCTACGA CGCCGTGTAT
CGCGGCGAGC AGTTGCGCAT GGCCGCGCTG CGGCTGCCCG TGCACGACGT GCCGAGCGCG
CGGACGCGCG TCGTGTGGGT GATGGTCGGC GAGACGATCG AGGCGCGGCA GGCGCTCGCG
CGCGAGATCC TGGTTGGCTC GCTGCTGCAG GAGGGCCTGC TCGTCGTGCT CGCGCTCGGC
ATCGTGTGGC TCGGCGTCGG GCGCGGGCTG CGGCCGCTGA ACCGGCTGTC CGCGAAGGTC
GCCGCGCGCG CGGAGGACGA CCCGACGCCG CTCGAGACGC TCGGGCTGCC GAGCGAGGTC
GCGCCGCTCG TCGAATCGAT CAACCAGTAC GTCGCGCGCA CGCAGCGCAT GCAGGTCGCG
CGGCGGCGCT TCTTCGCCGA CGCCGCGCAT CAACTGAAGA CGCCGCTCGC GGCGGCGCAG
GCGGGCGTCG AGCTTGCGCT GCGGCCCGCC GAGCGCGAGC GCGTGAGCGT GCATCTGCGG
CGCGTGAACG GCGCGGTGCG GCAGGCGGCG AAGATCGTCC AGCAGTTGCT GTCGCTGTCG
CGGCTCGAAT CGGACGTCGC GCCGGCGATC GAGCGCAAGC CGGTCGCGCT CGCGAAGCTC
GCGCGCAGCG TGACGCTCGA CTGGTCGGGC GTCGCCCGCG CGCGCGGCAT CGATCTCGGC
TTCGAGCAGC GCGCGAGCGT CGACGTGATG GGGCGCGCGG ATCTGCTGGG CGAGCTCGTC
GGCAACCTGA TCGACAACGC GATCCGCTAT GCGGGCGACG GCGCGGTGAT CACCGTGCGT
GTCGCGCGCG AAGGCGCGCT CGCGCGGCTC GAGGTGATCG ACGACGGGCC GGGGATCGCA
CCCGGCGAGC GCGACGCGGT GTTCGAGCGC TTCTACCGGA GCCACGCGAC GCTCGCGGTC
GAGGGCACGG GGCTCGGGCT GTCGATCGTG CGCGAGATCG CGCGCGTGCA TCGGGGCGCG
GTCGAATTGG ATGATGCGGC GCTCGCGGGC GGGGCAGCGG GCGATGCGGG CGACAGAGCC
GGTGAGAGGC GTGAAAGAGG CGAGGCGGAC TGGCGCGGCG ACAGTGACGG CGAGCGCTCG
GCTGAACGGC CGGCGCGCGG GCGCGGGCTC GTCGTGCGCG TGACGCTGCC GGCGCTCGCG
GTGTGA
 
Protein sequence
MTRLNLRAQV ALWLLLPFLG LLALDSWLTY QRAMNAAHVA FDRTLASSLK SIREGVRLVG 
GEVEVDLPYL ALEMFESNDG GKIYYLIRGD DGRAVTGYRD LPMPGAGAPL YATSFYDAVY
RGEQLRMAAL RLPVHDVPSA RTRVVWVMVG ETIEARQALA REILVGSLLQ EGLLVVLALG
IVWLGVGRGL RPLNRLSAKV AARAEDDPTP LETLGLPSEV APLVESINQY VARTQRMQVA
RRRFFADAAH QLKTPLAAAQ AGVELALRPA ERERVSVHLR RVNGAVRQAA KIVQQLLSLS
RLESDVAPAI ERKPVALAKL ARSVTLDWSG VARARGIDLG FEQRASVDVM GRADLLGELV
GNLIDNAIRY AGDGAVITVR VAREGALARL EVIDDGPGIA PGERDAVFER FYRSHATLAV
EGTGLGLSIV REIARVHRGA VELDDAALAG GAAGDAGDRA GERRERGEAD WRGDSDGERS
AERPARGRGL VVRVTLPALA V