Gene RoseRS_4039 details

Gene Information

Locus tag: RoseRS_4039
Symbol:
ID: 5211022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5058869
End bp: 5061988
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 58%
IMG OID: 640597627
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_001278333
Protein GI: 148658128
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains; [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
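
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 5058869
END_BP = 5061988


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                 # 3120, matching Gene Length
print(coding_codons(length))  # 1039, matching Protein Length
```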


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTGGT TGCTGCCGGT TTTTGTTTCG CTTCTCATTG TCGCAGCCCT GATCGAAGCC 
GCAGTTGCGC TGTACGCCTG GCGTCAGCGC ATGACTGCCG GGGCGATGCA ACTGGTGGCG
CTGATGGCAG CGGCCATTAT CTGGACGCTG GCGTATGGTC TGGAACTGGT CAGCACGTCG
CTGGCGATCA AAATGGTCTG GAACGGCATT GCGTATCTTG GCGTCGCATT TCTGCCCATT
CTCTACCTGC GCTTCATTAT CGCCTACACT CAACGCGCAA CATGGCTCCT GCGCTGGCGT
GGCGCACCCT GGTACATCAT TCCGATAGCC ACGATTGCGC TCAACTGGAC GAATAGCGCC
CATGGTCTGT ACTATCAGCA GGTCACACTG GTGAACGAAG GACCGCTTGT GCTCCTCAGC
GTTACGCCTG GTGCGTGGCT ATGGGTGTTT GCTCTGTATT GCTACATCTC GATAGCACTT
GGCGCTGTTC TGTTCTGGCG TGCAGCCCGT GATACGTCGC CGTTGCAGGG CGGACGCTCC
GTCATGGTTG CTATCAGCAC ATTCGCGCCA TGGATCGCCA ATCTGCTGCA ACTCTTCGAA
CTCCATCCCT TCATACTGCT GGACATCACT CCGCTTGCTG TTGCATGTTC AGGGCTGGCG
ATTATGTGGG GTATGTTTCG CTATCGGCTC TTTTCGCTGG CGCCGATTGC GCGTGAGCAT
GTTGTCGAGA TGATGAGCGA TGCGGTCATC GTTGTCAATG CGCGCAACCG GATCATCGAT
CTGAACCCGA CGGCGCAGCA CCTCTTCGCT CCTGATGGCA GTTCACTCTT CGGTCAACGG
ATCGAGACGC TGCTGGCGCC GTGGCCCGTG CTGAAGCAGG CGTGCCGCGA GCCGGTCGAA
CAGGTGTTAG AGGTCGAATA TGACCGATCA ACCCGGCAGC TCTTCGAGGT CAAAATCAGC
CCGCTTGTCG GTGACGGCGG CAGGATCAAC GGGCGGCTCC TGATCTGGCG CAATATCACG
CAGCGCAAGC GCAATCAGGC GTTGCTCCGC GATCGGTTGC GGTTCATTCA GTTGATCCAG
CACGCCGCCA ACGATTTCGT GCGCTGCGAA GTGACGCAGA TCGACGAACA GATCGTCGCC
ATGCTGCAGA AAGTCGCTGT CTTCACCGGC GTCGAGCGCA GTTATATCGC CCTGATTACG
CCAGAGGGCG ACCGCGCGAC CATTGCCTAT GAGTGGCTGC ATCCGGCGGC GACATACAGC
GATGAACGGT TGCAACCGGT TGTCCTGTCG AAGTTCGCCC GCTGGGTCGA ACAGTTGCGG
CGGGGCGAAG TTGTTGCGCT TCAGTATGCC GATATTCCCG AAACACCTGA GTATGAGGTG
CTGCGACAGG CACTCCATAC ACTCCAACTC AAAACGATGG CGGCTATTCC GCTGTTTGTT
GGTGCAACGT TCATCGGGTG GATCGGCTTT GACACAATGA GCTACGCATA TGAGCTCTCT
GAAGCCGCAA TAGAGGCGTT TCAGTTGACC GGCCAGTTAA TCGCCAGTGC CATTCATCGC
CAGCGCACCG AAATGGCGCT TCGCCAGCGT GAGCAGTACT TCCGTTCGCT ATTCGACAAT
ATGCTGGAAG GGGTGGCGCT GCACCGGCTG ATCTACGATG AGCGCGGCAG GGCCATCAAT
TATGAGATCA TCGATGTCAA CCATCAGTAC GAGCGCATCC TCAATCTGCG CCGCGATGAT
GTCGTCCATC GCCTTGCAAC CGACGTCTAC GGCACATCCG AACCCCCCTA CCTGGCAGAG
TTCAGCGCTG TGGCGGCGAC CGGTCAACCG GCGCATTTCG AGGTTTATTT TGCGCCAATG
CAGAAACACT TCTTCATCTC AGTCTCACCG CTTGGACCAG ACCAGTTCGC AACGATCTTC
TTCGACATGA CCGCGCGCAA ACAGGAAGAA GCCGAACGAG AGCAATTGCA GATGCAGTTG
ATGCAGGCGC AACGTCTGGA GTCCGTGGGG CGGCTGGCGG GCGGCATCGC CCACGATTTC
AACAATATCC TGACGGTCAT CAGCGGTAGC GCCGACCTGG CGCTGGCAAT GATGCCTCCC
GACTCTCCGG CATATGCAGA TGTGCAGACG ATCCAGCAGG GGGCCAGACG TGCTGCGAAT
CTGGTGCGCC AGTTGCTCGC CTTTGCCCGA CGTCAGCCCG GTCATCCCCA GACGGTTGAT
GCAAACGAGG TGATTGAAGG GATGCTCCCG CTCGTGCGGC GCTTGATCGG CGAGGATATT
CGCTTCGTGT GGGCCCCGGC GCCGTATCCC TGCGTAATCA TTATCGATCC ACATCAATTC
GAGCAGGTGG TGATGAACCT GATTGTCAAT GCGCGCGATG CCATGCCGGA TGGCGGTGCA
CTGACGATCA CCACATCCGT CACCGAGATG GCTGCCGGGG ATGAAGATTC GGCAGAGACG
CCGTTTGTTT GTATTACGGT TGAAGATACC GGCGTCGGCA TGGATGCGGA GGTCATGGCG
CATATTTTTG AGCCGTATTT CACAACGAAA GCCCCCGGTC AGGGTACCGG TCTTGGTCTG
GCGACCTGTC TGGGCATTGT CCAGCAGTAT GCCGGCTTCA TCCGCGTCGA GAGCGAACCA
GGGGAGGGTA GTCGGTTCGA GGTCTTTCTC CCCTATGCTC CTGGTCTGAC GACACAGGTG
GACGAGAGCC AGAGCGATGC CCAACCGGAA GGAGGACTGG AAACAATCCT GGTTGTCGAA
GATGAACCGA CGGTACGGCA ACTGGCGGTG CGGATACTGC GTGATTATGG CTATACCGTG
TTCGAAGCGG GCAACGGGCA CGAGGCGCAG CAGGTTGTTA CCACGCTGGC TGGCGCACCT
GTTCATTTGC TGTTGACCGA TCTGGTCATG CCCGGCATGA GCGGCGTTGA ACTGGCAACG
TGGTTTCAAA CGCGCTATCC TGGCGCGAGT ATTTTGTTCA TGTCCGGGTA TGCGCGTCAA
CTGCCGGATG GCGTCGACAA TCCAACCATC GCCTTCTTGC AGAAGCCGTT CAGCCGCCAT
ACCCTGTTGA CCCAGGTGCG CCGCCTGCTC GACGCGGTGG CGACACATGT TTCCGCCTGA
 
Protein sequence
MVWLLPVFVS LLIVAALIEA AVALYAWRQR MTAGAMQLVA LMAAAIIWTL AYGLELVSTS 
LAIKMVWNGI AYLGVAFLPI LYLRFIIAYT QRATWLLRWR GAPWYIIPIA TIALNWTNSA
HGLYYQQVTL VNEGPLVLLS VTPGAWLWVF ALYCYISIAL GAVLFWRAAR DTSPLQGGRS
VMVAISTFAP WIANLLQLFE LHPFILLDIT PLAVACSGLA IMWGMFRYRL FSLAPIAREH
VVEMMSDAVI VVNARNRIID LNPTAQHLFA PDGSSLFGQR IETLLAPWPV LKQACREPVE
QVLEVEYDRS TRQLFEVKIS PLVGDGGRIN GRLLIWRNIT QRKRNQALLR DRLRFIQLIQ
HAANDFVRCE VTQIDEQIVA MLQKVAVFTG VERSYIALIT PEGDRATIAY EWLHPAATYS
DERLQPVVLS KFARWVEQLR RGEVVALQYA DIPETPEYEV LRQALHTLQL KTMAAIPLFV
GATFIGWIGF DTMSYAYELS EAAIEAFQLT GQLIASAIHR QRTEMALRQR EQYFRSLFDN
MLEGVALHRL IYDERGRAIN YEIIDVNHQY ERILNLRRDD VVHRLATDVY GTSEPPYLAE
FSAVAATGQP AHFEVYFAPM QKHFFISVSP LGPDQFATIF FDMTARKQEE AEREQLQMQL
MQAQRLESVG RLAGGIAHDF NNILTVISGS ADLALAMMPP DSPAYADVQT IQQGARRAAN
LVRQLLAFAR RQPGHPQTVD ANEVIEGMLP LVRRLIGEDI RFVWAPAPYP CVIIIDPHQF
EQVVMNLIVN ARDAMPDGGA LTITTSVTEM AAGDEDSAET PFVCITVEDT GVGMDAEVMA
HIFEPYFTTK APGQGTGLGL ATCLGIVQQY AGFIRVESEP GEGSRFEVFL PYAPGLTTQV
DESQSDAQPE GGLETILVVE DEPTVRQLAV RILRDYGYTV FEAGNGHEAQ QVVTTLAGAP
VHLLLTDLVM PGMSGVELAT WFQTRYPGAS ILFMSGYARQ LPDGVDNPTI AFLQKPFSRH
TLLTQVRRLL DAVATHVSA
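
The relationship between the two sequence blocks above can be reproduced with a minimal sketch. It assumes the space-separated 10-character groups are purely cosmetic, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate` and `gc_fraction` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "ATGGTCTGGT TGCTGCCGGT TTTTGTTTCG CTTCTCATTG TCGCAGCCCT GATCGAAGCC"
last_line = "ACCCTGTTGA CCCAGGTGCG CCGCCTGCTC GACGCGGTGG CGACACATGT TTCCGCCTGA"
print(translate(first_line))  # MVWLLPVFVSLLIVAALIEA
print(translate(last_line))   # TLLTQVRRLLDAVATHVSA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared against the listed protein sequence and the GC content field.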