Gene RoseRS_3876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3876 
Symbol 
ID5210858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4851486 
End bp4854554 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content58% 
IMG OID640597471 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001278179 
Protein GI148657974 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGATC AGCATAGTAT CTGGAAAGCG ATCTCTGTCT TGTTCTCCGC CCACTCCGAA 
TCCTTGCGCG TTACAACGCT CAAGATTATT GCTATCGCGC TGGTCATCCT GCTGGCGTTG
CAGATCACCG TTTCCGAGTT CATCATCGGG CGCAGTTTCT ACGAACTCGA AGAACGCAGC
ACCCGCAGCG CCATGCAGCA AACGCTCAAA ACGCTCCAGA ACGAGATCAA TGTGCTGTAC
GGCAATGCGA AAGACTATGC TGTCTGGGAT CCCACCTACG AGTTCATCGA ACAACGTGAT
GTTGCCGGGT ATGTTGATGT TCACATGACG ACGGAAGCGT TGCTCGCCAT TCGCGTATCG
TATGTTGCGT TCGCCACTCC TGCCGGCGAG ATCATCTATA CCCGTCGCTT CGATCTGCGC
GACGGTCGTG ATCTGCCGAC GCCAGCGGAA TTTGCTTCGT TTGACGGCGA CAATGCGGTG
TTTTTGCGCA CCGCAGCGCA GACCGAAGGC ATCAGCGGCG TGGTCGTGGC GGATGGGCAA
CCAATGCTCA TTGCAGCCCA TCCAATCCTG CGCAGCACGG GTGCGAGCGA ACCGCGCGGC
GTGCTGATCC TGGGACGCGA CTTCAACGAC GATGAACTGG ATCATCTCAG CGATCTCACC
GGTTTCCCGG TCTCGTTCAC GCTGACCGCG AACGCCGCAG TTGCGCCAGA TTTCGATCTG
GCATACCGGT TGATGACTCC TGACACGCCG ATCATTGTGC GCCCGATGTC ATTCTCCGAC
GACCGGGTGT ACGCATACGC ACAGATCAAC GATCTGCGCG GCGGCGAGGG TATCATCCTC
CGCATCAACG CCCCGCGCGA TATTGTTCAG TATGGGCAGT CTTCATCACG CCTGTACATG
CTGATCATGC TGCTGGTCAT TGGCGCATTC GCCGCCGTCA TGATCCTGCT CCTGGAACGC
AACGTCCTGT CGCGCATCAT CGCCCTCAGT CTCCAGGCAG GGCGGATCGG GCGCACTGGC
GACGTGCAGG CGCGCCTGGC GGTGATCGGT AACGATGAGG TGGCGCAGCT CGGCAGAGCG
ATCAACGCCA TGCTCGACGA CATTGCACAG GCTGCCCGAC GCCTGGCGGA AAGTGAAGCG
CGCTACCGGC AACTGGTCGA AATATCCCCT GAAGCGATCA TCGTCCACGA CGGCAAACGG
ATCATCTACA CCAACCAGGC AGGTGCGCGC CTGGTTGGTC ACACCGACCC ATCGCAGTTG
ATCGGCGCTG ACGCCACACC CTTTTTGCCA CCCGCACTGC ACCTGGAAAC AGCAGAGGGA
GTCATGCGGT ATGAGCGCGA TCTTACCATC GCTGACGGAA CAGGCATATC CCTTGAACTT
GTCGCAGCCC CGTTCCTGGC TGAAGGTAAA CCCGCGTGGC AGATCGTTGC GCACAACATC
ACGGTGCGGA AGCAAACTGA GGAGGCGCTG CGACAGGCAA AAGAATGGGC GGAAGAGGCA
AACCGCACAA AAAGTCGCTT CCTGGCGAAT ATGAGCCACG AACTCCGCAC ACCACTGACG
ACGATTATCG GGTATGCCGA CCTGATTACG ATCTCTGTGC ACAGCGGGGA ATTCGATCAG
GTTGCCAGCG ATATTGCGCG GGTACGAGAT GCAGGAAAAC ACCTGCTGGC GATTATCAAC
GATCTGCTCG ATCTGTCGAA AATCGAAGCC GGACGGATGG AAATCCACAG CGAGCGATTT
TCGGTGCGTG CGCTTGCAGA AGAAGTGATT GCCAGCATGC GCGTATTTGC GCAGAAACGG
AACAATGATC TCACCTTGAA TATCGACCCG ACCGTCGAGA TGATGCACTC CGATGACGTG
CGCGTGCGGC AGATCCTGTA CAACCTGGTG CACAATGCGT GTAAATTTAC TGAAGATGGC
GCCGTGACTC TCGATATCGC CCGCACGGTG AGTGACCATG ATCACGCCGC ACTCCTCGTG
TTTACTATCA GCGATACCGG CATCGGGATG ACCGCCGATC AGATCGCCGG TCTGTTCCGT
GAGTTTACCC AGGCAGACTC ATCAACAACG CGCAAATACG GCGGAACTGG ACTGGGGCTT
GCGCTGTGTC GGCGCCTGAC TCATCTGCTT GGCGGTAAGA TCACGGTCAC CAGTCAGCCG
GGCGTTGGAA CGACATTTGT GGTCACACTT CCCGAACATC TGGCGTCTGC AACTGCATCC
GAGCCAGCGC CGGTCGATGC AGCGCCTGCA TCAGACACGC CGCCTGCTCC TGAATACAGT
GAGGACACCA GGCGTCTGGT GCTGTTGATC GACGATGATC CTGCTGTGCG CGATCTGCTG
CCGCGCATGC TGGAACGCCC CGATCTCCAT ATCGAGACTG CTGCGGACGG AACGAGCGGG
CTGGAACTGG CGCGCCTGCT CATGCCCGAC CTGATTATTC TGGACATCCT GATGCCGGAG
ATGGACGGGT GGACCGTGCT CCGTGAGTTG AAGGCGTCGA ACGAAACCGC TGCCATCCCT
GTTATACTAC TTACGATAGC AGACGACAGA GAACACGGGA TGCTTCTGGG AGCTGCCGAA
ATGATCCATA AACCGGCAGA CCTTGATCGG CTTGATCAGC GCATTCGTGC ATTAACCCGG
GGGCGATCGG CACAGGTGGA AGCCGGTAAC CAGCAGATTT TGATCGTTGA GGATGATGAG
ACAGTGCGCC AGTATCTCCG CCGCACTCTG GAGCGCGAAT GTGAAGACTG GATTATCATG
GAAGTCGCCG ACGGTCAGAC GGCGCTTGAA CGTTGCACAA CCGCCATGCC GGACGTTATT
GTGCTCGACC TTATGATCCC CGGTATCGAT GGCTTACAGT TCATCGAAGC ATTGCGCGCG
CTTCCCAATG GATGTTCGAC GCCAATTATC GTTGTCACTG CCCAGGATCT GACCGCTGAT
GAACGCGAGC GTCTCTGTCA CTCAGTCACC CGCATTCTCT ACAAGGGTTC CTTTCACTGC
CACGAATTCG CGCGCGAGGT GCGCGCAGCC ATTGCCACGT ATGCACAGTT ATACCCCCTG
GAGGTTTGA
 
Protein sequence
MIDQHSIWKA ISVLFSAHSE SLRVTTLKII AIALVILLAL QITVSEFIIG RSFYELEERS 
TRSAMQQTLK TLQNEINVLY GNAKDYAVWD PTYEFIEQRD VAGYVDVHMT TEALLAIRVS
YVAFATPAGE IIYTRRFDLR DGRDLPTPAE FASFDGDNAV FLRTAAQTEG ISGVVVADGQ
PMLIAAHPIL RSTGASEPRG VLILGRDFND DELDHLSDLT GFPVSFTLTA NAAVAPDFDL
AYRLMTPDTP IIVRPMSFSD DRVYAYAQIN DLRGGEGIIL RINAPRDIVQ YGQSSSRLYM
LIMLLVIGAF AAVMILLLER NVLSRIIALS LQAGRIGRTG DVQARLAVIG NDEVAQLGRA
INAMLDDIAQ AARRLAESEA RYRQLVEISP EAIIVHDGKR IIYTNQAGAR LVGHTDPSQL
IGADATPFLP PALHLETAEG VMRYERDLTI ADGTGISLEL VAAPFLAEGK PAWQIVAHNI
TVRKQTEEAL RQAKEWAEEA NRTKSRFLAN MSHELRTPLT TIIGYADLIT ISVHSGEFDQ
VASDIARVRD AGKHLLAIIN DLLDLSKIEA GRMEIHSERF SVRALAEEVI ASMRVFAQKR
NNDLTLNIDP TVEMMHSDDV RVRQILYNLV HNACKFTEDG AVTLDIARTV SDHDHAALLV
FTISDTGIGM TADQIAGLFR EFTQADSSTT RKYGGTGLGL ALCRRLTHLL GGKITVTSQP
GVGTTFVVTL PEHLASATAS EPAPVDAAPA SDTPPAPEYS EDTRRLVLLI DDDPAVRDLL
PRMLERPDLH IETAADGTSG LELARLLMPD LIILDILMPE MDGWTVLREL KASNETAAIP
VILLTIADDR EHGMLLGAAE MIHKPADLDR LDQRIRALTR GRSAQVEAGN QQILIVEDDE
TVRQYLRRTL ERECEDWIIM EVADGQTALE RCTTAMPDVI VLDLMIPGID GLQFIEALRA
LPNGCSTPII VVTAQDLTAD ERERLCHSVT RILYKGSFHC HEFAREVRAA IATYAQLYPL
EV