Gene Rcas_4343 details


Gene Information

Locus tag: Rcas_4343
Symbol:
ID: 5541856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5594309
End bp: 5595979
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 60%
IMG OID: 640896449
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001434385
Protein GI: 156744256
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
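
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5594309
end_bp = 5595979
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
coding_codons = gene_length_bp // 3 - 1

print(span == gene_length_bp)              # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```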


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.236595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.180185
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGACC GGTATGATCA ACAGTTGCGG CAGCTTGAGT CGCTGCTGCG CGTCAGTCGC 
GCGATCACGG CGCAGCTCGA TCTCACCAGT GTGCTCAATC TGGTCATCGA GGCGGCGGTC
GATTTGCTTG CCGGCAGCTC CGGCTTGATT GCCCTGCGCG ATGATGATGG CACGACGCGC
ATTTATGCGG CGTATGGACT GGCGCGCGAA ATCTGGCCCG TGTTCGACGA CCTGCTGGCG
ACGCCGCTTA GCGATCAGCA GGCGCTGGTG CATCGCCTGC GCGAAGCAGG CGCCAGCATC
GGGCTGCCAT TACGGCATGT CAGCGCGTTA CCGCTGATGT TTCGCGGCGC GACGGTCGGG
GTGATCTATG TCTTTCGCGC TGCGCTGAAC GTCGAGTTTA CTGCCGAAGA ACATCAACTC
CTGACCGCAT TCGCCGATCA GGCGGCGATT GCGGTCTCGA ATGCACGCCT GTTCCAGAGC
GTCTTGCGTG AGAAACAACA TCTTGATGCG CTGATCGAGA ACAGCGCCGA CGGCGTGATG
ATCCTCGATG AGCGCTGGCG CATTGCCACA TTCAACCGTG CAATGGAACT GCTGACCGGT
TGGAGCCGTG AGGAGGCGAT CGGGCGCCCG TGCGCCGAGG TGCTGGCTAT CCACACGCCG
CAAGGCGCCA ATCTCTGCCT TACCGATTGC CCGTTGCAAC GGCAACCGTT CGAGCCTAAT
CCGGTCGCCG AGGGGTGGAT CACCACACGG GACGGACGGC GTCTCTACAT CCAGAGTCGC
TACGCAGCAC AACGCAGCCC GCAGGGCGCG TTTCTTGGCG CCATCGCCAA CGTGCGTGAT
GTCACCGAGC AGAAGATCGA AGCTGAGATG CAGAACACCT TCATTTCAGT CATCTCGCAC
GAATTGCGCA CACCGGTCAG CATTATTAAA GGATACGCCG AGACGCTGGC GCGCCAGGAT
GCGGCATGGG ACGCAGCGAC TCTCCGTGAA GGACTGGCCG TTATCATCGA AGAGGCGGAT
CGCCTGGCGC AGCAGATCAA CACGCTGCTG GAAGCCTCCC GTTTACAGAC CGACAGTATG
CGCCTCGAGT TGAGCGACTG GTCGGTACGC CCTCTGGTGG AGCGCGTGGT CGAACGCTTC
GCACCACAGG CAGGCGACCG GTTCACGTTC CAGATCGACA TTCCCGACGA CTTTCCGCCA
GTCCATGCCG ATTATGAGCG GACCCGCACC GTGGTGGAGA ATCTGATCAG CAACGCGATT
AAGTACAGCC CGAACGGTGG GTTGATACGC ATCACGGCGC GGGTGAGCGG CGATTTTGCG
ATTATCTCGG TGAGCGATCA GGGTATTGGC ATACCGCTCG AAGAGCAGAA AAAACTCTTT
CGCCGCTTCT ATCGCGTCGA TAACCGCCTG CGGCGTGAAA CGCAAGGAGC AGGATTGGGG
TTGTTCCTGT CGCGCGTTAT TGTTGAAGCG CAGGGTGGGC GAATCTGGGT CGATAGCCGA
CCGGGGCGCG GGTCGCGCTT TTCGTTTACT GTGCCGCTGG CAACGCCAAT GCTGAGCGAT
CAGATGGCGT CGGGTGAGAT CGAAACTGCC ACATCTGTCG ATCATCCTGA GTCAACCGTA
GTAACGCTTC CACGGATGGA ACCGCCGCTG CTCGAGGATC ATGAACGTTA A
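
The GC content reported under Gene Information can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and only the first line of the sequence is used here for brevity.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full sequence to check the whole gene.
first_line = "ATGTTAGACC GGTATGATCA ACAGTTGCGG CAGCTTGAGT CGCTGCTGCG CGTCAGTCGC"
print(round(gc_percent(first_line), 1))
```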
 
Protein sequence
MLDRYDQQLR QLESLLRVSR AITAQLDLTS VLNLVIEAAV DLLAGSSGLI ALRDDDGTTR 
IYAAYGLARE IWPVFDDLLA TPLSDQQALV HRLREAGASI GLPLRHVSAL PLMFRGATVG
VIYVFRAALN VEFTAEEHQL LTAFADQAAI AVSNARLFQS VLREKQHLDA LIENSADGVM
ILDERWRIAT FNRAMELLTG WSREEAIGRP CAEVLAIHTP QGANLCLTDC PLQRQPFEPN
PVAEGWITTR DGRRLYIQSR YAAQRSPQGA FLGAIANVRD VTEQKIEAEM QNTFISVISH
ELRTPVSIIK GYAETLARQD AAWDAATLRE GLAVIIEEAD RLAQQINTLL EASRLQTDSM
RLELSDWSVR PLVERVVERF APQAGDRFTF QIDIPDDFPP VHADYERTRT VVENLISNAI
KYSPNGGLIR ITARVSGDFA IISVSDQGIG IPLEEQKKLF RRFYRVDNRL RRETQGAGLG
LFLSRVIVEA QGGRIWVDSR PGRGSRFSFT VPLATPMLSD QMASGEIETA TSVDHPESTV
VTLPRMEPPL LEDHER
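
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1 up to the first stop codon, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTTAGACC GGTATGATCA ACAGTTGCGG"))  # MLDRYDQQLR
print(translate("CTCGAGGATC ATGAACGTTA A"))           # LEDHER
```

Both outputs match the first ten and last six residues of the protein sequence above.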