Gene Rcas_4334 details


Gene Information

Locus tag: Rcas_4334
Symbol:
ID: 5541847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5586757
End bp: 5588034
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 57%
IMG OID: 640896440
Product: histidine kinase
Protein accession: YP_001434376
Protein GI: 156744247
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
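
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 5586757
end_bp = 5588034

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1278
print(protein_length)  # 425
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.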


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.35369
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCAGG ATGCAACTGA GATTGTACCT CTTCAACAGA GCGAAGGTCG GCAGAGTAGT 
TATGCTGAAA GCATCGCATT GCCGCAGCGT GCTGGACATC TGTCCTGGAT CGCGCAGACC
GGTCTGGCAC TGACACAGTG TCGCACGGCT GCTGATGCGC TCAGTACGAT TGCGGCGCGG
CTCGATGCAG CGCCGTTTTT GCTGCGCGGC TATGTCATTC TGCTGGAAGA TGATACGTTG
CGCGTCAGGC GGGCTGTCTC GTTTGGAAAG GCGCCGGCAT TACCGGTTGC GCTGTCGCTG
TCCGGTGGTC TCTGTGGCTC AGCCATAGTG CAGAAGCGAG TGGTTGCGTC ATCAACCGGC
GGAATTGCTC TTATGTCGTG GGAACGAACC CTGGCAACCG ACGCTGCGCT GATCTGCGCG
CCCTTTACCA GCGATAGAGC GAAGGGTATT ATTATTGCCG CGTTGCCTGA TGGCGCGCAT
GATGATGTTG AGATGAGAGC GTGTCTCCAG GCGATAGCCG CTCTCGCAGG TGCAACGTTC
AATCGCGCCG CCGAGTGCGA GGCGCAACTG CTCAAGCGCA TGCGCCAGAT CGAGCAGGCG
CATACGGAGC GCCTGGCGCT GGTCGGACGC CTGACCGCCG CACTTGCCCA TGAGATCAAT
AATCCGCTCC AGGCGATTGC CAATACGCTC TATCTGTTGC AGCATCGTCC GCTCGACGAC
GAGAAACGCC AGCGCTATCT GGCGATGGCG CAGCAGGAAA CGGAACATGT GATAGCCGGT
GTTCGGCGCA TGCTCGATCT GTATCGTTCG TCGGGGCAGG AGAAACGACC GGTTGCGTTG
CATCGAGTGA TCGATCAGGC GCTGCACCAG GCTCATGATA TTCTGAATGA ACGCGCGATC
GATCTGTGTC GTGAATGGAC ATCCGACGAT CTGCGGGTGC TGGGATTTAC CGGTCATTTG
CGCTATGCCT GCTACAATCT GATTCTCAAT GCTATCTATG CAATGCCCAA AGGGGGACGG
TTGACGATTC GCACGTATCG AGAGGTCGAG GGCGCAACGC ATCTGGCAGT GGCAGAGTTT
GCAGATACAG GCGTTCCTAT CGATGATGCC GATCTGCATC GGTTGTTCGA ACCAGACGGT
CCTGTGCGGG GTGAGGCGAA CGGTATCGAA TTGCCGTTAA GCTACAGTGT CATTGAGCAG
CATAACGGCA CTCTGACCGT CCATCGGCAT GGGGACGAAA TGGTGTTTCG TGTAAGCCTT
CCCGCCATCA ATTCCTGA
 
Protein sequence
MHQDATEIVP LQQSEGRQSS YAESIALPQR AGHLSWIAQT GLALTQCRTA ADALSTIAAR 
LDAAPFLLRG YVILLEDDTL RVRRAVSFGK APALPVALSL SGGLCGSAIV QKRVVASSTG
GIALMSWERT LATDAALICA PFTSDRAKGI IIAALPDGAH DDVEMRACLQ AIAALAGATF
NRAAECEAQL LKRMRQIEQA HTERLALVGR LTAALAHEIN NPLQAIANTL YLLQHRPLDD
EKRQRYLAMA QQETEHVIAG VRRMLDLYRS SGQEKRPVAL HRVIDQALHQ AHDILNERAI
DLCREWTSDD LRVLGFTGHL RYACYNLILN AIYAMPKGGR LTIRTYREVE GATHLAVAEF
ADTGVPIDDA DLHRLFEPDG PVRGEANGIE LPLSYSVIEQ HNGTLTVHRH GDEMVFRVSL
PAINS
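
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch and not part of the original record: `clean_sequence`, `gc_percent`, and `translate` are hypothetical helper names, and the codon table is the standard assignment, which is assumed here to be adequate because translation table 11 uses the same codon-to-amino-acid mapping as the standard code. Only the first 30 bases of the gene are used, for brevity.

```python
from itertools import product

# Standard codon table in TCAG order (the third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence listed above.
gene_start = clean_sequence("ATGCATCAGG ATGCAACTGA GATTGTACCT")
print(translate(gene_start))  # MHQDATEIVP
```

Applying the same helpers to the full Gene sequence block should allow its length, GC content, and translation to be compared against the Gene Length, GC content, and Protein sequence entries.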