Gene Rcas_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1122 
Symbol 
ID5538588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1454781 
End bp1455854 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content58% 
IMG OID640893256 
Producthistidine kinase 
Protein accessionYP_001431239 
Protein GI156741110 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.277853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.812788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACG CCCCTCCGTC CCTCAAAGAA CTCTACGACA AAGCGAACGC TGATCTGGAA 
AAAGCGCGCC GTGAATTGAA CGAGATCGAA GCGCTGCTCC GACAGACATC GAATGAAGTC
GAAAAGTTAC AACAGCGTGA GTTGACGGTC TCGAATCGCC TGCGCGATCT GGATGTCAAT
ATTGATCGTT ACAGCAAAGC CGATGTTAAG AATTATTATG CGTCTGCGCA GGAAGTGCAG
ATGCGTCTCC TGACCATGCG CAGCCAACTC GAACAACTCC AGTACCGTCA GCAGGCGACG
AAGCAGCGCC AGAGTCTGTT GTTCGAACTG ATCACTGCGC TCGAACCGCT CCTCGGCGTC
GCCACACCCG CCGGCAATGC TGGCGGGATG GTCGGAAGCG ACCAGTTAAT CGCCGACATC
ATCCAGGCGC AGGAAAAGGA GCGTCTGCGC ATCTCACTCC AAATGCACGA TGGTCCTGCT
CAGTCGATGA GCAACCTGGT GCTCCGCGCC GAAATCTGCC AGCGACTACT CGACCGCGAC
CCCGAAATGG CGCGCGCTGA GCTTGGTGCG CTCAAAAATG CGATCAATGC CACGCTTCAG
GATACGCGCC GTTTCATTTT CGATCTGCGC CCCATGATTC TCGACGATCT CGGTCTGGCG
CCGACCCTGC GCCGGTACGT GCAACAGGTC AGCGAAAAGA ACAAACTCGA GATCAACCTG
ATGGTGCAGA ATCTGGAAAT GCGGCTCCCT CCGCACTACG AAGTCGCCAT TTTCCGATTC
ATTCAAGAAG CGCTCAACAA CGTCATCAAG CACGCCAACG CCTCCCAGGC GCGGGTCCTG
GTCAGCATCC GCGATGATGT CGGCGCAGCG CGTGTTATCC ATGTCTCGGT CGAAGATGAC
GGCAGCGGGT TCCACGTCGG CGAGGTGCTG TCCGACGACA GCGGGCGGCG CAACATGGGC
ATCGCCACCC TACGCCAGCA GGTCGAAACG CTTCTCCGAG GCGAGTTCGG CATCGAAAGC
GCCGTTGGTC GCGGCACGCG CGTCGAAGCG ATTATTCCCT TGCCTGCGGG TTAA
 
Protein sequence
MTDAPPSLKE LYDKANADLE KARRELNEIE ALLRQTSNEV EKLQQRELTV SNRLRDLDVN 
IDRYSKADVK NYYASAQEVQ MRLLTMRSQL EQLQYRQQAT KQRQSLLFEL ITALEPLLGV
ATPAGNAGGM VGSDQLIADI IQAQEKERLR ISLQMHDGPA QSMSNLVLRA EICQRLLDRD
PEMARAELGA LKNAINATLQ DTRRFIFDLR PMILDDLGLA PTLRRYVQQV SEKNKLEINL
MVQNLEMRLP PHYEVAIFRF IQEALNNVIK HANASQARVL VSIRDDVGAA RVIHVSVEDD
GSGFHVGEVL SDDSGRRNMG IATLRQQVET LLRGEFGIES AVGRGTRVEA IIPLPAG