Gene Rcas_0996 details


Gene Information

Locus tag: Rcas_0996
Symbol:
ID: 5538462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1301442
End bp: 1302632
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 60%
IMG OID: 640893139
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_001431122
Protein GI: 156740993
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
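
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1301442
end_bp = 1302632

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1191
print(gene_length % 3)  # 0
print(protein_length)   # 396
```

Under these assumptions the computed values agree with the listed Gene Length (1191 bp) and Protein Length (396 aa).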


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.778544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATGGA TCATTGCTAC GCTCTTGCTG GCGCTTGTGT TATCCCTCAT TTGGGGATGG 
CGTCAGAGAA CTCGTCTTCA ACCGCCTCCA TCATTACCCG AATCACCGGT CCCCTCCCCG
GATGCATTCT CCTCGCTCTT TCACGCTGCT GCTGCTGCGT TCGACGATGG ATTGGTGCTG
GTCGATGCTG ATCGTCGCGT GCAGTATCTC AATGCGCATG CGGAAGAGTT GCTGAACCTC
AGTCGAACGG TGAGCGTCGG GCAGGGGTTG ATCATGCTGG CGCGCGATTA CCAGGTGGAC
GCCATGGTCG AAGAGGCGAT CGCGTCTGCC GAGCCGCGCG AAAGTATCCT TCAACCGCTG
GGAAGACAAC GGACGCTGCG ATTACGAGCG GTTCCACTCG ATCATGGGAC GAAGGGCGCG
CTCCTGCTGG TGCGCGATGT AACGCAACTG AGCCAGCTTG AACGGTCGCG GCGCGATCTG
GTCGCCAATG TGTCGCACGA ACTCCGCACC CCGCTGGCAT CGATCAAGTT GCTGGTCGAA
ACGTTGCAAT CCGATCCGCC GGCGCCGGTG GCACAGCGCA TGCTGAGCCA AATTGCCCAG
GAAGTCGATG CAATTACGCA ACTGGTTGAG GAGTTGCACG AACTATCGCA GATCGAGTCA
GGACGGGTGG CGCTTCAACT CGTGCCCACA CCGCTGGCGC CGCTGGTCGC GCACACAATC
GAACGCATTC GCCCGCAGAT GGAACGGAAG AGCATTCGAG TCATCGCCTG CCTGCCCGAT
GATCTGCCGC CTGCGCTGAT CGATGGGAAC CGCGTCGGTC AGGTATTGCT CAACCTGTTG
CACAACGCCT CGAAATTTAC CCCCGATGGC GGGCAGGTCG CCATCGAAGC CTCGGTGATC
ATGGTAGGCG ACGGTTCACC GCTGCCGCCC GGAATACCAT CGTCGCACCT TCCCGGTCAA
TGGTTGCTGG TCTCGATAGC CGACAACGGC ATCGGTATTC CGGCGCGCGA TCTCGCGCGC
ATTTTCGAGC GCTTCTACAA AGTGGATCGC GCGCGAACGC GCAATGCCGG CGGAACGGGA
CTGGGTCTGG CGATTGCCAA ACATCTGGTC GAGGGGCACG GCGGACGTAT CTGGGCAACC
AGTGTGGAAG GAGAAGGGAG TATCTTCTAT CTCACGTTGC CGGTGGCATA G
 
Protein sequence
MEWIIATLLL ALVLSLIWGW RQRTRLQPPP SLPESPVPSP DAFSSLFHAA AAAFDDGLVL 
VDADRRVQYL NAHAEELLNL SRTVSVGQGL IMLARDYQVD AMVEEAIASA EPRESILQPL
GRQRTLRLRA VPLDHGTKGA LLLVRDVTQL SQLERSRRDL VANVSHELRT PLASIKLLVE
TLQSDPPAPV AQRMLSQIAQ EVDAITQLVE ELHELSQIES GRVALQLVPT PLAPLVAHTI
ERIRPQMERK SIRVIACLPD DLPPALIDGN RVGQVLLNLL HNASKFTPDG GQVAIEASVI
MVGDGSPLPP GIPSSHLPGQ WLLVSIADNG IGIPARDLAR IFERFYKVDR ARTRNAGGTG
LGLAIAKHLV EGHGGRIWAT SVEGEGSIFY LTLPVA
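
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates DNA codon by codon. It uses the standard codon assignments, which NCBI translation table 11 shares with table 1 for elongation (table 11 differs in the start codons it permits). The function name `translate` and the variables `first_line` and `last_line` are hypothetical; the two inputs are the first and last lines of the gene sequence in this record, and both happen to begin in frame.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGGAATGGA TCATTGCTAC GCTCTTGCTG GCGCTTGTGT TATCCCTCAT TTGGGGATGG"
last_line = "AGTGTGGAAG GAGAAGGGAG TATCTTCTAT CTCACGTTGC CGGTGGCATA G"

print(translate(first_line))  # MEWIIATLLLALVLSLIWGW
print(translate(last_line))   # SVEGEGSIFYLTLPVA
```

The two outputs match the start and end of the listed protein sequence. For real work, an established library such as Biopython would be a more robust choice than this hand-built table.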