Gene Rcas_3420 details

Gene Information

Locus tag: Rcas_3420
Symbol:
ID: 5540919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4457044
End bp: 4459203
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 59%
IMG OID: 640895538
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001433488
Protein GI: 156743359
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain; [COG2205] Osmosensitive K+ channel histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
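
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4457044
END_BP = 4459203
GENE_LENGTH_BP = 2160
PROTEIN_LENGTH_AA = 719


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the reported 2160 bp and 719 aa are consistent with the listed coordinates.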


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.688425
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.99318
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACCGC AGACTCCAAC AAGTGCAACC CGTGAGGACG TTGAGCAATC CTCTGTCCTC 
GATCCGCGTT TGGATGGTCT AATGCGCATC GCACAGGCGG CTGAGACTGC GGCGACGCTG
GAGCAGTTGC TGTACCGTTC GCTGATTGAA CTGAACCGCT TGTTCCAGTC GGATCGCGCA
TTTGTGTTGT TGGTGGATGC TTCCGGTCGG ATGACGCTTG CGTGTGAACA TCCATCGCCG
GGTGACGCGC CTGCCCTGAC GCTCGATAGT CTGGCAGGCG CGATTGATGT CATACGGATG
CGGCGTCCGG TTGTCATCGA GACGGAAAAG GCGCGGGAAG GGTGGGATAT TGCACTGCTC
CGGGCGCGCG GCGTGCAGAC GATGATCGTC ACCCCGCTTA TTGCGTGTGA TGAAGCCCTT
GGGGTTCTGG CGGTGTGCAG CGCCAACGCT GCGCGGGTGT TCGGGTCGGA GGAGGTGGCG
TTCATCAGAA CGCTGAGCGG CCAGATGGCG CTGGCGATCG CATCGTTTCG CAGCCGTGAC
GCTGCCGAGC GGCGCACCCA AGAACTCAAA ACGCTCAACG AGATTGCGGC GACGGTCACC
TCAACCCTGG ATACGCACGA AGTATACCGC CTGGTGGTTC AGCAACTCAG TGATTACTTT
CACGTCGAAG CCGGTTCGCT GTTGCTCCTT GATGAGTCAA CCGGCGATCT TGAGTTTGTT
ATGACCATCG AAGGCGGCGA GGAGAAGCTG GCCGGCATCC GGGTGCCGGC TGGACAGGGC
GTCGTCGGGC ATGTAGTGCG CACCGGTCGG TGGGAAATCG TGCACGATGT CACCCGCGAT
CCGCGATTTT ATTCCAAAAT CAGCGAAACG ACCGGTTTCC CGACGCGCTC GATCCTCTGT
GTGCCGATGA TTGCCAGGGG GCGTGTGATC GGGGCGATTG AACTGCTCAA CAAAATCGGC
GGTGATTTCG ATGAGGAGGA AGCACAGCGA TTGATGCGCA TGGCCGCATT TATTGCGGTT
GCTATCGAAA ATGCGCGCCT CTTCCAGCAG ATCGCTGCCG GACGCGATCG CATGGCATCC
ATTCTGAACT CAACAGCCGA TGGCATTCTG ATGGCGGATA TGCGGGGTGA CATTCAGCTT
GCCAATCCGC TGGCAGCGCG GATATGCGCA TGTACAGAAG AGGCCTTGAT CGGACGACGG
ATCGATGATG TGGTGACTGA GTTGCAGGCA CGCGCTCACG AAGTCTCGGC GCCTGCGTGG
GGTCAGGATG CGTCGGCGCC GGTGCAGATC AGGGATCTGG CGCTGACCGA TGGAATGCAC
CGCTATGTGC GCCTGTTGCG GCTCCCCGTC TATGATGCGC ACAATGAACC CCACGGCGAG
TTGCTCATCC TGCGTGACAT TACCCAGGAA CGCGAACTGG AGCAGTTGCG CGAAGATTAC
ACCAGCATGC TGGTGCACGA CCTGCGCGCG CCGTTGACAT CGATCATGAA CGGCATCATG
ATGCTCCAGC GCGGTATCGT TGGTCCGGTG AACGAGCAAC AGCAGGAGTT GCTGAAGATT
GCGTATCAGG GCAGCCAGAC GATGCTGCAC CTGATCAACA CCCTGCTCGA CATCTCAAAG
TTGGAGCAGG GTCAGATGAC GCTCGATCTC AAGCCACTGC CCATTTTCAG TGTGATCGAT
CAGGCAATTG AACGCCTTCA CAACCTGGCG AGCAGTCGCC ATGTCACTAT TGAGCAGCGC
CTGGCGCCGT ACCTTCCGCC GGTTGAGATC GATGGCGAAA AGATCGTTCG GGTACTCCAG
AATCTGCTGG ATAATGCAAT CAAGTTCTCG CCGCCGCAGA GTGTGGTGAC GATTGGGGCG
TTTCTGGCCG GCAGCACGTC TCCCCTTCCA GAAGATGCTC CTGTGCATCT TTCGATTGAG
GGTGAGGATT ATCTGGTAGT ATGGGTGCAG GATCGCGGTC CGGGCATACC GCCAGCCTAT
TTTCAACGCA TCTTCGAGAA GTTTGGTCAG GTGCGCGGGC GAAAGGTGCG CGGGACCGGT
CTGGGGTTGA CGTTCTGCCG CCTGGCTGTC GAGGCGCACG GCGGGCGTAT CTGGGTCGAA
AGCGTCGAGG GGTCGGGGAG CGTCTTTGCG TTTACCCTGC CGGTGAGACG TGATAGTTGA
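
The sequence above is printed in blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the reported GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene is used as a sample; pasting the full sequence would give the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short sample.
first_line = clean_sequence(
    "ATGAGACCGC AGACTCCAAC AAGTGCAACC CGTGAGGACG TTGAGCAATC CTCTGTCCTC"
)
print(len(first_line), first_line.startswith("ATG"))  # 60 True
print(f"GC fraction of sample: {gc_fraction(first_line):.2f}")
```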
 
Protein sequence
MRPQTPTSAT REDVEQSSVL DPRLDGLMRI AQAAETAATL EQLLYRSLIE LNRLFQSDRA 
FVLLVDASGR MTLACEHPSP GDAPALTLDS LAGAIDVIRM RRPVVIETEK AREGWDIALL
RARGVQTMIV TPLIACDEAL GVLAVCSANA ARVFGSEEVA FIRTLSGQMA LAIASFRSRD
AAERRTQELK TLNEIAATVT STLDTHEVYR LVVQQLSDYF HVEAGSLLLL DESTGDLEFV
MTIEGGEEKL AGIRVPAGQG VVGHVVRTGR WEIVHDVTRD PRFYSKISET TGFPTRSILC
VPMIARGRVI GAIELLNKIG GDFDEEEAQR LMRMAAFIAV AIENARLFQQ IAAGRDRMAS
ILNSTADGIL MADMRGDIQL ANPLAARICA CTEEALIGRR IDDVVTELQA RAHEVSAPAW
GQDASAPVQI RDLALTDGMH RYVRLLRLPV YDAHNEPHGE LLILRDITQE RELEQLREDY
TSMLVHDLRA PLTSIMNGIM MLQRGIVGPV NEQQQELLKI AYQGSQTMLH LINTLLDISK
LEQGQMTLDL KPLPIFSVID QAIERLHNLA SSRHVTIEQR LAPYLPPVEI DGEKIVRVLQ
NLLDNAIKFS PPQSVVTIGA FLAGSTSPLP EDAPVHLSIE GEDYLVVWVQ DRGPGIPPAY
FQRIFEKFGQ VRGRKVRGTG LGLTFCRLAV EAHGGRIWVE SVEGSGSVFA FTLPVRRDS
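
The protein sequence can be reproduced from the gene sequence by in-frame translation. The following is a minimal sketch rather than the database's own procedure: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAGACCGC AGACTCCAAC AAGTGCAACC"))  # MRPQTPTSAT
```

Passing the full gene sequence to `translate` should yield the protein listed above if the two records agree.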