Gene Rcas_0284 details

Gene Information

Locus tag: Rcas_0284
Symbol:
ID: 5537746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 350981
End bp: 352756
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 59%
IMG OID: 640892448
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001430435
Protein GI: 156740306
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
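
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 350981
END_BP = 352756
GENE_LENGTH_BP = 1776
PROTEIN_LENGTH_AA = 591


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1776
print(expected_protein_length(GENE_LENGTH_BP))  # 591
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.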


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCAG CAGAGCGCAT CCCGCTCCCA TCGAAGATTT CTCTGCAACG CCAGATGATC 
GGCGGCGACC GTTTCTACAA TCTATCGCGC TGGGGCGTGA CTATCCTGCT CATCGGGATT
GTTCAACTGA TTGCCGGTGA GTCGCTCTGG TCCCCCACTG CTGCGCACAG CCCGATGATC
GTCGTGATGT GGGGATACGT CGTCTTCACT ACGCTGGCGA CGATTATGGT GTTTATTCCG
CAAATTGCGG CGTTCCTGCG GTTCAGTTTT GCCGTCGATA TTCTGGCATT GCTGCTGCTG
ACGCTCTTCA ACCCCGATCC GCGCGAGATT CTCTATCCGC TCTTCCTGCT ACCGCTCGTC
AATGCGGCAT TCCGCCTGAA TTCGTCGGCA AGTTTGCTAA CCGGGCTGAT CACCGCATTC
GCGTACATCG GAGCCTATCT GATCGCGCGC ATTGGTCCGG GCAACGGCTC GATTCCGTCC
GATCCACTCG GGTATGTGGG GCTGACGTTG CGCGCGCTGG CGCTCGTGTT CATTCCCTGG
ATTACGGGGG GGTTGGCGGA ACGCTGGAGC GCCAGCAACC GGTTGAGTGT GGCGCTGGCG
GAAGAGAAGG CGGCTGAGGC GTTGCGTGAG GCGAATGCCT ACCGCGATCA GACGCGCGCG
CTGTATGAGG CAGCGTATAC GCTGGCGACG ACCAACGATT ATCAGAACGT GCTCGAAGTG
ACGCTTGACG AAAGCCAGAA ACTGGCGCCG TATACGTGTG GGATGGTGCT CCTCTCCACC
GGGCAGACCA ATCAACTGTT CGTCGCTTCG TCGCGCGGGC TGGACCCCAA CGACCAGGAA
CGCATTGTAA CGATCGGACC CGGCGCCATC GGGCGAACGA TACGCGGTTC GGACCCGGTG
GTGCTGGATA ATTTCCGGGA TGACCCCGAA TTGATGGCGT TCACATCGCT GCACCGGCAT
CGCACCGCCT GCCTGTTGCC GCTCCGCTTC GGACTGAACA CATACGGGGT CATGGTTTTC
GCCAGCGACC ACGAACGCGC CTTTGGACAC GAAGAGGTCG AGAAACTCGC GGCGCTGGCA
TCCTATGCGC TGGTGGCGCT ACAGAACGCC CAACTGATCT ACGATCTGCA ACAGGAACGC
ACGAAACTCC TGTCGAAAGA GGAAGAGGTG CGCCATCAAC TGGCGCGCGA CCTGCACGAC
GGACCGGCAC AGGCGCTGGC GGCAATCACG ATGAACGTCG AGTTTATCAA GCGCCTGCTC
GAACGCGACC CGGCGCGCGT CCTGCCCGAA CTCGACAAAC TCGGACAACT CGCCAAGCGC
ACCACTCACG ATGTGCGCAC GATGCTGTTC GAATTGCGTC CGCTGGCGCT CGAAACGCAG
GGTCTCGATG TGACGCTACA ACAGTACTTT GAGCGCTTCC GCGACAATCC GACCAAAATC
ATTCTCGAAG CCGATACCAT TGATGCGCAA CTCGACACGA AAGTCGAGGG TACGCTGTTC
AACATCGTTC AGGAAGCAGT TAATAATGCG CTGAAACACG CAAAAGCGCA GCACATCTGG
GTGCGCCTGC GCCAGACGCC AACGACGCTC GAAACGATCA TTGAAGACGA CGGGCGCGGC
TTTGATGTCG AGAAGGTGCG TGCCAGCTAC GATCAGCGCG GTTCGTTCGG CTTGCTGAAC
ATCGAGGAAC GCGCCTCACT GATGGGTGGC GTCGCCGAAG TGACCTCCAC GCCAGGCAAA
GGCACGACGT GGAAGGTGAT TGTGCCACTC CAATGA
 
Protein sequence
MDAAERIPLP SKISLQRQMI GGDRFYNLSR WGVTILLIGI VQLIAGESLW SPTAAHSPMI 
VVMWGYVVFT TLATIMVFIP QIAAFLRFSF AVDILALLLL TLFNPDPREI LYPLFLLPLV
NAAFRLNSSA SLLTGLITAF AYIGAYLIAR IGPGNGSIPS DPLGYVGLTL RALALVFIPW
ITGGLAERWS ASNRLSVALA EEKAAEALRE ANAYRDQTRA LYEAAYTLAT TNDYQNVLEV
TLDESQKLAP YTCGMVLLST GQTNQLFVAS SRGLDPNDQE RIVTIGPGAI GRTIRGSDPV
VLDNFRDDPE LMAFTSLHRH RTACLLPLRF GLNTYGVMVF ASDHERAFGH EEVEKLAALA
SYALVALQNA QLIYDLQQER TKLLSKEEEV RHQLARDLHD GPAQALAAIT MNVEFIKRLL
ERDPARVLPE LDKLGQLAKR TTHDVRTMLF ELRPLALETQ GLDVTLQQYF ERFRDNPTKI
ILEADTIDAQ LDTKVEGTLF NIVQEAVNNA LKHAKAQHIW VRLRQTPTTL ETIIEDDGRG
FDVEKVRASY DQRGSFGLLN IEERASLMGG VAEVTSTPGK GTTWKVIVPL Q
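
The sequences above are printed in blocks of 10 characters, 60 per line. A minimal sketch of how the blocked text might be flattened before further analysis, for example to recompute a GC fraction like the one in the GC content field, is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "ATGGACGCAG CAGAGCGCAT CCCGCTCCCA TCGAAGATTT CTCTGCAACG CCAGATGATC"
seq = clean_sequence(first_line)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence text through `clean_sequence` and then `gc_fraction` would give a value to compare against the reported GC content.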