Gene Rcas_3064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3064 
Symbol 
ID5540560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3966937 
End bp3968571 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content59% 
IMG OID640895183 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001433136 
Protein GI156743007 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.980645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCAC TGCTCGTCGA TCAGCGTCAG TTGCAAACCC TGAGTCAACT TATTTCGGCT 
GGGGGCCACC AGGATGTGCA TGAATTGCTT CAAGCGTCGC TGGAGCGATT GGTCGCCTTC
TGGCCCGCGC AGGGAGGTGC ATTGCTCTAC CATGCGCCCC ATGGCGAGGT CATTCGCCTG
CATCATGGGG TGATCGACAG CGAGGCTGGC GCACTGATCG CCGAGGCGCG CGAAACATTC
GCCCGCCGCG AAGAAGGGCG TGAACCAACC ATCGGCTACT ACACGCTCGA CGACGAACGC
AAATTGCTCG AACTTCCACT ATCGACCAGT TCACAGACCG TGGGTTTGCT CCATCTGGTC
GTGCTCGATG CCGAAACGGA GCGCCCGTCT CCCGCGCCCG ATGAGGACCT TGCCATGCTG
CTGGTGCGCG CCATTGGCGG AGAAGCCGAC AAACTGGCAC GCCTGCGACG CGCCGAGCAG
GACCTGCGCG AACTTCATCT GCTCTATCAG GTGGGGCAGG CGCTGGCGAG CAATCTCGAT
CTCTCCAGTC TGCTGAACGA TATTCGCAAC CAGGTTCCTC AGGCGATGGG CGCTGAGCGT
TGCTCGATCA TGCTGCTCGA TGAGCACACC CGTGAACTGA TCCTGGAACT GCCCGATCCG
CATACCGGTG AGCAGCGCGA GTTTCGCATC CCGCTTGATC GCGGCATTGC CGGTTGGGTG
GCCACCAATG GGATTGGCCA GATTGTCAAT GATGTCGAAC AAGACCCACG CTGGTTCGAT
GGAGTTGCGC GTGATGTCGA TTTTGAGACG CGCCAGATCC TCTGTGCGCC GATGCGTATC
GGCGATCGTG TCGTCGGCGT AATGCAGGTG CTCAACAAAC GTGATGGGAC GCCTTTCGAC
GACCAGGACC TGCGCCTTCT GACGACGCTG GCGACGCAGG CGGCGATTGC CGTCGAGAAT
GCGCGCCTGG TGCGCAGTCT GAAAGAGGAA CGTGACCGGT TGCTCGCCAA GGAAGCGGAG
GTACGCGCGG CGATTGCGCG CGACCTGCAC GATGGTCCGA CCCAGAGTAT TGCAGCGATT
GCGATGAATA TCGAGTTCAT CAAGAAATTG CTGCGTGCCA TGCCGGAACG AGTCGAAGGT
GAATTGGAAG TTCTCGCCGA ACTGGTGCAA AAAACCGCCT ACGATATTCG CACGCTCCTC
TTTGAACTGC GACCGCTCGG GCTGGAAACG CAAGGATTGC TATCGACGTT GCAGCAATAT
GTGGCCCGCT TCCGTGATCC TGCCGGTGTC ATGAAACTGC GCCTCGAGGC GCCTTCCAGC
ATTCCACGAT TGCCGGTCGA AGTGGAGGGC GCCATTTTCA TTATTATTCA GGAAGCAGTC
AATAATGCAC GCAAACATGC GCGCACCGAC GAAGTGGTGA TTTACCTGTA CCTGGAAGAA
GGGCAACTGG TTGCCAGTGT GCGGGATCGC GGACGCGGAT TCAACCTTGC TGCCGTTGAA
TCGAGTTACA ATACTCGCGG TTCGTTGGGT TTGCTGAATA TGCGCGAGCG CGCGCGCCTG
ATCGGCGGTG AGTGCCGCAT CCGCTCGGCG GAAGGCGAAG GTACGACGGT CGAATTGCGG
GTGCCGCTTG CCTGA
 
Protein sequence
MAPLLVDQRQ LQTLSQLISA GGHQDVHELL QASLERLVAF WPAQGGALLY HAPHGEVIRL 
HHGVIDSEAG ALIAEARETF ARREEGREPT IGYYTLDDER KLLELPLSTS SQTVGLLHLV
VLDAETERPS PAPDEDLAML LVRAIGGEAD KLARLRRAEQ DLRELHLLYQ VGQALASNLD
LSSLLNDIRN QVPQAMGAER CSIMLLDEHT RELILELPDP HTGEQREFRI PLDRGIAGWV
ATNGIGQIVN DVEQDPRWFD GVARDVDFET RQILCAPMRI GDRVVGVMQV LNKRDGTPFD
DQDLRLLTTL ATQAAIAVEN ARLVRSLKEE RDRLLAKEAE VRAAIARDLH DGPTQSIAAI
AMNIEFIKKL LRAMPERVEG ELEVLAELVQ KTAYDIRTLL FELRPLGLET QGLLSTLQQY
VARFRDPAGV MKLRLEAPSS IPRLPVEVEG AIFIIIQEAV NNARKHARTD EVVIYLYLEE
GQLVASVRDR GRGFNLAAVE SSYNTRGSLG LLNMRERARL IGGECRIRSA EGEGTTVELR
VPLA