Gene Rcas_4206 details

Gene Information

Locus tag: Rcas_4206
Symbol:
ID: 5541717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5439744
End bp: 5441969
Gene Length: 2226 bp
Protein Length: 741 aa
Translation table: 11
GC content: 61%
IMG OID: 640896313
Product: CheA signal transduction histidine kinase
Protein accession: YP_001434251
Protein GI: 156744122
COG category:
    [K] Transcription
    [N] Cell motility
    [T] Signal transduction mechanisms
COG ID:
    [COG0643] Chemotaxis protein histidine kinase and related kinases
    [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
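
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(5439744, 5441969)
print(length, protein_length(length))  # prints "2226 741"
```

Under these assumptions the result agrees with the Gene Length (2226 bp) and Protein Length (741 aa) fields of this record.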


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.351121
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.245941
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGG CGTCGTTCTA CAGCCAATTT CGTGATGAAA CGGCGGAGAA CATCCGCATT 
GTCACCGAAG GTCTTGTGGC GCTCGAAGGC AACGGACTGG AGGGCGAGGC GCGCCGTGAG
CGGATCGACG CCATCTTTCG CGCCATGCAC ACGATCAAGG GGTCGGCGCG AATGCTCGGT
CTGGATCAGA TCGGCAAAGT TGCGCACACC TGTGAGCACA TCCTGGCAGC AGTGCGCGAC
GGTCGGCGCG TTCTTGATCG CTTTCTCACC GATGAATTAC TGAAGGGCAG CGATGCGATT
CTGGAGTTGC TGGCAGCCGC TATCGATGGC AAGCCGTCGT CGATTGACGT TGAAGCGCTG
ACGAGCCATC TTGGACGAGG ATCGCCACAG CCTTCGCCTG AAACGCCGGT GTCACCCCCG
CTCGAACCGG CGCAATCGCC CGGTGATGAA AGCGCGCCAC AACCCACGTC GAAGGGTGGG
CGCGAACGTA TGCGACAGAC GGTGCGCGTG CGGGTGGATC GCCTGGATCG GTTACTGAAT
CTGGCAGGCG AATTGTCAAT CGGTCGGCAG ATTGAGGAAT CGCACCTTCA GGCGTTAGAG
GAGTTGAAGG GGTTGGTTGA ACGCCAACAA CGGTCGTTGC TCACCCTCGA AGCCGAATTG
CGCCGTGTGC GCCTTGCCCC TGCCCAGCGC GACCTGTTCG ACCGCGAAAT GAACGGTGCG
CTGAACGCAG GCGAGCGCGC CGGTTCCATG CTCAGAGCTC AACTCGAACG CATCGGACAA
CATACGATGC ACAAAGCGCA GTTGATCGAC GACCTCGAAC AAGAGGTGAT GGCGATTCGC
CTGACGCCGG TCGCAACGCT GTACGCCAAT CTGCCGCGCG CCGTGCGCGA ACTGGCGCGC
GATCTTGGCA AGGAAGCATC GCTGGCGCTG ATCGGCGAAA CGACCGAACT GGATCGGAAG
ATCATCGAGG TGCTCACCGA TCCGATGGTG CACTTAATTC GGAATGCGCT CGATCACGGC
ATCGAGCCGG CAGAGGAACG TGAGCGCCAG GGTAAGCCGC GCCAGGGATT GATCGAGATC
GCAGCACAGG CGCATGGCGG ACGAGTGTTG ATCAGCGTGC GCGACGATGG GCGCGGCATG
GATCCACAAC AGTTGCGCGA GGCGGCAGTG CGCAAAGGAT TGATCGGCGC TGATGCCGCA
GCAGCGCTCT CAGATCAGGA GGCGCTGGAA TTGATCTTCA TGCCGGGCTT TACGACGGCA
AAAATCATTA CCGACGTATC AGGACGAGGC GTCGGGATGG ATGTGGTTCG CACCAATCTT
GCCGAGATTG GCGGGGAAGT GCAGATTGAG TCGCAGCCAG GCGTCGGTAC GACGGTCCTC
CTCTCGTTGC CATTGACCCT GGTTACCACG CGGGTGCTGT TGGTTGAGGC TGGCAGTCAA
CTGTTCGGCA TTCCGGCGTC GGGGTGCCAG GGAACGGTAT GGGTGCGCCG CACTCAGATC
CGCACTATCG AGGGTCGGGC GGTTTTTCAG CACAATCAGG TGTTAACGCC GATCCTGCGG
TTGGATGAAT TGTTGGGTAT TGCGAATGGG AATCCGTTTG CCAATGCAGT GCGCATGCCG
GCGTTGCTGA TCGGTGGTGT GCGCCGCCCT ATGGCGTTGC TGATCGATCG CCTGATTGAT
GAACGCGAGG CGGTGATCAA GCCGCTTGGA CCCCTCCTGG AAAAACAGCG GCGGTATAGC
GGCGTATTGC AACTCGGCGA TGGAAGGCTG GCGCTGCTGC TCAATCCCAC GATGCTGGCG
CAATTGGGAC GCGGCACGGC GCTTGTTGCG CCAACGCCGG AACAGAGTCA GCAGCGCCGC
GCGCGATTAC TCGTGGTGGA CGACTCGTTC GCTACTCGCG AGTTGATCCG CAGCATCCTG
AGCGCCGCCG GGTACGAGGT GGCGACTGCC GTCGATGGGC TTGATGCGCT TGACAGAATC
CGCGCCGAAA CCTACGATCT CGTCGTCAGC GACGTCGAAA TGCCGCGCGT GGACGGTTTC
ACTCTGACCA GCCGCATCCG CAGCGAGTTG GGCAAAACCG ATCTGCCGGT CATTATCGTC
ACCAGTCTGG CATCGGAGGC GCATCGCCGG CGTGGGCTGG AAGTCGGTGC GCAGGCGTAT
ATTGTCAAAA GTCAGTTCAA TCAGAACAAC CTGCTGGAGA CGATCCGGCA GTTGATCGGC
ATGTGA
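
The gene sequence above is displayed in blocks of 10 bases, 60 per row, so it has to be stripped of whitespace before any computation. The following is a minimal sketch of that clean-up together with a GC fraction calculation. The helper names are hypothetical, and the method used to derive the GC content field of this record is not stated, so this is only one plausible way to compute it.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed row of the gene sequence, copied from this record.
first_row = "ATGGATCTGG CGTCGTTCTA CAGCCAATTT CGTGATGAAA CGGCGGAGAA CATCCGCATT"
seq = clean_sequence(first_row)
print(len(seq))  # prints 60
print(f"{gc_content(seq):.0%}")
```

Applying the same two helpers to the full gene sequence would give a value to compare with the GC content field listed under Gene Information.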
 
Protein sequence
MDLASFYSQF RDETAENIRI VTEGLVALEG NGLEGEARRE RIDAIFRAMH TIKGSARMLG 
LDQIGKVAHT CEHILAAVRD GRRVLDRFLT DELLKGSDAI LELLAAAIDG KPSSIDVEAL
TSHLGRGSPQ PSPETPVSPP LEPAQSPGDE SAPQPTSKGG RERMRQTVRV RVDRLDRLLN
LAGELSIGRQ IEESHLQALE ELKGLVERQQ RSLLTLEAEL RRVRLAPAQR DLFDREMNGA
LNAGERAGSM LRAQLERIGQ HTMHKAQLID DLEQEVMAIR LTPVATLYAN LPRAVRELAR
DLGKEASLAL IGETTELDRK IIEVLTDPMV HLIRNALDHG IEPAEERERQ GKPRQGLIEI
AAQAHGGRVL ISVRDDGRGM DPQQLREAAV RKGLIGADAA AALSDQEALE LIFMPGFTTA
KIITDVSGRG VGMDVVRTNL AEIGGEVQIE SQPGVGTTVL LSLPLTLVTT RVLLVEAGSQ
LFGIPASGCQ GTVWVRRTQI RTIEGRAVFQ HNQVLTPILR LDELLGIANG NPFANAVRMP
ALLIGGVRRP MALLIDRLID EREAVIKPLG PLLEKQRRYS GVLQLGDGRL ALLLNPTMLA
QLGRGTALVA PTPEQSQQRR ARLLVVDDSF ATRELIRSIL SAAGYEVATA VDGLDALDRI
RAETYDLVVS DVEMPRVDGF TLTSRIRSEL GKTDLPVIIV TSLASEAHRR RGLEVGAQAY
IVKSQFNQNN LLETIRQLIG M
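
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is an illustration, not the pipeline used to build this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in permitted start codons), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codons enumerated in TCAG order, matching the NCBI genetic code layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed row of the gene sequence, copied from this record.
first_row = "ATGGATCTGG CGTCGTTCTA CAGCCAATTT CGTGATGAAA CGGCGGAGAA CATCCGCATT"
print(translate(first_row))  # prints MDLASFYSQFRDETAENIRI
```

The printed residues match the first 20 residues of the protein sequence shown above, and the final row of the gene sequence (ATGTGA) translates to M followed by a stop, matching the last residue.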