Gene ECH74115_3358 details


Gene Information

Locus tag: ECH74115_3358
Symbol: rcsC
ID: 6970850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3091376
End bp: 3094225
Gene Length: 2850 bp
Protein Length: 949 aa
Translation table: 11
GC content: 51%
IMG OID: 643387169
Product: hybrid sensory kinase in two-component regulatory system with RcsB and YojN
Protein accession: YP_002271632
Protein GI: 209396281
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
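
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 3091376
end_bp = 3094225

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "2850 949"
```

Under those assumptions the computed values agree with the listed Gene Length (2850 bp) and Protein Length (949 aa).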


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00504503
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAATACC TTGCTTCTTT TCGTACAACC CTGAAAGCCT CGCGCTACAT GTTCAGAGCA 
TTGGCGTTAG TGCTCTGGCT GTTGATTGCT TTTTCATCCG TTTTTTACAT CGTTAATGCG
TTACATCAGC GAGAATCGGA AATTCGTCAG GAATTTAATC TGAGTTCCGA TCAGGCTCAG
CGCTTTATTC AACGCACCTC TGATGTGATG AAAGAGCTGA AGTACATCGC CGAAAATCGC
TTATCGGCAG AAAACGGTGT GCTTTCCCCG CGTGGACGAG AAACGCAGGC GGATGTGCCT
GCGTTTGAAC CGCTGTTTGC CGACTCCGAT TGTTCAGCAA TGAGTAACAC CTGGCGAGGT
TCTCTGGAGT CATTGGCGTG GTTTATGCGC TACTGGCGCG ATAATTTTTC TGCGGCTTAC
GATCTCAACC GGGTATTTTT AATCGGCAGC GATAACCTCT GCATGGCCAA TTTCGGTCTG
CGTGATATGC CAGTGGAACG CGATACCGCG TTGAAAGCTT TGCATGAACG CATCAATAAA
TATCGAAATG CACCACAAGA TGATAGCGGC AGTAACCTCT ACTGGATCAG CGAAGGTCCG
CGCCCTGGCG TCGGGTATTT TTACGCGTTG ACGCCAGTTT ATCTGGCGAA CCGGTTGCAG
GCGCTTTTGG GTGTCGAGCA GACCATCCGG ATGGAGAACT TTTTCTTACC GGGTACGTTG
CCGATGGGGG TTACCATTCT TGATGAAAAT GGTCATACCC TGATTTCGCT TACCGGGCCA
GAAAGCAAAA TTAAGGCCGA TCCTCGCTGG ATGCAGGAAC GCTCCTGGTT TGGCTATACG
GAAGGGTTCC GGGAGCTGGT GCTGAAGAAA AATCTGCCAC CCTCATCGCT CAGCATCGTG
TATTCGGTGC CGGTTGATAA GGTGCTGGAA CGCATTCGCA TGTTGATCCT TAACGCAATT
TTGCTGAATG TGCTTGCCGG AGCTGCATTG TTTACTCTCG CGCGGATGTA CGAGCGACGT
ATTTTCATTC CGGCGGAAAG CGACGCCCTG CGACTGGAAG AACATGAGCA GTTCAATCGC
AAGATTGTCG CCTCCGCGCC AGTGGGTATC TGCATTTTGC GTACCGCTGA TGGCGTCAAT
ATTTTAAGTA ACGAACTGGC GCATACCTAT CTCAATATGC TTACGCATGA GGACCGCCAA
CGACTGACAC AAATTATCTG TGGGCAGCAG GTCAATTTTG TTGATGTCCT GACCAGCAAC
AATACCAATC TGCAAATCAG CTTCGTCCAT TCGCGCTATC GTAATGAAAA CGTGGCCATT
TGTGTGCTGG TGGATGTTTC TTCGCGCGTG AAGATGGAAG AGTCGTTGCA GGAGATGGCA
CAAGCAGCGG AACAGGCGAG CCAGTCAAAA TCGATGTTCC TTGCCACCGT CAGTCATGAG
CTGCGAACGC CGCTGTATGG CATTATCGGT AACCTGGATC TGTTGCAAAC CAAAGAGTTA
CCGAAAGGCG TCGATCGTCT GGTGACGGCA ATGAACAACT CTTCCAGCCT GTTGTTGAAA
ATTATCAGCG ATATTCTCGA TTTCTCGAAG ATTGAATCGG AACAGTTGAT GATCGAACCG
CGTGAGTTTT CACCGCGTGA AGTGATGAAC CACATCACCG CCAACTATTT ACCGCTGGTG
GTACGCAAGC AGTTAGGCTT GTACTGCTTT ATTGAACCGG ATGTGCCAGT GGCCTTAAAT
GGCGACCCGA TGCGTTTACA GCAGGTCATC TCCAACCTGT TGAGTAACGC CATAAAATTC
ACCGATACCG GCTGTATAGT TTTGCATGTC CGCGCTGATG GCGATTATCT CTCTATCCGT
GTTCGCGATA CCGGCGTGGG GATTCCGGCG AAAGAAGTGG TGCGCTTGTT TGATCCCTTC
TTCCAGGTCG GAACGGGCGT ACAGCGAAAC TTCCAGGGGA CCGGTCTGGG TCTGGCGATT
TGTGAAAAAC TGATCAGCAT GATGGACGGC GATATCTCGG TAGATTCAGA ACCGGGAATG
GGCAGCCAGT TTACCGTGCG TATTCCGTTG TACGGCGCTC AGTACCCGCA GAAAAAAGGC
GTGGAAGGGT TGAGTGGTAA ACGCTGCTGG CTGGCGGTCC GCAATGCGTC GCTCTGTCAG
TTCCTGGAAA CCAGTTTGCA GCGCAGCGGC ATCGTCGTAA CAACATACGA AGGGCAGGAA
CCGACTCCCG AAGATGTGTT AATCACTGAC GAGGTAGTGA GTAAAAAATG GCAGGGCAGA
GCGGTAGTGA CCTTCTGTCG TCGACATATT GGTATTCCGC TGGAGAAAGC GCCAGGGGAG
TGGGTACACA GTGTGGCTGC TCCGCATGAG CTACCGGCAT TGTTGGCGCG TATTTATTTG
ATCGAGATGG AGAGCGACGA TCCTGCTAAC GCTCTGCCGT CGACGGACAA AGCGGTCAGC
GATAATGACG ATATGATGAT TCTGGTCGTG GATGATCATC CGATTAACCG GCGTTTGCTG
GCAGATCAGT TGGGATCGTT GGGCTATCAA TGTAAAACCG CGAATGATGG CGTCGATGCG
CTTAATGTAC TTAGCAAGAA TCATATTGAT ATCGTGCTTA GCGACGTCAA CATGCCAAAT
ATGGATGGTT ACCGCTTGAC ACAACGCATT CGTCAGTTGG GACTGACGTT GCCGGTAATC
GGAGTAACGG CTAATGCGTT GGCTGAAGAG AAGCAGCGGT GTCTGGAGTC CGGTATGGAC
AGCTGCCTGT CGAAGCCAGT AACGCTGGAT GTTATAAAAC AGACGCTGAC GGTATATGCC
GAGAGGGTCA GGAAATCGCG GGAATCGTAG
 
Protein sequence
MKYLASFRTT LKASRYMFRA LALVLWLLIA FSSVFYIVNA LHQRESEIRQ EFNLSSDQAQ 
RFIQRTSDVM KELKYIAENR LSAENGVLSP RGRETQADVP AFEPLFADSD CSAMSNTWRG
SLESLAWFMR YWRDNFSAAY DLNRVFLIGS DNLCMANFGL RDMPVERDTA LKALHERINK
YRNAPQDDSG SNLYWISEGP RPGVGYFYAL TPVYLANRLQ ALLGVEQTIR MENFFLPGTL
PMGVTILDEN GHTLISLTGP ESKIKADPRW MQERSWFGYT EGFRELVLKK NLPPSSLSIV
YSVPVDKVLE RIRMLILNAI LLNVLAGAAL FTLARMYERR IFIPAESDAL RLEEHEQFNR
KIVASAPVGI CILRTADGVN ILSNELAHTY LNMLTHEDRQ RLTQIICGQQ VNFVDVLTSN
NTNLQISFVH SRYRNENVAI CVLVDVSSRV KMEESLQEMA QAAEQASQSK SMFLATVSHE
LRTPLYGIIG NLDLLQTKEL PKGVDRLVTA MNNSSSLLLK IISDILDFSK IESEQLMIEP
REFSPREVMN HITANYLPLV VRKQLGLYCF IEPDVPVALN GDPMRLQQVI SNLLSNAIKF
TDTGCIVLHV RADGDYLSIR VRDTGVGIPA KEVVRLFDPF FQVGTGVQRN FQGTGLGLAI
CEKLISMMDG DISVDSEPGM GSQFTVRIPL YGAQYPQKKG VEGLSGKRCW LAVRNASLCQ
FLETSLQRSG IVVTTYEGQE PTPEDVLITD EVVSKKWQGR AVVTFCRRHI GIPLEKAPGE
WVHSVAAPHE LPALLARIYL IEMESDDPAN ALPSTDKAVS DNDDMMILVV DDHPINRRLL
ADQLGSLGYQ CKTANDGVDA LNVLSKNHID IVLSDVNMPN MDGYRLTQRI RQLGLTLPVI
GVTANALAEE KQRCLESGMD SCLSKPVTLD VIKQTLTVYA ERVRKSRES
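
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiator codon and is then read as methionine. The sketch below is a minimal illustration, not the database's own procedure: it builds the codon table by hand, the set of initiator codons reflects my understanding of NCBI table 11, and `translate`, `first_line`, and `last_line` are hypothetical names. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 shares
# these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a DNA fragment, reading an initiator first codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                    # initiator codon is read as Met
        if aa == "*":
            break                       # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "TTGAAATACC TTGCTTCTTT TCGTACAACC CTGAAAGCCT CGCGCTACAT GTTCAGAGCA"
last_line = "GAGAGGGTCA GGAAATCGCG GGAATCGTAG"

print(translate(first_line))  # prints "MKYLASFRTTLKASRYMFRA"
print(translate(last_line))   # prints "ERVRKSRES"
```

The two outputs match the first 20 and last 9 residues of the protein sequence listed above. Translating `last_line` on its own works here only because the preceding 47 lines of 60 bases keep it in reading frame.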