Gene EcSMS35_2367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2367 
SymbolrcsC 
ID6146456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2400624 
End bp2403473 
Gene Length2850 bp 
Protein Length949 aa 
Translation table11 
GC content51% 
IMG OID641617240 
Producthybrid sensory kinase in two-component regulatory system with RcsB and YojN 
Protein accessionYP_001744412 
Protein GI170679899 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000101959 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAATACC TTGCTTCTTT TCGTACAACC CTGAAAGCCT CGCGCTACAT GTTCAGAGCA 
TTGGCGTTAG TGCTCTGGCT GTTGATTGCT TTTTCATCCG TTTTTTACAT CGTTAATGCG
TTACATCAGC GAGAATCGGA AATTCGTCAG GAATTTAATC TGAGTTCCGA TCAGGCTCAG
CGCTTTATTC AACGCACCTC TGATGTGATG AAAGAGCTGA AGTACATCGC CGAAAATCGC
TTATCGGCAG AAAACGGTGT GCTTTCCCCG CGTGGACGAG AAACGCAGAC GGATGTGCCT
GCGTTTGAAC CGCTGTTTGC TGACTCTGAT TGTTCCGCAA TGAGTAACAC CTGGCGAGGT
TCTCTGGAGT CATTGGCGTG GTTTATGCGC TACTGGCGCG ATAATTTTTC TGCGGCTTAC
GATCTCAACC GGGTATTTTT AATCGGCAGC GATAACCTCT GCATGGCCAA TTTCGGTCTG
CGTGATATGC CAGTGGAACG CGATACCGCG CTAAAAGCTT TGCATGAACG CATTAATAAA
TATCGAAATG CACCACAAGA TGATAGTGGC AGTAACCTCT ACTGGATCAG CGAAGGTCCG
CGCCCTGGCG TCGGGTATTT TTACGCGTTG ACGCCAGTTT ATCTGGCGAA CCGGTTGCAG
GCGCTTTTGG GTGTCGAGCA GACCATCCGG ATGGAGAACT TTTTCTTGCC TGGTACGTTG
CCGATGGGGG TTACCATTCT TGATGAAAAT GGTCATACCC TGATTTCGCT TACCGGACCA
GAAAGCAAAA TTAAGGGCGA TCCTCGCTGG ATGCAGGAAC GCTCCTGGTT TGGCTATACG
GAAGGGTTCC GGGAGCTGGT GCTGAAGAAA AATCTGCCAC CCTCATCGCT AAGCATCGTG
TATTCGGTGC CGGTTGATAA GGTGCTGGAA CGCATTCGCA TGTTGATCCT TAACGCAATT
TTGCTGAATG TGCTTGCCGG AGCTGCATTG TTTACTCTCG CGCGGATGTA CGAGCGACGT
ATTTTCATTC CGGCGGAAAG CGACGCCCTG CGACTGGAAG AACATGAGCA GTTTAATCGC
AAGATTGTCG CCTCCGCGCC AGTGGGTATC TGCATTTTGC GTACCGCTGA TGGCGTCAAT
ATTTTAAGTA ACGAACTGGC GCATACCTAT CTCAATATGC TTACGCATGA GGATCGCCAA
CGGCTGACAC AAATTATCTG TGGGCAGCAG GTCAATTTTG TTGATGTCCT GACCAGCAAC
AATACCAATC TGCAAATCAG CTTCGTCCAT TCGCGCTATC GTAATGAAAA CGTGGCCATT
TGTGTGCTGG TGGATGTTTC TTCGCGCGTG AAGATGGAAG AGTCGTTGCA GGAGATGGCA
CAAGCAGCGG AACAGGCGAG TCAGTCAAAA TCGATGTTCC TTGCCACCGT CAGTCATGAG
CTGCGAACGC CGCTGTATGG CATTATCGGT AACCTGGATC TATTGCAAAC CAAAGAGTTG
CCGAAAGGCG TCGATCGTCT GGTGACGGCA ATGAACAACT CTTCCAGCCT GTTGTTGAAA
ATTATCAGCG ATATTCTCGA TTTCTCGAAG ATTGAATCAG AACAGTTGAA GATCGAACCG
CGTGAGTTTT CACCGCGTGA AGTGATGAAC CACATCACCG CCAACTATTT ACCGCTGGTG
GTACGCAAGC AGTTAGGCTT GTACTGCTTT ATTGAACCGG ATGTGCCAGT GGCCTTAAAT
GGCGACCCGA TGCGTTTACA GCAGGTCATC TCCAACCTGT TGAGTAACGC CATAAAATTC
ACCGATACCG GCTGTATAGT TTTGCAAGTT CGCGCTGATG GCGATTATCT CTCTATCCGT
GTTCGCGATA CCGGCGTGGG GATTCCGGCG AAAGAAGTCG TGCGCTTGTT TGATCCCTTC
TTCCAGGTCG GAACGGGCGT ACAGCGAAAC TTCCAGGGGA CCGGTCTGGG TCTGGCGATT
TGTGAAAAAC TGATCAGCAT GATGGACGGC GACATCTCGG TAGATTCAGA ACCGGGAATG
GGCAGCCAGT TTACCGTGCG TATTCCGTTG TACGGCGCTC AGTACCCGCA GAAAAAAGGC
GTGGAAGGGT TGAGTGGTAA ACGCTGCTGG CTGGCGGTCC GCAATGCGTC GCTCTGTCAG
TTCCTGGAAA CCAGTTTGCA GCGCAGCGGC ATCGTCGTTA CAACATACGA AGGGCAGGAA
CCGACTCCCG AAGATGTGTT AATTACTGAC GAGGTAGTGA GTAAAAAATG GCAGGGTAGA
GCGGTAGTGA CCTTCTGTCG TCGCCATATT GGTATTCCGC TGGAGAAAGC GCCAGGGGAG
TGGGTACACA GTGTGGCGGC CCCGCATGAG CTACCGGCAT TGTTGGCGCG TATTTATTTG
ATCGAGATGG AGAGCGATGA TCCTGCTAAC GCTCTGCCGT CGACGGACAA AGCGGTCAGC
GATAACGACG ATATGATGAT TCTGGTCGTG GATGATCATC CGATTAACCG GCGTCTTCTG
GCAGATCAGT TGGGATCGTT GGGCTATCAA TGTAAAACCG CGAATGATGG CGTCGATGCG
CTTAATGTAC TTAGCAAGAA TCATATTGAT ATCGTGCTTA GCGACGTCAA CATGCCAAAT
ATGGATGGTT ACCGCTTGAC GCAACGCATT CGTCAGTTGG GACTGACGTT GCCGGTAATC
GGAGTAACTG CTAATGCGTT GGCTGAAGAG AAGCAGCGGT GTCTGGAGTC CGGTATGGAC
AGCTGCCTGT CGAAGCCGGT AACGCTGGAT GTGATAAAAC AGACGCTGAC GGTATATGCC
GAGAGGGTCA GGAAATCGCG GGAATCGTAG
 
Protein sequence
MKYLASFRTT LKASRYMFRA LALVLWLLIA FSSVFYIVNA LHQRESEIRQ EFNLSSDQAQ 
RFIQRTSDVM KELKYIAENR LSAENGVLSP RGRETQTDVP AFEPLFADSD CSAMSNTWRG
SLESLAWFMR YWRDNFSAAY DLNRVFLIGS DNLCMANFGL RDMPVERDTA LKALHERINK
YRNAPQDDSG SNLYWISEGP RPGVGYFYAL TPVYLANRLQ ALLGVEQTIR MENFFLPGTL
PMGVTILDEN GHTLISLTGP ESKIKGDPRW MQERSWFGYT EGFRELVLKK NLPPSSLSIV
YSVPVDKVLE RIRMLILNAI LLNVLAGAAL FTLARMYERR IFIPAESDAL RLEEHEQFNR
KIVASAPVGI CILRTADGVN ILSNELAHTY LNMLTHEDRQ RLTQIICGQQ VNFVDVLTSN
NTNLQISFVH SRYRNENVAI CVLVDVSSRV KMEESLQEMA QAAEQASQSK SMFLATVSHE
LRTPLYGIIG NLDLLQTKEL PKGVDRLVTA MNNSSSLLLK IISDILDFSK IESEQLKIEP
REFSPREVMN HITANYLPLV VRKQLGLYCF IEPDVPVALN GDPMRLQQVI SNLLSNAIKF
TDTGCIVLQV RADGDYLSIR VRDTGVGIPA KEVVRLFDPF FQVGTGVQRN FQGTGLGLAI
CEKLISMMDG DISVDSEPGM GSQFTVRIPL YGAQYPQKKG VEGLSGKRCW LAVRNASLCQ
FLETSLQRSG IVVTTYEGQE PTPEDVLITD EVVSKKWQGR AVVTFCRRHI GIPLEKAPGE
WVHSVAAPHE LPALLARIYL IEMESDDPAN ALPSTDKAVS DNDDMMILVV DDHPINRRLL
ADQLGSLGYQ CKTANDGVDA LNVLSKNHID IVLSDVNMPN MDGYRLTQRI RQLGLTLPVI
GVTANALAEE KQRCLESGMD SCLSKPVTLD VIKQTLTVYA ERVRKSRES