Gene EcSMS35_2365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2365 
SymbolyojN 
ID6145116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2397199 
End bp2399871 
Gene Length2673 bp 
Protein Length890 aa 
Translation table11 
GC content50% 
IMG OID641617238 
Productphosphotransfer intermediate protein in two-component regulatory system with RcsBC 
Protein accessionYP_001744410 
Protein GI170683966 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00195405 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAGA AAGAGACAAC GGCCACGACC CGCTTTTCAC TCCTACCGGG GAGCATTACC 
CGCTTCTTTT TACTGTTGAT CATTGTGTTA CTGGTGACGA TGGGAGTAAT GGTACAAAGC
GCTGTTAACG CCTGGCTGAA AGATAAAAGT TACCAGATTG TCGACATTAC CCACGCTATC
CAAAAGCGCG TCGATACCTG GCGTTACGTG ACCTGGCAGA TCTACGACAA CATTGCCGCG
ACGACCTCCC CCTCCTCCGG CGAAGGTTTA CAAGAGACGC GCCTGAAACA GGATGTCTAC
TATCTGGAGA AACCACGCCG CAAAACGGAA GCGTTAATCT TTGGCTCTCA CGACAACTCA
ACGCTTGAGA TGACTCAACG GATGTCCACC TATCTGGACA CATTGTGGGG CGCAGAAAAT
GTACCGTGGT CGATGTATTA CCTGAATGGT CAGGATAACA GTCTGGTGCT GATCTCAACC
CTGCCCCTCA AAGATCTCAC CTCCGGATTT AAAGAATCGA CCGTCAGCGA CATTGTTGAT
TCACGTCGTG CAGAGATGTT GCAACAGGCC AACGCCCTCG ATGAACGCGA AAGCTTTTCT
AACATGCGCC GCCTGGCCTG GCAGAACGGT CATTACTTTA CCTTACGTAC CACCTTTAAC
CAGCCGGGAC ATCTGGCAAC GGTTGTGGCT TTTGATCTGC CGATTAATGA TTTGATCCCA
CCGGGCATGC CGCTGGACAG TTTCCGCCTT GAGCCAGACG CGACGGCAAC GGGAGACAAT
GATAATGAGA AAGAAGGGAC GGATAGCGTC AGTATCCACT TTAACAGTAC GAAGATTGAA
ATCTCCTCGG CACTCAACTC TACCGATATG CGACTGGTCT GGCAGGTTCC TTATGGCACC
TTATTGCTGG ATACGTTGCA AAACATTCTG CTGCCACTGC TGCTGAACAT CGGTTTGCTG
GCGCTGGCGT TATTTGGCTA TACCACATTC CGCCATTTCT CCAGCCGCAG TACAGAAAGC
GTCCCCAGCA CGGCGGTCAA TAACGAATTG CGCATTTTAC GGGCAATTAA TGAAGAGATA
GTCTCACTGC TGCCGCTCGG CCTGCTGGTT CACGATCAGG AATCGAACCG CACAGTCATA
AGTAACAAAA TTGCCGATCA TTTGCTGCCG CATTTGAATC TGCAAAACAT CACCACCATG
GCGGAACAGC ATCAGGGGAT TATTCAGGCG ACGATCAATA ACGAGCTGTA TGAGATCCGC
ATGTTCCGCA GCCAGGTTGC GCCGCGCACA CAAATTTTCA TTATTCGCGA TCAGGATCGC
GAAGTGCTGG TAAACAAGAA ACTCAAGCAG GCGCAGCGTC TGTATGAGAA AAACCAGCAG
GGGCGGATGA CCTTTATGAA AAACATTGGC GATGCGCTGA AAGAACCCGC ACAGTCCCTG
GCGGAGAGCG CGGCTAAACT CAACGCCCCG GAAGGCAAAC AACTGACGAA TCAGGCGGAT
GTGCTGGTGC GGCTGGTCGA TGAAATACAG TTAGCGAACA TGCTTGCGGA CGATAGCTGG
AAAAGTGAGA CGGTGCTGTT CTCCGTGCAG GATTTAATTG ATGAAGTTGT GCCTTCAGTG
TTGCCTGCCA TCAAGCGTAA AGGTCTGCAA CTGCTGATTA ACAATCATCT GAAAGCACAC
GATATGCGCC GCGGCGATCG CGATGCCTTA CGACGTATTT TGCTGCTACT GATGCAATAT
GCCGTGACCT CAACGCAATT GGGAAAAATC ACCCTTGAGG TTGATCAGGA TGAGTCCTCC
GAAGACCGCC TGACGTTCCG CATTCTGGAC ACGGGAGAAG GCGTAAGTAT TCATGAAATG
GATAATTTGC ACTTCCCGTT TATCAACCAG ACCCAAAACG ATCGCTATGG CAAGGCGGAC
CCGCTGGCAT TCTGGCTGAG CGATCAACTG GCGCGTAAAC TGGGCGGTCA TTTAAACATC
AAAACGCGGG ATGGGCTTGG TACACGCTAC TCTGTGCATA TCAAAATGCT CGCAGCTGAC
CCGAAAGTTG AAGAGGAAGA AGAGCGATTA CTGGATGATG TCTGCGTAAT GGTGGATGTT
ACTTCGGCAG AAATTCGGAA TATTGTCACT CGCCAGTTAG AAAATTGGGG TGCAACCTGT
ATCACACCCG ATGAAAGATT AATTAGTCAA GATTATGATA TCTTTTTAAC GGATAATCCG
TCTAATCTTA CTGCCTCTGG CTTGCTTTTA AGCGATGATG AGTCTGGCGT ACGGGAAATT
GGGCCTGGTC AATTGTGCGT CAACTTCAAT ATGAGCAACG CTATGCAGGA AGCGGTCTTA
CAATTAATTG AAGTGCAACT GGCGCAGGAA GAGGTGACAG AATCGCCTCT GGGCGGAGAT
GAAAATGCGC AACTCCATGC CAGCGGCTAT TATGCGCTCT TTGTAGACAC AGTACCGGAT
GATGTTAAGA GGCTGTATAC TGAAGCAGCA ACCAGTGACT TTGCTGCGTT AGCCCAAACG
GCTCATCGTC TTAAAGGCGT ATTTGCCATG CTAAATCTGG TACCCGGCAA GCAGTTATGT
GAAACGCTGG AACATCTGAT TCGTGAGAAG GATGTTCCAG GAATAGAAAA ATACATCAGC
GACATTGACA GTTATGTCAA GAGCTTGCTG TAG
 
Protein sequence
MRQKETTATT RFSLLPGSIT RFFLLLIIVL LVTMGVMVQS AVNAWLKDKS YQIVDITHAI 
QKRVDTWRYV TWQIYDNIAA TTSPSSGEGL QETRLKQDVY YLEKPRRKTE ALIFGSHDNS
TLEMTQRMST YLDTLWGAEN VPWSMYYLNG QDNSLVLIST LPLKDLTSGF KESTVSDIVD
SRRAEMLQQA NALDERESFS NMRRLAWQNG HYFTLRTTFN QPGHLATVVA FDLPINDLIP
PGMPLDSFRL EPDATATGDN DNEKEGTDSV SIHFNSTKIE ISSALNSTDM RLVWQVPYGT
LLLDTLQNIL LPLLLNIGLL ALALFGYTTF RHFSSRSTES VPSTAVNNEL RILRAINEEI
VSLLPLGLLV HDQESNRTVI SNKIADHLLP HLNLQNITTM AEQHQGIIQA TINNELYEIR
MFRSQVAPRT QIFIIRDQDR EVLVNKKLKQ AQRLYEKNQQ GRMTFMKNIG DALKEPAQSL
AESAAKLNAP EGKQLTNQAD VLVRLVDEIQ LANMLADDSW KSETVLFSVQ DLIDEVVPSV
LPAIKRKGLQ LLINNHLKAH DMRRGDRDAL RRILLLLMQY AVTSTQLGKI TLEVDQDESS
EDRLTFRILD TGEGVSIHEM DNLHFPFINQ TQNDRYGKAD PLAFWLSDQL ARKLGGHLNI
KTRDGLGTRY SVHIKMLAAD PKVEEEEERL LDDVCVMVDV TSAEIRNIVT RQLENWGATC
ITPDERLISQ DYDIFLTDNP SNLTASGLLL SDDESGVREI GPGQLCVNFN MSNAMQEAVL
QLIEVQLAQE EVTESPLGGD ENAQLHASGY YALFVDTVPD DVKRLYTEAA TSDFAALAQT
AHRLKGVFAM LNLVPGKQLC ETLEHLIREK DVPGIEKYIS DIDSYVKSLL