Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3356 |
Symbol | yojN |
ID | 6970967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3087837 |
End bp | 3090509 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387167 |
Product | phosphotransfer intermediate protein in two-component regulatory system with RcsBC |
Protein accession | YP_002271630 |
Protein GI | 209400425 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000212804 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCAGA AAGAGACAAC GGCCACGACC CGCTTTTCAC TCCTACCGGG GAGCATTACC CGCTTCTTTT TACTGTTGAT CATTGTGTTA CTGGTGACGA TGGGTGTAAT GGTACAAAGC GCCGTTAACG CCTGGCTGAA AGATAAAAGT TACCAAATTG TCGACATTAC CCACGCTATC CAAAAGCGCG TCGATACCTG GCGTTACGTG ACCTGGCAGA TCTACGACAA CATTGCCGCG ACGACCTCCC CCTCCTCCGG CGAAGGTTTA CAAGAGACGC GCCTGAAACA GGATGTCTAC TATCTGGAAA AACCGCGCCG CAAAACGGAA GCGTTAATCT TTGGCTCTCA CGACAACTCA ACGCTTGAGA TGACTCAACG GATGTCCACC TATCTGGACA CATTGTGGGG CGCAGAAAAT GTACCGTGGT CAATGTATTA CCTGAATGGT CAGGATAACA GTCTGGTGCT GATCTCAACC CTGCCCCTCA AAGATCTCAC CTCCGGATTT AAAGAATCGA CCGTCAGTGA CATTGTTGAT TCACGTCGTG CAGAGATGTT GCAACAGGCC AACGCCCTCG ATGAACGCGA AAGTTTTTCT AACATGCGCC GCCTGGCCTG GCAGAACGGT CATTACTTTA CCTTGCGTAC CACGTTCAAC CAGCCGGGAC ATCTGGCAAC GGTCGTGGCT TTTGATCTGC CGATTAATGA TTTGATCCCA CCGGGTATGC CGCTGGACAG TTTCCGCCTT GAGCCAGACG CGACGGCAAC GGGAAACAAT GATAATGAGA AAGAAGGAAC GGATAGCGTC AGTATCCACT TTAACAGTAC GAAGATTGAA ATCTCCTCGG CACTCAACTC TACCGATATG CGCCTGGTCT GGCAGGTTCC TTATGGCACC TTATTGCTGG ATACGTTGCA AAACATTCTG CTGCCACTGC TGCTGAACAT CGGTTTGCTG GCGCTGGCGT TATTTGGCTA TACCACATTC CGCCATTTCT CCAGTCGCAG TACAGAAAGT CTACCCAACA CGGCGGTCAA TAACGAATTG CGCATTTTAC GGGCAATCAA TGAAGAGATA GTCTCACTGC TGCCGCTCGG CCTGCTGGTT CACGATCAGG AATCGAACCG CACTGTCATA AGTAACAAAA TTGCCGATCA TTTGCTGCCG CATTTGAATC TGCAAAACAT CACCACCATG GCGGAACAGC ATCAGGGGAT TATTCAGGCG ACGATCAATA ACGAGCTGTA TGAGATCCGC ATGTTCCGCA GCCAGGTTGC GCCGCGCACA CAAATTTTCA TTATTCGCGA TCAGGATCGC GAAGTGCTGG TAAACAAGAA ACTCAAGCAG GCGCAGCGTC TGTATGAGAA AAACCAGCAG GGGCGGATGA CCTTTATGAA AAACATTGGC GATGCGCTGA AAGAACCCGC ACAGTCCCTG GCGGAGAGCG CGGCTAAACT CAACGCCCCG GAAAGCAAAC AACTGGCGAA TCAGGCAGAT GTGCTGGTGC GATTGGTCGA TGAAATACAG TTAGCGAACA TGCTTGCGGA CGATAGCTGG AAAAGTGAGA CGGTGCTGTT CTCCGTGCAG GATTTAATTG ATGAAGTTGT GCCTTCAGTG TTGCCTGCCA TCAAGCGTAA AGGTCTGCAA CTGCTGATTA ACAATCATCT GAAAGCACAC GATATGCGCC GCGGCGATCG CGATGCATTA CGACGTATTT TGCTGCTACT GATGCAATAT GCCGTGACCT CAACGCAATT GGGAAAAATC ACCCTTGAGG TTGATCAGGA TGAGTCCTCC GAAGACCGCC TGACGTTCCG CATTCTGGAC ACCGGAGAAG GCGTAAGCAT TCATGAAATG GATAATTTGC ACTTCCCGTT TATCAACCAG ACCCAAAACG ATCGCTATGG CAAGGCGGAC CCGCTGGCAT TCTGGCTGAG CGATCAACTG GCACGTAAAC TGGGCGGTCA TTTAAACATC AAAACGCGGG ATGGGCTTGG TACACGCTAC TCTGTGCATA TCAAAATGCT CGCAGCTGAC CCGGAAGTTG AAGAGGAAGA AGAGCGTTTA CTGGATGATG TCTGCGTAAT GGTGGATGTT ACTTCGGCAG AAATTCGGAA TATTGTCACT CGCCAGTTAG AAAATTGGGG TGCAACCTGT ATCACACCCG ATGAAAGATT AATTAGTCAA GATTATGATA TCTTTTTAAC GGATAATCCG TCTAATCTTA CTGCCTCTGG CTTGCTTTTA AGCGATGATG AGTCTGGCGT ACGGGAAATT GGGCCTGGTC AATTGTGCGT CAACTTCAAT ATGAGCAACG CTATGCAGGA AGCGGTCTTA CAATTAATTG AAGTGCAACT GGCGCAGGAA GAGGTGACAG AATCGCCTCT GGGCGGAGAT GAAAATGCGC AACTCCATGC CAGCGGCTAT TATGCGCTCT TTGTAGACAC AGTACCGGAT GATGTTAAGA GGCTGTATAC TGAAGCAGCA ACCAGTGACT TTGCTGCGTT AGCCCAAACG GCTCATCGTC TTAAAGGCGT ATTTGCCATG CTAAATCTGG TACCCGGCAA GCAGTTATGT GAAACGCTGG AACATCTGAT TCGTGAGAAG GATGTTCCAG GAATAGAAAA ATACATCAGC GACATTGACA GTTATGTCAA GAGCTTGCTG TAG
|
Protein sequence | MRQKETTATT RFSLLPGSIT RFFLLLIIVL LVTMGVMVQS AVNAWLKDKS YQIVDITHAI QKRVDTWRYV TWQIYDNIAA TTSPSSGEGL QETRLKQDVY YLEKPRRKTE ALIFGSHDNS TLEMTQRMST YLDTLWGAEN VPWSMYYLNG QDNSLVLIST LPLKDLTSGF KESTVSDIVD SRRAEMLQQA NALDERESFS NMRRLAWQNG HYFTLRTTFN QPGHLATVVA FDLPINDLIP PGMPLDSFRL EPDATATGNN DNEKEGTDSV SIHFNSTKIE ISSALNSTDM RLVWQVPYGT LLLDTLQNIL LPLLLNIGLL ALALFGYTTF RHFSSRSTES LPNTAVNNEL RILRAINEEI VSLLPLGLLV HDQESNRTVI SNKIADHLLP HLNLQNITTM AEQHQGIIQA TINNELYEIR MFRSQVAPRT QIFIIRDQDR EVLVNKKLKQ AQRLYEKNQQ GRMTFMKNIG DALKEPAQSL AESAAKLNAP ESKQLANQAD VLVRLVDEIQ LANMLADDSW KSETVLFSVQ DLIDEVVPSV LPAIKRKGLQ LLINNHLKAH DMRRGDRDAL RRILLLLMQY AVTSTQLGKI TLEVDQDESS EDRLTFRILD TGEGVSIHEM DNLHFPFINQ TQNDRYGKAD PLAFWLSDQL ARKLGGHLNI KTRDGLGTRY SVHIKMLAAD PEVEEEEERL LDDVCVMVDV TSAEIRNIVT RQLENWGATC ITPDERLISQ DYDIFLTDNP SNLTASGLLL SDDESGVREI GPGQLCVNFN MSNAMQEAVL QLIEVQLAQE EVTESPLGGD ENAQLHASGY YALFVDTVPD DVKRLYTEAA TSDFAALAQT AHRLKGVFAM LNLVPGKQLC ETLEHLIREK DVPGIEKYIS DIDSYVKSLL
|
| |