Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4524 |
Symbol | rpoN |
ID | 6969117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4188837 |
End bp | 4190270 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388237 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_002272672 |
Protein GI | 209397246 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG GTTTGCAACT CAGGCTTAGC CAACAACTGG CGATGACGCC ACAGCTCCAA CAGGCAATTC GTCTGTTGCA GTTGTCGACG CTGGAACTTC AGCAGGAGCT ACAGCAGGCG CTGGAGAGTA ATCCGCTGCT TGAGCAAATC GACACTCATG AAGAAATCGA CACCCGCGAA ACGCAAGACA GTGAAACGCT GGACACCGCC GACGCGCTCG AACAAAAAGA GATGCCGGAA GAGCTGCCGC TCGATGCCAG TTGGGACACC ATTTACACCG CTGGTACACC ATCCGGCACC AGCGGTGACT ACATTGACGA CGAGCTGCCG GTCTATCAGG GCGAAACGAC GCAGACCTTG CAGGATTACC TGATGTGGCA GGTCGAGCTG ACACCGTTTT CCGACACTGA CCGCGCTATT GCTACCTCTA TCGTCGATGC CGTTGATGAC ACCGGTTATC TGACTGTCCC GCTGGAAGAT ATTCTCGAAA GTATGGGCGA TGAAGAGATC GACATCGACG AAGTTGAAGC CGTCCTTAAG CGGATCCAAC GGTTTGATCC GGTCGGTGTG GCGGCAAAAG ATCTGCGTGA CTGCCTGCTG ATCCAACTCT CCCAATTCGA TAAAACCACG CCGTGGCTGG AAGAGGCCAG ACTGATCATT AGCGATCATC TCGATCTGTT AGCCAATCAC GACTTCCGCA CTTTAATGCG CGTCACGCGT CTGAAAGAAG ATGTGCTGAA AGAAGCCGTC AATCTGATCC AGTCGCTCGA TCCGCGCCCC GGGCAATCGA TCCAGACTGG CGAACCTGAG TATGTCATTC CAGATGTGCT GGTGCGTAAG CATAACGGTC ACTGGACGGT AGAACTCAAC AGTGACAGCA TTCCTCGTCT GCAAATCAAC CAGCACTACG CCTCGATGTG CAATAACGCG CGCAATGATG GTGACAGCCA GTTTATCCGC AGCAATCTGC AAGATGCCAA ATGGTTGATC AAGAGTCTGG AAAGCCGTAA CGATACGCTA CTGCGCGTGA GTCGCTGTAT CGTTGAACAG CAGCAAGCCT TCTTTGAGCA AGGCGAAGAA TATATGAAAC CGATGGTACT GGCCGATATC GCCCAGGCTG TCGAAATGCA TGAATCGACG ATATCTCGCG TGACCACGCA AAAATACCTG CATAGTCCAC GAGGCATTTT TGAACTGAAG TATTTCTTTT CCAGTCACGT CAATACCGAG GGCGGCGGCG AAGCTTCCTC CACGGCGATT CGTGCGCTGG TGAAGAAATT AATCGCGGCG GAAAACCCAG CGAAACCGTT GAGCGACAGC AAGTTAACCT CTTTGCTGTC GGAACAAGGT ATCATGGTGG CACGCCGCAC TGTTGCGAAG TACCGAGAGT CTTTATCCAT TCCGCCGTCA AACCAGCGTA AACAGCTCGT TTGA
|
Protein sequence | MKQGLQLRLS QQLAMTPQLQ QAIRLLQLST LELQQELQQA LESNPLLEQI DTHEEIDTRE TQDSETLDTA DALEQKEMPE ELPLDASWDT IYTAGTPSGT SGDYIDDELP VYQGETTQTL QDYLMWQVEL TPFSDTDRAI ATSIVDAVDD TGYLTVPLED ILESMGDEEI DIDEVEAVLK RIQRFDPVGV AAKDLRDCLL IQLSQFDKTT PWLEEARLII SDHLDLLANH DFRTLMRVTR LKEDVLKEAV NLIQSLDPRP GQSIQTGEPE YVIPDVLVRK HNGHWTVELN SDSIPRLQIN QHYASMCNNA RNDGDSQFIR SNLQDAKWLI KSLESRNDTL LRVSRCIVEQ QQAFFEQGEE YMKPMVLADI AQAVEMHEST ISRVTTQKYL HSPRGIFELK YFFSSHVNTE GGGEASSTAI RALVKKLIAA ENPAKPLSDS KLTSLLSEQG IMVARRTVAK YRESLSIPPS NQRKQLV
|
| |