Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3498 |
Symbol | rpoN |
ID | 6144563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3571330 |
End bp | 3572763 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618327 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_001745474 |
Protein GI | 170684031 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG GTTTGCAACT CAGGCTTAGC CAACAACTGG CGATGACGCC ACAGCTCCAA CAGGCAATTC GTCTGTTGCA GTTGTCGACG CTGGAACTTC AGCAGGAGCT ACAGCAGGCG CTGGAGAGTA ATCCGCTGCT TGAGCAAATC GACACTCATG AAGAAATCGA CACCCGCGAA ACGCAAGACA GTGAAACGCT GGACACCGCC GACGCGCTCG AACAAAAAGA GATGCCGGAA GAGCTGCCGC TCGATGCCAG TTGGGACACC ATTTACACCG CTGGTACACC ATCCGGCACC AGCGGTGACT ACATTGACGA CGAGCTGCCG GTCTATCAGG GCGAAACGAC GCAGACCTTG CAGGATTACC TGATGTGGCA GGTCGAGCTG ACACCGTTTT CCGACACTGA CCGCGCTATT GCTACCTCTA TCGTCGATGC CGTTGATGAC ACCGGTTATC TGACTGTCCC GCTGGAAGAT ATTCTCGAAA GTATGGGCGA TGAAGAGATT GACATCGACG AAGTTGAAGC CGTCCTTAAG CGGATCCAAC GGTTTGATCC AGTCGGTGTG GCGGCAAAAG ATCTGCGTGA CTGTCTGCTG ATCCAACTCT CCCAATTCGA TAAAACCACG CCGTGGCTGG AAGAGGCCAG ACTGATTATT AGCGATCATC TCGATCTGTT AGCCAATCAC GACTTCCGCA CTTTAATGCG CGTCACGCGT CTGAAAGAAG ATGTGCTGAA AGAAGCCGTC AATCTGATCC AGTCGCTCGA TCCGCGCCCC GGGCAGTCGA TCCAGACTGG CGAACCTGAG TATGTCATTC CAGATGTGCT GGTGCGTAAG CATAACGGTC ACTGGACGGT AGAACTCAAC AGTGACAGCA TTCCGCGTCT GCAAATCAAC CAGCACTACG CCTCGATGTG CAATAACGCG CGCAACGATG GTGACAGCCA GTTTATCCGC AGCAATCTGC AGGATGCCAA ATGGTTGATC AAGAGTCTGG AAAGCCGTAA CGATACGCTA CTGCGCGTGA GTCGCTGTAT CGTTGAACAG CAGCAGGCCT TCTTTGAGCA AGGTGAAGAA TATATGAAAC CGATGGTACT GGCCGATATC GCCCAGGCTG TCGAAATGCA TGAATCGACG ATATCTCGCG TGACCACGCA AAAATACCTG CATAGTCCAC GAGGCATTTT TGAACTGAAG TATTTCTTTT CCAGTCACGT CAATACCGAG GGCGGCGGCG AAGCTTCCTC CACGGCGATT CGTGCGCTGG TGAAGAAATT AATCGCGGCG GAAAACCCAG CGAAACCGTT GAGCGACAGC AAGTTAACCT CTTTGCTGTC GGAACAAGGT ATCATGGTGG CACGCCGCAC TGTTGCGAAG TACCGAGAGT CTTTATCCAT TCCGCCGTCA AACCAGCGTA AACAGCTCGT TTGA
|
Protein sequence | MKQGLQLRLS QQLAMTPQLQ QAIRLLQLST LELQQELQQA LESNPLLEQI DTHEEIDTRE TQDSETLDTA DALEQKEMPE ELPLDASWDT IYTAGTPSGT SGDYIDDELP VYQGETTQTL QDYLMWQVEL TPFSDTDRAI ATSIVDAVDD TGYLTVPLED ILESMGDEEI DIDEVEAVLK RIQRFDPVGV AAKDLRDCLL IQLSQFDKTT PWLEEARLII SDHLDLLANH DFRTLMRVTR LKEDVLKEAV NLIQSLDPRP GQSIQTGEPE YVIPDVLVRK HNGHWTVELN SDSIPRLQIN QHYASMCNNA RNDGDSQFIR SNLQDAKWLI KSLESRNDTL LRVSRCIVEQ QQAFFEQGEE YMKPMVLADI AQAVEMHEST ISRVTTQKYL HSPRGIFELK YFFSSHVNTE GGGEASSTAI RALVKKLIAA ENPAKPLSDS KLTSLLSEQG IMVARRTVAK YRESLSIPPS NQRKQLV
|
| |