Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3744 |
Symbol | rpoH |
ID | 6144026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3811178 |
End bp | 3812032 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618570 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_001745710 |
Protein GI | 170683649 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACA AAATGCAAAG TTTAGCTTTA GCCCCAGTTG GCAACCTGGA TTCCTACATC CGGGCAGCTA ACGCGTGGCC GATGTTGTCG GCTGACGAGG AGCGGGCGCT GGCTGAAAAG CTGCATTACC ATGGCGATCT GGAAGCAGCT AAAACGCTGA TCCTGTCTCA CCTGCGGTTT GTTGTTCATA TTGCTCGTAA TTATGCGGGC TATGGCCTGC CACAGGCGGA TTTGATTCAG GAAGGTAACA TCGGCCTGAT GAAAGCAGTG CGCCGTTTTA ACCCGGAAGT GGGTGTGCGC CTGGTCTCCT TCGCCGTTCA CTGGATCAAA GCAGAGATCC ACGAATACGT TCTGCGCAAC TGGCGTATCG TCAAAGTTGC GACCACCAAA GCGCAGCGCA AACTGTTCTT CAACCTGCGT AAAACCAAGC AGCGTCTGGG CTGGTTTAAT CAGGATGAAG TCGAAATGGT GGCCCGTGAA CTGGGCGTAA CCAGCAAAGA CGTACGTGAG ATGGAATCGC GTATGGCGGC ACAGGACATG ACCTTTGACC TGTCTTCCGA TGACGATTCC GACAGCCAGC CGATGGCTCC GGTGCTCTAT CTGCAGGATA AATCATCTAA CTTTGCTGAC GGCATCGAAG ATGATAACTG GGAAGAGCAG GCGGCAAACC GTCTGACCGA CGCGATGCAA GGTCTGGACG AGCGTAGCCA GGATATCATC CGCGCGCGCT GGCTGGACGA AGACAACAAG TCCACGTTGC AGGAACTGGC TGACCGTTAC GGTGTTTCCG CTGAGCGTGT GCGTCAGCTG GAAAAGAACG CGATGAAAAA ATTGCGCGCT GCCATTGAAG CGTAA
|
Protein sequence | MTDKMQSLAL APVGNLDSYI RAANAWPMLS ADEERALAEK LHYHGDLEAA KTLILSHLRF VVHIARNYAG YGLPQADLIQ EGNIGLMKAV RRFNPEVGVR LVSFAVHWIK AEIHEYVLRN WRIVKVATTK AQRKLFFNLR KTKQRLGWFN QDEVEMVARE LGVTSKDVRE MESRMAAQDM TFDLSSDDDS DSQPMAPVLY LQDKSSNFAD GIEDDNWEEQ AANRLTDAMQ GLDERSQDII RARWLDEDNK STLQELADRY GVSAERVRQL EKNAMKKLRA AIEA
|
| |