Gene Information |
Locus tag | ECH_0655 |
Symbol | rpoH |
ID | 3928045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 670231 |
End bp | 671124 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901777 |
Product | RNA polymerase sigma-32 factor |
Protein accession | YP_507464 |
Protein GI | 88658518 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
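The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(670231, 671124)
print(length)                     # 894
print(protein_length_aa(length))  # 297
```

Both results match the Gene Length (894 bp) and Protein Length (297 aa) fields listed in the record.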
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0381956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACAA ATTCTATATT TTCCCTAACT CAAGACAATT TAATGTCCTA TATCAATGAA GTGCATGCAT TTCCGATTTT GTCTCCTGAA GAAGAAGACA GGTTAGCAAG AAATTGGTAT GAAAATGGGA TCGTTGCTGA CGCACATAGG TTAGTTACTA GTCATCTAAG GCTAGTAGTC AAAGTTGCAT TAAGCTTTAA AAATTATGAA TTGCCTCTTA TAGAGCTAAT AATGGAAGGA AATATAGGGC TGATGCAGGC TGTAAAAAAG TTCAATCCCA CTCTTGGCTT TAGGTTATCC ACTTATGCTA TTTGGTGGAT CAAAGCTTTT ATTAAGGACT ATATTCTTAA ATCTTGGTCG TGCATTAAAA TTGGTACAAC ACAAGCACAA AGGAAGTTAT TCTTTAGCTT AAGGAAAATT AAGAAAAAAC TTTTTAAATA TAACCACAAT ATTACAAAAG AAGATATAAA GCTAATTGCA AATAAATGTT CAACTTCTGA ACAAGAAGTA GAACAGATGA ACAGGTATTT TCTCTACAGA GATAGATCCC TGAATGAACT AGTATTCTCT AATGATAATC AAAATGGAGT CGAATTACAA GAGATTATAA AGTGTGATAC CCCAAACCAA GAGGATACAT ATTTACTAAA TGAAGAGTTA AATATAAAAA AGGCTTTAAT TTCACAAGCT TTATCAACAC TAAATGAAAG ATACCGCGAC ATATTCATCA GGCGGCGACT CATCGAAGAA CCAGATACTT TAGACAAATT AAGTCAAGAG TATAATATAT CAAAAGAGAG AGTTAGACAA ATAGAAATGC ATGCTTTTAC TAAAGTAAAG AATTTTATTA TATCTGAAAG AGAAAAACTA GGTCATTGTA ATATCAATAG TTAA
|
Protein sequence | MLTNSIFSLT QDNLMSYINE VHAFPILSPE EEDRLARNWY ENGIVADAHR LVTSHLRLVV KVALSFKNYE LPLIELIMEG NIGLMQAVKK FNPTLGFRLS TYAIWWIKAF IKDYILKSWS CIKIGTTQAQ RKLFFSLRKI KKKLFKYNHN ITKEDIKLIA NKCSTSEQEV EQMNRYFLYR DRSLNELVFS NDNQNGVELQ EIIKCDTPNQ EDTYLLNEEL NIKKALISQA LSTLNERYRD IFIRRRLIEE PDTLDKLSQE YNISKERVRQ IEMHAFTKVK NFIISEREKL GHCNINS
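The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, shown on the first three blocks of the gene sequence only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the value for this excerpt is not expected to equal the record's whole-gene GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
excerpt = clean_sequence("ATGTTAACAA ATTCTATATT TTCCCTAACT")
print(len(excerpt))                    # 30
print(excerpt.startswith("ATG"))       # True
print(round(gc_fraction(excerpt), 2))  # 0.23
```

Applying the same two helpers to the full gene sequence would allow the 894 bp length and the 31% GC content fields to be checked directly.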