Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2609 |
Symbol | rpoH2 |
ID | 5713507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2772541 |
End bp | 2773437 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641268533 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_001533943 |
Protein GI | 159045149 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.000766177 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCATCAT ATACCAATCT TCCGGCCCCC TCTCCGGAGC AGGGCCTCAA CCGGTACCTT CAGGAAATCC GCAAGTTTCC CATGCTGGAG CCCGAAGAGG AATACATGCT GGCCAAGCGC TGGGTGGATC ACGAGGACAC CGAGGCGGCG CACAAGATGG TCACCTCGCA CCTGCGTCTC GCCGCGAAAA TCGCCATGGG CTACCGCGGC TACGGCCTGC CCCAGGCGGA GGTGATCTCC GAGGCCAATG TGGGCCTGAT GCAGGCGGTC AAGCGGTTCG ATCCCGAGAA GGGGTTTCGG CTGGCGACCT ATGCCATGTG GTGGATCCGC GCCTCGATCC AGGAGTATAT CCTGCGGTCC TGGAGCCTGG TGAAGCTGGG CACGACCTCG GCCCAGAAGA AGCTGTTTTT CAACCTGCGC AAGGCCAAGA ACCGCATCGG CGCGCTGGAG GAGGGCGACC TGCGCCCGGA AAACGTCGCC CGGATCGCCA ATGACCTGAA CGTCACCGAG GACGAGGTGA TCTCGATGAA CCGGCGGATG TCGGGCGGGG ATGCGTCGCT CAACGCGATG ATCGGCTCGG ATGGCGAGGG CGCCACCGAA TGGCAGGACT GGCTGGAGGA CGAGGACGCC GACCAGGCCG ACGACTTCGC CGAGAAGGAC GAGTTGATGG TGCGCCGCGA ACTCTTGGCC GAGGCCATGG GCGTGCTCAA CGACCGCGAG AAGGACATCC TGATGAAGCG CCGGCTGGAG GACAAGCCCG CGACCCTGGA AGAGCTGTCG GAGGTCTACG GCGTCAGCCG GGAGCGGATC CGCCAGATCG AGGTCCGCGC CTTCGAGAAG TTGCAGAAGG CGATGCGCGA CCTGGCCAAG GAGAAGGGGT TGATGGCGAC CGCCTGA
|
Protein sequence | MASYTNLPAP SPEQGLNRYL QEIRKFPMLE PEEEYMLAKR WVDHEDTEAA HKMVTSHLRL AAKIAMGYRG YGLPQAEVIS EANVGLMQAV KRFDPEKGFR LATYAMWWIR ASIQEYILRS WSLVKLGTTS AQKKLFFNLR KAKNRIGALE EGDLRPENVA RIANDLNVTE DEVISMNRRM SGGDASLNAM IGSDGEGATE WQDWLEDEDA DQADDFAEKD ELMVRRELLA EAMGVLNDRE KDILMKRRLE DKPATLEELS EVYGVSRERI RQIEVRAFEK LQKAMRDLAK EKGLMATA
|
| |