Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0455 |
Symbol | |
ID | 3909800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 502023 |
End bp | 502922 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882342 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_484077 |
Protein GI | 86747581 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.676672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.248951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGTG CAGCTACGCT ACCCGTTCTC AACGGTGAAT CCGGACTCGC TCGGTATCTC GCGGAAATCC GCAAGTTCCC GATGCTCGAG CCGCAACAGG AATACATGTT CGCCAAGCGC TGGCGCGAGC ACGATGATCG CGACGCCGCG CATCACCTCG TCACCAGCCA TCTGCGGCTC GTCGCCAAGA TCGCCATGGG CTATCGCGGC TACGGCCTGC CGATCTCCGA GGTCGTCTCG GAAGGCAATG TCGGCCTGAT GCAGGCGGTG AAGCGGTTCG AGCCGGACAA AGGCTTCCGC CTCGCCACCT ACGCGATGTG GTGGATCAAG GCGTCGATTC AAGAATACAT CCTGCGTTCG TGGTCGCTCG TGAAGATGGG CACCACCGCG AACCAGAAGA AGCTGTTCTT CAATCTGCGC AAGGCGAAGA GCAAGATCTC GGCGCTGGAC GAGGGTGATA TGCACCCCGA CCAGGTCAAG CTGATCGCCA AGCGGCTCGG CGTCACCGAG CAGGACGTGA TCGACATGAA TCGCCGCCTC GGTGGCGACG CGTCGCTCAA CGCCCCGATC CGCGACGACG GCGAGCCCGG CGAATGGCAG GACTGGCTGG TCGACCAGTC GCCGAATCAG GAAGCCGTGA TGGCCGAGCA CGAGGAGCTC GATCATCGCC GCGCCGCGCT GAACGGTGCG ATCGGCGTGC TCAACCCGCG CGAACGGCGG ATCTTCGAGG CGCGCCGCCT CGCCGACGAG CCGATGACGC TGGAAGACCT CGCCGCCGAG TTCGGCGTCT CGCGCGAGCG CGTCCGCCAG ATCGAGGTGC GTGCCTTCGA GAAGGTGCAG AGCGCCGTCA AGGGCACCAT CGCGCGTCAG GAACAGGCGG CGCTCGAAGC CGCCCACTGA
|
Protein sequence | MARAATLPVL NGESGLARYL AEIRKFPMLE PQQEYMFAKR WREHDDRDAA HHLVTSHLRL VAKIAMGYRG YGLPISEVVS EGNVGLMQAV KRFEPDKGFR LATYAMWWIK ASIQEYILRS WSLVKMGTTA NQKKLFFNLR KAKSKISALD EGDMHPDQVK LIAKRLGVTE QDVIDMNRRL GGDASLNAPI RDDGEPGEWQ DWLVDQSPNQ EAVMAEHEEL DHRRAALNGA IGVLNPRERR IFEARRLADE PMTLEDLAAE FGVSRERVRQ IEVRAFEKVQ SAVKGTIARQ EQAALEAAH
|
| |