Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0197 |
Symbol | |
ID | 3909438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 221351 |
End bp | 222340 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882078 |
Product | sigma 32 (RpoH) |
Protein accession | YP_483819 |
Protein GI | 86747323 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.271088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCGG CATTTTCTTC GTCGTCCCTC GCCCCGGCAC ACTCCGACGC TGCGGTTTTC GATGAGAAGA GCTACATGAG GGCGATCGGC CGCTATCCGG TCCTGGAGCC GGATGAAGAA GCTCGGTTGT GGCAGCGATG GCTGCAGCAT CGCGACAAAG CGGCGGCTGA CGCGCTGATC ACCAGCCACC TCCGGCTCGC CGCGAAACTG GCTCGCGACT TCCGACGCTA TGGCTTTCCG CTGGGGGATC TGATCGCCGA AGCGAATCTC GGACTGATGA TGGCGCTCGA CCGGTTCGAC CCCGAACGCG GCGCGCGGTT CTCGACCTGC GCGGTGTGGT GGATCCGGTC AGCGATCTAC GATCACATCA TCCGATCGTG GTCGCTGGTG CGGATCGGCC GGACGCCTGC GCAGAAGAAG TTGTTCTTCC GGCTTCGCGG CGAGATCCGC CGGCTCCAGC CCGATCACCA CGGCACGCTC ACCAAGGAAT TGGCCGAACA GATTTCAGCG ACGCTCGACG TTCCGCTTCG CGAAGTCATC GAGATGGAGC AGCGCCTGTC CGGCGACCGG TCCTTGAACA CGCCGTTGTC TGATCTCGAC GAGAGCGGCG AGTGGCAGGA TCTGATTGCC GACGACGCGC CGAACGCCGA GGCGGTCCTC GCCGGCCACG ACGAACTCGA CCATCAACGC CGCGCGTTGC AGGACGCGCT GGTTCAGCTC GATGCCCGCG AACGCTACAT CTTTTCGGCC AGACATTTGG GCGAGCGTCC CGCCAGCTTC GAGACGATCG GTCAGTCGCT CTCGATCTCG GCGGAACGGG TGCGGCAGAT CGAGGCCCGC GCATTCGCCA AGGTCGCGAA CTCTGCCCGC CGAACGTGCG GGACGGCGCG GCCGGCCGCA CGTGTCACCA GCAACCGAAA GACGACCGCT CTGACCGCTC CGCCCAACTG GATCGGCCAC AACGCAGCCG CGGTCCACGC CTCGGTCTGA
|
Protein sequence | MTSAFSSSSL APAHSDAAVF DEKSYMRAIG RYPVLEPDEE ARLWQRWLQH RDKAAADALI TSHLRLAAKL ARDFRRYGFP LGDLIAEANL GLMMALDRFD PERGARFSTC AVWWIRSAIY DHIIRSWSLV RIGRTPAQKK LFFRLRGEIR RLQPDHHGTL TKELAEQISA TLDVPLREVI EMEQRLSGDR SLNTPLSDLD ESGEWQDLIA DDAPNAEAVL AGHDELDHQR RALQDALVQL DARERYIFSA RHLGERPASF ETIGQSLSIS AERVRQIEAR AFAKVANSAR RTCGTARPAA RVTSNRKTTA LTAPPNWIGH NAAAVHASV
|
| |