Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1525 |
Symbol | |
ID | 6409182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1606239 |
End bp | 1607219 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711419 |
Product | RNA polymerase, sigma 32 subunit, RpoH |
Protein accession | YP_001990534 |
Protein GI | 192289929 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.480979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCAT CGTTTCCTAT GGCGGCGCCC GCATCGACAC ACGTCAATTC CGTGCTGTTC GACGAGCAAG ACTACATGCG GGCGATCGGC CGCTATCCGG TGCTCGAACC GGACGAAGAG GCGCGGTTGT GGCAGCGCTG GCACACCGCT CGCGACAAAG CGGCGGCTGA CGTGCTGATC ACCAGCCATC TCCGACTCGC GGCCAAGCTG GCCCGCAACT ACCGACGCTA CGGATTCCCG ATCGGCGACC TGATCGCCGA AGCCAATCTC GGCCTGATGA TGGCGCTCGA CAAATTCGAT CCGACGCGCG GCGCACGCTT TGCAACCTGC GCGGTGTGGT GGATCCGGTC GGCGATCTAC GATCACATCG TCAGGTCGTG GTCGCTGGTG CGCATCGGTC GAACGCCGGC GCAGAAGAAG CTGTTCTTCC GTCTCCGCGG CGAGATCCGC CGGCTGAGGC CCGACCACCA CGGCGCCCTC ACCAAGGACC TGGCGGAGCA GATCTCGACC TCACTCGATG TCCCGCTGCG AGAGGTCATC GAGATGGAGC AGCGCCTGTC GGGCGATCTG TCGTTGAACG CACCGCTGTC GGACCTTGAC GAGAGCGGCG AGTGGCAGGA TTTGATCGCC GATGACTCGC CGAATGCCGA AGCTATTCTG GCAGGCCACG ACGAGCTCGA TCGTCAGCGC GGCGCGCTCC GGGACGCGCT GGTGCAGCTC GACGCGCGTG AGCGCTACAT CTTTTCGGCG CGGCATCTGG GCGACCGCCC CGCCAGCTTC GAGGCCATCG GTCAGTCGCT CTCGCTTTCT GCCGAACGGG TCCGGCAGAT CGAGGCGCGT GCGTTCGCCA AGGTGGCGAC AGCGGCGCGG CGGCATTGCG GCACGGCCCA GCAAGTTGCC CGCGGCAGGA TCGACCGACA TCCGCATCCG GCCATCCAGA GCCGCCAACT TGGAATGGAG CAGATGGCGC ACGCCTCGTG A
|
Protein sequence | MASSFPMAAP ASTHVNSVLF DEQDYMRAIG RYPVLEPDEE ARLWQRWHTA RDKAAADVLI TSHLRLAAKL ARNYRRYGFP IGDLIAEANL GLMMALDKFD PTRGARFATC AVWWIRSAIY DHIVRSWSLV RIGRTPAQKK LFFRLRGEIR RLRPDHHGAL TKDLAEQIST SLDVPLREVI EMEQRLSGDL SLNAPLSDLD ESGEWQDLIA DDSPNAEAIL AGHDELDRQR GALRDALVQL DARERYIFSA RHLGDRPASF EAIGQSLSLS AERVRQIEAR AFAKVATAAR RHCGTAQQVA RGRIDRHPHP AIQSRQLGME QMAHAS
|
| |