Gene Rpal_1525 details

Gene Information

Locus tag: Rpal_1525
Symbol:
ID: 6409182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1606239
End bp: 1607219
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 66%
IMG OID: 642711419
Product: RNA polymerase, sigma 32 subunit, RpoH
Protein accession: YP_001990534
Protein GI: 192289929
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
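
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1606239, 1607219)
    print(length, protein_length(length))
```

Under these assumptions the coordinates give 981 bp and 326 aa, which agrees with the Gene Length and Protein Length fields.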


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.480979
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTCAT CGTTTCCTAT GGCGGCGCCC GCATCGACAC ACGTCAATTC CGTGCTGTTC 
GACGAGCAAG ACTACATGCG GGCGATCGGC CGCTATCCGG TGCTCGAACC GGACGAAGAG
GCGCGGTTGT GGCAGCGCTG GCACACCGCT CGCGACAAAG CGGCGGCTGA CGTGCTGATC
ACCAGCCATC TCCGACTCGC GGCCAAGCTG GCCCGCAACT ACCGACGCTA CGGATTCCCG
ATCGGCGACC TGATCGCCGA AGCCAATCTC GGCCTGATGA TGGCGCTCGA CAAATTCGAT
CCGACGCGCG GCGCACGCTT TGCAACCTGC GCGGTGTGGT GGATCCGGTC GGCGATCTAC
GATCACATCG TCAGGTCGTG GTCGCTGGTG CGCATCGGTC GAACGCCGGC GCAGAAGAAG
CTGTTCTTCC GTCTCCGCGG CGAGATCCGC CGGCTGAGGC CCGACCACCA CGGCGCCCTC
ACCAAGGACC TGGCGGAGCA GATCTCGACC TCACTCGATG TCCCGCTGCG AGAGGTCATC
GAGATGGAGC AGCGCCTGTC GGGCGATCTG TCGTTGAACG CACCGCTGTC GGACCTTGAC
GAGAGCGGCG AGTGGCAGGA TTTGATCGCC GATGACTCGC CGAATGCCGA AGCTATTCTG
GCAGGCCACG ACGAGCTCGA TCGTCAGCGC GGCGCGCTCC GGGACGCGCT GGTGCAGCTC
GACGCGCGTG AGCGCTACAT CTTTTCGGCG CGGCATCTGG GCGACCGCCC CGCCAGCTTC
GAGGCCATCG GTCAGTCGCT CTCGCTTTCT GCCGAACGGG TCCGGCAGAT CGAGGCGCGT
GCGTTCGCCA AGGTGGCGAC AGCGGCGCGG CGGCATTGCG GCACGGCCCA GCAAGTTGCC
CGCGGCAGGA TCGACCGACA TCCGCATCCG GCCATCCAGA GCCGCCAACT TGGAATGGAG
CAGATGGCGC ACGCCTCGTG A
 
Protein sequence
MASSFPMAAP ASTHVNSVLF DEQDYMRAIG RYPVLEPDEE ARLWQRWHTA RDKAAADVLI 
TSHLRLAAKL ARNYRRYGFP IGDLIAEANL GLMMALDKFD PTRGARFATC AVWWIRSAIY
DHIVRSWSLV RIGRTPAQKK LFFRLRGEIR RLRPDHHGAL TKDLAEQIST SLDVPLREVI
EMEQRLSGDL SLNAPLSDLD ESGEWQDLIA DDSPNAEAIL AGHDELDRQR GALRDALVQL
DARERYIFSA RHLGDRPASF EAIGQSLSLS AERVRQIEAR AFAKVATAAR RHCGTAQQVA
RGRIDRHPHP AIQSRQLGME QMAHAS
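
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the DNA. It is a minimal illustration, not the pipeline that produced this record. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw: str) -> str:
    """Remove the spacing and line breaks used in the record's layout."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = clean(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(raw)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGGCCTCAT CGTTTCCTAT GGCGGCGCCC "
        "GCATCGACAC ACGTCAATTC CGTGCTGTTC"
    )
    print(translate(first_line))
    print(round(gc_fraction(first_line), 3))
```

Pasting the full gene sequence into `translate` and dropping the trailing `*` should give a string that can be compared against the cleaned protein sequence.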