Gene RPB_0197 details

Gene Information

Locus tag: RPB_0197
Symbol:
ID: 3909438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 221351
End bp: 222340
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 65%
IMG OID: 637882078
Product: sigma 32 (RpoH)
Protein accession: YP_483819
Protein GI: 86747323
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH; [TIGR02937] RNA polymerase sigma factor, sigma-70 family
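
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the 990 bp figure) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(221351, 222340)
print(length, protein_length(length))  # prints: 990 329
```

Both values agree with the Gene Length and Protein Length fields in the record.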


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.271088
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCGG CATTTTCTTC GTCGTCCCTC GCCCCGGCAC ACTCCGACGC TGCGGTTTTC 
GATGAGAAGA GCTACATGAG GGCGATCGGC CGCTATCCGG TCCTGGAGCC GGATGAAGAA
GCTCGGTTGT GGCAGCGATG GCTGCAGCAT CGCGACAAAG CGGCGGCTGA CGCGCTGATC
ACCAGCCACC TCCGGCTCGC CGCGAAACTG GCTCGCGACT TCCGACGCTA TGGCTTTCCG
CTGGGGGATC TGATCGCCGA AGCGAATCTC GGACTGATGA TGGCGCTCGA CCGGTTCGAC
CCCGAACGCG GCGCGCGGTT CTCGACCTGC GCGGTGTGGT GGATCCGGTC AGCGATCTAC
GATCACATCA TCCGATCGTG GTCGCTGGTG CGGATCGGCC GGACGCCTGC GCAGAAGAAG
TTGTTCTTCC GGCTTCGCGG CGAGATCCGC CGGCTCCAGC CCGATCACCA CGGCACGCTC
ACCAAGGAAT TGGCCGAACA GATTTCAGCG ACGCTCGACG TTCCGCTTCG CGAAGTCATC
GAGATGGAGC AGCGCCTGTC CGGCGACCGG TCCTTGAACA CGCCGTTGTC TGATCTCGAC
GAGAGCGGCG AGTGGCAGGA TCTGATTGCC GACGACGCGC CGAACGCCGA GGCGGTCCTC
GCCGGCCACG ACGAACTCGA CCATCAACGC CGCGCGTTGC AGGACGCGCT GGTTCAGCTC
GATGCCCGCG AACGCTACAT CTTTTCGGCC AGACATTTGG GCGAGCGTCC CGCCAGCTTC
GAGACGATCG GTCAGTCGCT CTCGATCTCG GCGGAACGGG TGCGGCAGAT CGAGGCCCGC
GCATTCGCCA AGGTCGCGAA CTCTGCCCGC CGAACGTGCG GGACGGCGCG GCCGGCCGCA
CGTGTCACCA GCAACCGAAA GACGACCGCT CTGACCGCTC CGCCCAACTG GATCGGCCAC
AACGCAGCCG CGGTCCACGC CTCGGTCTGA
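
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup and of a GC-fraction calculation like the one behind the GC content field. The helper names are hypothetical, and only the final line of the sequence is used here for illustration, so the result is not expected to match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Final line of the gene sequence as shown in the record (illustration only).
last_line = "AACGCAGCCG CGGTCCACGC CTCGGTCTGA"
seq = clean_sequence(last_line)
print(len(seq), seq[-3:], round(gc_fraction(seq), 2))
```

Passing the full sequence text to `clean_sequence` in the same way should give a string whose length can be compared with the Gene Length field.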
 
Protein sequence
MTSAFSSSSL APAHSDAAVF DEKSYMRAIG RYPVLEPDEE ARLWQRWLQH RDKAAADALI 
TSHLRLAAKL ARDFRRYGFP LGDLIAEANL GLMMALDRFD PERGARFSTC AVWWIRSAIY
DHIIRSWSLV RIGRTPAQKK LFFRLRGEIR RLQPDHHGTL TKELAEQISA TLDVPLREVI
EMEQRLSGDR SLNTPLSDLD ESGEWQDLIA DDAPNAEAVL AGHDELDHQR RALQDALVQL
DARERYIFSA RHLGERPASF ETIGQSLSIS AERVRQIEAR AFAKVANSAR RTCGTARPAA
RVTSNRKTTA LTAPPNWIGH NAAAVHASV