Gene RPB_4131 details

Gene Information

Locus tag: RPB_4131
Symbol:
ID: 3911939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4703578
End bp: 4705722
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 64%
IMG OID: 637886035
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_487734
Protein GI: 86751238
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
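
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; both are assumptions about how the record was built, not statements from the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 4703578
end_bp = 4705722
reported_gene_length = 2145      # bp
reported_protein_length = 714    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2145 0 714
```

Under these assumptions the computed values agree with the reported gene and protein lengths.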


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.138863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCA AGGCGAAGAC GGTTCAGTTG AAAGACAAGG AAAAGGACGA CAAGGCCGAC 
GCGCCGGAGA AGGACTCCGC CGACGCTCCC TCGCCGTTGC TCGACCTGTC GGACGCGGCC
GTCAAGAAGA TGATCAAGCA GGCCAAGAAG CGCGGCTTCG TGACCTTCGA TCAGCTCAAC
GAAGTGCTGC CGTCCGACAC CACGTCGCCG GAGCAGATCG AGGACATCAT GTCGATGCTC
TCCGACATGG GCATCAACGT GTCCGAGGCG GAAGAGAGCG ACAGCGAGGA CGAGGACGCC
AAGGACGAGG CCGAGGAAGA GCCCGATAAC GACCTCGTCG AGGTCACCCA GAAGGCCGTC
ACCGAGACCA AGAAGTCCGA GCCCGGCGAG CGCACCGACG ATCCCGTCCG GATGTATCTG
CGCGAGATGG GCACCGTCGA GCTGCTGTCG CGCGAGGGCG AAATCGCCAT CGCCAAGCGG
ATCGAGGCCG GCCGCGAGGC GATGATCGCC GGACTCTGCG AAAGCCCGCT GACCTTCCAG
GCGATCATCA TCTGGCGCGA CGAGCTCAAC GAAGGCAAGA TCTTCCTTCG CGACATCATC
GATCTCGAAG CCACCTATGC GGGCCCCGAC GCCAAGGCCA ACATGAACCC TGCGATGGCC
GAGGGCGCCG GCGAGGAAGC CAGCGCCGAA GGCGAGGCCG ACGCCGGAGC GCCCGCGCAT
GTCGCGCCGC CCGCCGCGCC GCCGACGGCG ACGCCGTTCC GTCCCGCGCA GCAGCGCTCG
GCGCCCGCCG CGGCCCCGGC CGCCGGCGAG GGCGGCGGTG AAGGCGCCGC CGAAGGCGAC
ATGGACGACG ACGAGTTCGA GAACCAGATG TCGCTCGCCG CCATCGAGGC CGAACTCAAG
CCGAAGGTGG TCGAGACCTT CGACAAGATC GCCGACAACT ACAAGAAGCT GCGCAAGCTG
CAAGAGCAGG ACATCGCCAA CCAGCTCGAA AGCGCGTCGC AGGGGCCGTC GCTGTCGCCC
TCGCAGGAGC GCAAATACAA GAAGCTCAAG GACGAAATCA TCGTCGAGGT GAAGTCGCTG
CGGCTCAATC AGGCGCGTAT CGACTCACTG GTCGAGCAGC TCTACGACAT CAACAAGAAG
CTGGTGTCGT TCGAAGGCCG GCTGCTGCGC CTCGGCGACA GCCACGGCGT GGCGCGCGAG
GACTTCCTGC GCAACTATCA GGGCTCCGAG CTCGATCCGC GCTGGCTCAA CCGCGTCTCG
AAACTCTCCG CCAAGGGCTG GAAGAACTTC GTCCATTTCG AGAAGGACCG GATCCGCGAG
CTGCGCCAGG AAATCCAGTC GATGGCCGCG CTCACCGGGC TCGAGATCGG CGAATTCCGC
AAGATCGTGC ATTCGGTGCA GAAGGGCGAG CGCGAAGCCC GCCAGGCCAA GAAGGAAATG
GTCGAGGCCA ACCTGCGTCT GGTGATCTCG ATTGCCAAGA AATACACCAA CCGCGGCCTG
CAGTTCCTCG ATCTCATTCA GGAAGGCAAT ATCGGCCTGA TGAAGGCGGT CGACAAATTC
GAATATCGCC GCGGCTACAA ATTCTCCACC TATGCGACGT GGTGGATCCG GCAGGCGATC
ACGCGTTCGA TCGCCGACCA GGCCCGCACC ATCCGCATTC CGGTGCACAT GATCGAGACG
ATCAACAAGA TCGTGCGCAC CTCGCGGCAG ATGCTCAACG AGATCGGCCG CGAGCCGACC
CCGGAAGAGC TCGCCGAGAA GCTCGGCATG CCGCTGGAGA AGGTCCGCAA GGTCCTCAAG
ATCGCCAAGG AGCCGCTGTC GCTCGAAACC CCGGTGGGTG ACGAAGAGGA CAGCCATCTC
GGCGACTTCA TCGAGGACAA GAACGCGGTG CTGCCGATCG ACGCCGCGAT CCAGTCGAAC
CTGCGCGAAA CCACGACGCG CGTGCTCGCC TCGCTCACCC CGCGCGAAGA ACGCGTGCTG
CGCATGCGCT TCGGCATCGG CATGAACACC GACCACACGC TGGAAGAAGT CGGCCAGCAG
TTCTCGGTCA CCCGCGAACG CATCCGCCAG ATCGAAGCCA AGGCGCTGCG CAAGTTGAAG
CATCCGAGCC GGTCAAGGAA GCTGCGGAGC TTCTTGGACA ACTGA
 
Protein sequence
MASKAKTVQL KDKEKDDKAD APEKDSADAP SPLLDLSDAA VKKMIKQAKK RGFVTFDQLN 
EVLPSDTTSP EQIEDIMSML SDMGINVSEA EESDSEDEDA KDEAEEEPDN DLVEVTQKAV
TETKKSEPGE RTDDPVRMYL REMGTVELLS REGEIAIAKR IEAGREAMIA GLCESPLTFQ
AIIIWRDELN EGKIFLRDII DLEATYAGPD AKANMNPAMA EGAGEEASAE GEADAGAPAH
VAPPAAPPTA TPFRPAQQRS APAAAPAAGE GGGEGAAEGD MDDDEFENQM SLAAIEAELK
PKVVETFDKI ADNYKKLRKL QEQDIANQLE SASQGPSLSP SQERKYKKLK DEIIVEVKSL
RLNQARIDSL VEQLYDINKK LVSFEGRLLR LGDSHGVARE DFLRNYQGSE LDPRWLNRVS
KLSAKGWKNF VHFEKDRIRE LRQEIQSMAA LTGLEIGEFR KIVHSVQKGE REARQAKKEM
VEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST YATWWIRQAI
TRSIADQART IRIPVHMIET INKIVRTSRQ MLNEIGREPT PEELAEKLGM PLEKVRKVLK
IAKEPLSLET PVGDEEDSHL GDFIEDKNAV LPIDAAIQSN LRETTTRVLA SLTPREERVL
RMRFGIGMNT DHTLEEVGQQ FSVTRERIRQ IEAKALRKLK HPSRSRKLRS FLDN