Gene RPC_0415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0415 
Symbol 
ID3970867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp445890 
End bp447551 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content65% 
IMG OID637923530 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_530309 
Protein GI90421939 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.774408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTGA CCCAACGCTT AGAATTCCGC CAGTCGCAAT CGCTGGTGAT GACGCCGCAG 
CTGATGCAGG CGATCAAGCT GCTGCAACTG TCGAATCTCG ACCTCGCCAG CTTTGTGGAG
GACGAGCTCG AGCGGAACCC GCTGCTCGAT CGCGCCAATG AGAACGGCGA ACCGCCGGTT
CCGGGCGAGC TCGCGCCGGA GCGCGCCGAA TTCTCCGACC GCGACGACGC CGCGTCGGGA
TCTGGCGAGG ATGCGGGGGC GGATGGCTCG GATTTTTCCG ACCGCGCAGC CGGCGATGGC
TTTGAAGCCT CGGCCGAGGA CTGGATGCAG CGCGACCTCG GCAGCCGCGC CGAGATCGAA
CAGACCCTCG ACACCGGCCT CGACAATGTA TTTTCCGAAG AGCCGGCCGA GACCGCTGCG
CGCACCGCGC AGGACGCAGC CCCAACCGCC TTCACCGAAT GGGGCGGCGG CTCGTCGAAC
GACGACGGCT ACAATCTGGA AGCCTTCGTG GCCGCCGAAA TCACGCTGGG CGGCCATCTG
GCCGAACAGC TGGCGGTGGC GTTCACCGAT CCCAAGCAGC GGCTGATCGG GCAGTACCTG
GTCGACCTGG TCGACGACGC CGGCTATCTG CCGCCGGATC TCGGCGACGC CACCGAACGG
CTCGGCGCCA GCGCCGAGCA GGTCGAGGCG GTGGTCGCAG TGCTGCAGAA ATTCGACCCC
GCCGGGGTCT GCGCGCGCAA CCTCAGCGAA TGCCTGGCGA TCCAGCTGCG CGATCGCGAC
CGCTACGATC CGGCGATGCA GGCCTTGGTC GAGCATCTCG ATCTGTTGGC CAAGCGCGAC
ATCGGCGCGT TGCGCCGGAT CTGCGGCGTC GACGACGAGG ACCTCGCCGA CATGATCGGG
GAGATCCGCC ATCTCGATCC GAAGCCGGGG CTGAAATTCG GCACCGCGCG GGTGCAGACC
GTGGTGCCCG ACGTCTATGT GCGGCCCGGG CCGGACGGCG GCTGGCATGT CGAGCTGAAC
AGCGAGACGC TGCCGAAGGT CCTGGTCAAT CAGGTGTATT ATTCCGAACT TTCGAAGACG
ATCCGCAAGG ACGGCGACAA GGCCTACTTC ACCGATTGCC TGCAGAACGC CACCTGGCTG
GTGCGCGCGC TGGATCAGCG CGCCCGCACC ATCCTCAAGG TCTCCACCGA GATCGTGCGC
CAGCAGGACG GCTTCTTCAC CCAGGGCGTC GCCCATCTGC GGCCGCTCAA TCTGAAAGCC
GTCGCAGATG CCATTCAGAT GCACGAGTCG ACAGTCTCGC GCGTGACCGC CAACAAGTAT
ATGGCGACCA ATCGCGGGAT CTTCGAACTG AAATACTTCT TCACCGCGTC GATCGCCTCG
GCAGACGGCG GCGAAGCCCA TTCCGCCGAG GCGGTGCGCC ATCACATCAA GCAATTGATC
GACGGGGAAA ACCCGGCGAT TATTCTGTCC GACGACACCA TCGTTGAAAA ACTGCGCGAG
GCTGGTATTG ACATCGCCCG GCGCACCGTC GCCAAGTACC GCGAAGCGAT GCGGATTCCT
TCCTCCGTCC AGCGTCGCCG AGACAAGCAA AGCATGCTTG GAAATGCACT CACAGCGCCG
GCAACAACGG CAGACCGGTC CCGCGACACC GCACCGGCTT GA
 
Protein sequence
MALTQRLEFR QSQSLVMTPQ LMQAIKLLQL SNLDLASFVE DELERNPLLD RANENGEPPV 
PGELAPERAE FSDRDDAASG SGEDAGADGS DFSDRAAGDG FEASAEDWMQ RDLGSRAEIE
QTLDTGLDNV FSEEPAETAA RTAQDAAPTA FTEWGGGSSN DDGYNLEAFV AAEITLGGHL
AEQLAVAFTD PKQRLIGQYL VDLVDDAGYL PPDLGDATER LGASAEQVEA VVAVLQKFDP
AGVCARNLSE CLAIQLRDRD RYDPAMQALV EHLDLLAKRD IGALRRICGV DDEDLADMIG
EIRHLDPKPG LKFGTARVQT VVPDVYVRPG PDGGWHVELN SETLPKVLVN QVYYSELSKT
IRKDGDKAYF TDCLQNATWL VRALDQRART ILKVSTEIVR QQDGFFTQGV AHLRPLNLKA
VADAIQMHES TVSRVTANKY MATNRGIFEL KYFFTASIAS ADGGEAHSAE AVRHHIKQLI
DGENPAIILS DDTIVEKLRE AGIDIARRTV AKYREAMRIP SSVQRRRDKQ SMLGNALTAP
ATTADRSRDT APA