Gene RPD_4160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4160 
Symbol 
ID4024682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4627937 
End bp4629196 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content69% 
IMG OID637964368 
Productsigma-70 region 2 
Protein accessionYP_571280 
Protein GI91978621 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC TCGCCTGGAT CGAAACCGCG ATCACCGCGG CGCGGCCCCA GGCGATCGGC 
GCGTTGCTGC GCTATTTCCG CAACCTCGAC ACCGCCGAGG AGGCGTTTCA GAACGCCTGC
CTGCGCGCGC TGAAGGCGTG GCCGCAGAAC GGCCCGCCGC GCGATCCGGC GGCGTGGCTG
ATCATGGTCG GCCGCAACGT CGCGATCGAC GACATCCGAC GCGGCAGGAA GCTGCAGCCG
CTGCCCGACG ACGACGCGAT CTCCGATCTC GACGACGCCG AGGACGCGCT CGCCGAGCGG
CTCGACGGCT CGCATTATCG CGACGACATC CTGCGGCTGC TGTTCATCTG CTGCCATCCC
GAATTGCCGC CGACCCAGCA GATCGCGCTG GCCCTGCGCA TCGTCAGCGG CCTGACCGTG
GCGCAGATCG CGCGCGCGTT TCTCGTCTCG GATGCTGCGA TGGAGCAGCG CATCACCCGC
GCCAAAGCCA GGGTCGCCCG CGCGAGCGTG CCGTTCGAGA CGCCGGGCGC GCCGGAGCGC
AGTGAACGGC TCGGCGCGGT GGCGGCGATG ATCTACCTGG TGTTCAACGA GGGCTATTCC
GCCTCCGGCG ACACCGCCGG CCTCCGCGCG CCGCTGTGCG AGGAGGCGAT CCGCCTGGCG
CGGCTGCTGC TGCGGCTGTT TCCGTCCGAG CCCGAGATCA TGGGCCTGAC CGCGCTGATG
CTGCTGCAGC ACGCCCGCGC GCCGGCGCGG TTCGATCCGG GCGGCGAGAT CGTGCTGCTC
GACGATCAGG ACCGCAGCCT CTGGAACGCA AAATTCATCG CCGAGGGTCT GGCGCTGATC
GACAAGGCGA TGCGCCATCG CCGCACCGGG GCGTATCAGA TCCAGGCCGC GATCGCCGCG
CTACATGCGC GGGCCGAAAA GCCCGAAGAT ACCGACTGGG CGCAGATCGA TCTGTTGTAC
GGTTCGCTGG AAATCCTGCA GCCGTCGCCG GTGGTGACGC TCAACCGCGC GGTCGCGGTG
TCGAAAGTGC GCGGCGCGGC GGCGGCGCTG TCGATGATCG CGCCGCTGGA GCAGCGGCTG
TCGAACTACT TCCATTATTT CGGCACCAAG GGCGCTCTGC TGCTGCAACA GGGTTGCCGC
GACGAGGCCC GCATCGCGTT CGATCGCGCC ATCGCGCTCG CCCGCACCAC CGCCGAGGCT
TCGCACATCC GGATGCATCT CGATCGGCTG AAGCGCGACA GCGAAGCGAT CGGAACGTGA
 
Protein sequence
MTDLAWIETA ITAARPQAIG ALLRYFRNLD TAEEAFQNAC LRALKAWPQN GPPRDPAAWL 
IMVGRNVAID DIRRGRKLQP LPDDDAISDL DDAEDALAER LDGSHYRDDI LRLLFICCHP
ELPPTQQIAL ALRIVSGLTV AQIARAFLVS DAAMEQRITR AKARVARASV PFETPGAPER
SERLGAVAAM IYLVFNEGYS ASGDTAGLRA PLCEEAIRLA RLLLRLFPSE PEIMGLTALM
LLQHARAPAR FDPGGEIVLL DDQDRSLWNA KFIAEGLALI DKAMRHRRTG AYQIQAAIAA
LHARAEKPED TDWAQIDLLY GSLEILQPSP VVTLNRAVAV SKVRGAAAAL SMIAPLEQRL
SNYFHYFGTK GALLLQQGCR DEARIAFDRA IALARTTAEA SHIRMHLDRL KRDSEAIGT