Gene RPB_4259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4259 
Symbol 
ID3912072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4844586 
End bp4845851 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content69% 
IMG OID637886164 
Productputative sigma factor 
Protein accessionYP_487858 
Protein GI86751362 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.294953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC TCGCCTGGAT CGACACCGCG ATCACGGCGT CCCGGCCCCA GGCGATCGGC 
GCGCTGCTGC GCTATTTCCG CGATCTCGAC ACCGCCGAGG AGGCGTTCCA GAACGCCTGC
CTGCGGGCGC TGAAAGCCTG GCCGCAGAAC GGCCCGCCGC GCGATCCGGC CGCATGGCTG
ATCCTGGTCG GCCGCAATGT CGCGATCGAC GACCTCCGCC GCGGCAAGAA GCAACAGCCG
CTGCCCGACG ACGAAGCGAT CTCCGATCTC GACGACGCCG AGAGCGCGCT CGCCGAGCGG
CTCGACGGCT CGCATTATCG CGACGACATC CTGCGGCTGC TGTTCATCTG CTGTCATCCC
GAATTGCCGC CCACCCAGCA GATCGCGCTG GCGCTGCGCA TCGTCTCGGG GCTGACCGTG
CCGCAGATCG CGCGGGCGTT TCTGGTGTCG GACGCGGCGA TGGAGCAGCG CATCACCCGC
GCCAAGGCCA AAGTCGCCCG CGCCCGCGTG CCGTTCGAAA CCCCGGGCGC GCCGGAGCGC
AGCGAACGGC TGGGCGCGGT GGCGGCGATG ATCTACCTGG TCTTCAACGA GGGCTATTCG
GCGTCGGGCG ACACTGCGGG AATCCGCGCG CCCTTGTGCG AGGAGGCGAT CCGGCTGGCG
CGGCTGCTGC TGCGGCTGTT TCCGTCCGAG CCCGAGATCA TGGGGCTCAC TGCTTTGATG
CTGCTGCAGC ATGCGCGCGC GCCGGCGCGC TTCGATGCCC ATGGCGAGAT CGTGCTGCTC
GACGAGCAGG ACCGCGGCCT GTGGGACACA AAGCTGATCG CCGAGGGCCT GGCGCTGATC
GACAAGGCGA TGCGTCATCG CCGCACCGGC GCGTATCAGA TCCAGGCCGC GATCGCCGCC
CTGCACGCCC GTGCGACCCG GCCCGAGGAT ACCGACTGGG CGCAGATCGA TCTGCTGTAC
GGCTCGCTGG AGATCCTGCA GCCGTCGCCG GTGATCACGC TCAACCGCGC GGTCGCGGTG
TCCAAAGTGC GCGGCGCCGA GGCGGCGCTG GCGATGATCG CACCGCTGGA AGAGAGGTTG
TCGAACTACT TCCATTATTT CGGCACCAGG GGCGCGCTGC TGCTGCAGCT GGGTCGCCGT
GACGAGGCGC GGACCGCCTT CGACCGCGCC ATCGCGCTGG CCCGGACCAC CGCCGAGGCC
AACCACATCC GCATGCATCT CGACCGCTCG AAGCGCGACG ACGCGGCCGA GCGGATCAAT
CCGTAG
 
Protein sequence
MTDLAWIDTA ITASRPQAIG ALLRYFRDLD TAEEAFQNAC LRALKAWPQN GPPRDPAAWL 
ILVGRNVAID DLRRGKKQQP LPDDEAISDL DDAESALAER LDGSHYRDDI LRLLFICCHP
ELPPTQQIAL ALRIVSGLTV PQIARAFLVS DAAMEQRITR AKAKVARARV PFETPGAPER
SERLGAVAAM IYLVFNEGYS ASGDTAGIRA PLCEEAIRLA RLLLRLFPSE PEIMGLTALM
LLQHARAPAR FDAHGEIVLL DEQDRGLWDT KLIAEGLALI DKAMRHRRTG AYQIQAAIAA
LHARATRPED TDWAQIDLLY GSLEILQPSP VITLNRAVAV SKVRGAEAAL AMIAPLEERL
SNYFHYFGTR GALLLQLGRR DEARTAFDRA IALARTTAEA NHIRMHLDRS KRDDAAERIN
P