Gene RPB_1080 details


Gene Information

Locus tag: RPB_1080
Symbol:
ID: 3908932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1239310
End bp: 1240470
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 66%
IMG OID: 637882973
Product: radical SAM family protein
Protein accession: YP_484701
Protein GI: 86748205
COG category: [L] Replication, recombination and repair
COG ID: [COG1533] DNA repair photolyase
TIGRFAM ID:
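
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1239310, 1240470)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results match the Gene Length and Protein Length fields of the record.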


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.981176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.301506
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGAG CATCCAATGC CCTCAAGCGC CCGCCGGTCA CGGCGCCCTC CGAGCCGGTG 
GGCGCAATCT CTCCGTTTCC TGAGATTGAA ATTGCGATCG ACAAGCAGCG GCGGCGCGGC
CGCGGCGCGC AATCCAACGA GTCGGGCCGC TACGAAGCCG AGGCGCGGGT CGCGTTCGAT
GATGGCTGGC AGAGCCTCGA CGAGCTGCCG CCGTTCAAGA CCACGGTCGC GCTCGACACC
GCGCGGAAAG TCATCACCCG CAACGAGTCG CCGGATATCG GCTTCGATCG TTCGATCAAT
CCGTATCGCG GCTGCGAGCA CGGCTGCGTC TATTGCTTCG CGCGGCCGAC CCATGCCTAT
CTCGGCCTGT CGCCGGGGCT GGATTTCGAA TCGCGGCTGT TCGCCAAGCC GGATGCGCCG
GCGCTGCTGG AGAAAGAACT CGCCGCTGCC GACTATCAGC CGCGGATGAT CGCGATCGGT
ACCAATACCG ACCCGTATCA GCCGATCGAG CGCGAGCACA AGATCATGCG GGGCGTTCTC
GAAGTGCTGG AGAAGACCGG CCATCCGGTC GGCATCGTCA CCAAATCGGC GCTGGTCACG
CGTGACATCG ACATTCTGGC GCGGATGGCG AAGCGCCAGC TCGCCAAGGT CGCGCTGTCG
GTGACATCGC TGGATCCGAA ACTGGCGCGC ACCATGGAGC CGCGCGCCTC CGCGCCTGAG
AAGCGGCTGG AAGCGCTGAA GCGGCTCTCC GAGGCCGGGA TTCCGACCAC CGTGATGGTG
GCGCCGGTGA TCCCGGCGCT CAACGATGTG GAGATCGAGC GCATCCTCGA CGCCGCCGCC
CATGCCGGCG TCAAGGAGGC CAGCTACGTG ATGCTGCGGC TGCCGCTGGA AGTGCGCGAC
CTGTTCCGCG AATGGCTGAT GGCGAACTAT CCGGATCGCT ACCGCCACGT CTTCACCCTG
ATCCGCGACA TGCGCGGCGG CCGCGACTAC GATTCGCAAT GGGGCACGCG GATGAAAGGC
ACCGGCCCGA TCGCCTGGAT GATCGGTCGC CGCTTCGAGA CCGCCTGCGC GCGGCTCGGC
CTCAACAAGC GCCGCTCGAA ATTGACGACG GATCATTTCG AAAAGCCGGA GCGGGCGGGG
CAGCAGCTGA GTTTGTTCTA G
 
Protein sequence
MSRASNALKR PPVTAPSEPV GAISPFPEIE IAIDKQRRRG RGAQSNESGR YEAEARVAFD 
DGWQSLDELP PFKTTVALDT ARKVITRNES PDIGFDRSIN PYRGCEHGCV YCFARPTHAY
LGLSPGLDFE SRLFAKPDAP ALLEKELAAA DYQPRMIAIG TNTDPYQPIE REHKIMRGVL
EVLEKTGHPV GIVTKSALVT RDIDILARMA KRQLAKVALS VTSLDPKLAR TMEPRASAPE
KRLEALKRLS EAGIPTTVMV APVIPALNDV EIERILDAAA HAGVKEASYV MLRLPLEVRD
LFREWLMANY PDRYRHVFTL IRDMRGGRDY DSQWGTRMKG TGPIAWMIGR RFETACARLG
LNKRRSKLTT DHFEKPERAG QQLSLF
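
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, not the method used by the source database; the helper names are hypothetical, and the GC calculation simply counts G and C over the total sequence length.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block by removing spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from the record above.
first_line = "ATGAGCAGAG CATCCAATGC CCTCAAGCGC CCGCCGGTCA CGGCGCCCTC CGAGCCGGTG"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Applying `clean_sequence` to the full gene sequence block should give a string whose length can be compared with the Gene Length field, and `gc_percent` can be compared with the reported GC content.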