Gene RPB_3125 details

Gene Information

Locus tag: RPB_3125
Symbol:
ID: 3910926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3558855
End bp: 3559949
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 59%
IMG OID: 637885027
Product: Phage integrase
Protein accession: YP_486732
Protein GI: 86750236
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
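
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length, assuming an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


# Values taken from the record for RPB_3125.
length = gene_length_bp(3558855, 3559949)
print(length)                     # 1095
print(protein_length_aa(length))  # 364
```

Both results match the Gene Length (1095 bp) and Protein Length (364 aa) fields of the record.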


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACCT TGATCAAGCG CGGAAGAAAT TGGACTGCCA CGGCGCGCCT GCCGGATACT 
TTTCAGGGGC CGGCAGCCTC GAAATCGATC AGCAAAACGT TCCAAAAGAA AGATGCCGCA
AAGCTCTGGC TGAGTTCGAC AGAGACGGCC ATGAAGCTTG GCAACTGGCG TGATCCTCGG
CTCGAACTCA AGGCCTATGG ACCGCTGGGG TGGCCTGAGA AGCGCTTTGC GGAGGCGCTG
GATGACTACC GGATCAAGGT GACGCCCGAA AAGAAGGGTG CATCTCAGGA AGTTGCAATG
CTTCAGATGT TGGCCCGGGA GGAATTCGCC CAAAAGCGCA TCCGGGATCT GATGGTTTCC
GACTTTGCTG ACTTCCGCGA TGCGCGCGAC AAGGCAGGGA AGTCAGCGTC AACGATCAGG
AACAACCTCA ACACGATTTC GGCCGTCTAC GAATGGCTGA TTCACGAAAA GGCCGTCGAC
ATCGCCAATC CGATCGCTTC GCTTCGGAAG CGCCGACGCG GCGTGCCGCA GCCCACGGGG
CACCGTGAAC GGCGGCTCCT GGACGGGGAG GAAGAGGCCA TTGCGCAGGC AATTGAAGAG
CTTCCCGTTG ATCCGACGCG TCGGCAATGG CGCGCCCTTT TCCCACTTCT CCTCGACACC
GGAATGCGGC TTGGAGAGGC AATCTCTATC GAATGCGCAT GGATTCGTCG GGATTATGGA
TTTGTTTTCA TTCCTGACTC AAAGAACGGC AGCGTGCGGC ACGTCGCGCT GTCGGACCGC
GCTTACGCCG GTTTGCTCGA GCTGACAGAC GGAGAGCCCG CCGACAGCAA GGTCTTCCGG
TTCACGCCGT GGGTCGCCAA AGATGCCTGG CGGAATGAGA TCAGGGTCAG GGCAGGGTGT
CAGGACTTGA GAATTCACGA TCTCCGGCAT GAAGCGTTGT CGAGGATGGC AGCGAGGGGA
GCTGAGCTGA AAACACTGAT GCGGCAAAGC GGCCACAAGA CGGTGGCAGT GCTCATGAGG
TATTTGAATC CGACGCCTGC TGAGCAGAGG GCGCGGTTGT TTCCGGCCAA ACAGAATGTC
GAACTGGGAC AATGA
 
Protein sequence
MSTLIKRGRN WTATARLPDT FQGPAASKSI SKTFQKKDAA KLWLSSTETA MKLGNWRDPR 
LELKAYGPLG WPEKRFAEAL DDYRIKVTPE KKGASQEVAM LQMLAREEFA QKRIRDLMVS
DFADFRDARD KAGKSASTIR NNLNTISAVY EWLIHEKAVD IANPIASLRK RRRGVPQPTG
HRERRLLDGE EEAIAQAIEE LPVDPTRRQW RALFPLLLDT GMRLGEAISI ECAWIRRDYG
FVFIPDSKNG SVRHVALSDR AYAGLLELTD GEPADSKVFR FTPWVAKDAW RNEIRVRAGC
QDLRIHDLRH EALSRMAARG AELKTLMRQS GHKTVAVLMR YLNPTPAEQR ARLFPAKQNV
ELGQ