Gene RPD_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0003 
SymbolrecF 
ID4020457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3622 
End bp4758 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content72% 
IMG OID637960179 
Productrecombination protein F 
Protein accessionYP_567144 
Protein GI91974485 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.552608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAT CCCGCATCAC CCGGCTGACG CTGACGCACT TCCGCAATTA TCGGGCGGCG 
GCGTTGCATA CGCGTGGCGA ACGGGTGGTG CTGGTGGGCG CGAACGGCGC GGGCAAGACC
AATTGTCTCG AGGCGATCTC GTTTCTGTCG CCCGGCCGCG GCCTGCGCCG CGCCACGCTC
GACGACGTCT CCGACCATCA GGGCGACGGC TCCTGGGCGG TGTCGGCCGA GGTCGAGGGC
GCGCTCGGCC TCGCCACGCT CGGCACCGGG ATCGACCCAC CGCGCGCCGA CGCCGCCACG
ACGCGGCGCT GCCGGATCGA CCGCGAGCCG GTCGGCTCCG CCACCGCCTT CGGCGATCAT
CTGCGCATGG TGTGGCTGAC GCCTGCGATG GACGGACTGT TCATGGGCGC GGCGTCGGAA
CGGCGGCGGT TTTTCGATCG CCTGGTGCTG GCGATCGACA GCCAGCATTC CAGCCGGGTC
TCGGCGCTCG ACCGCAGCCT GCGCTCGCGC AACCGGCTGC TGGAGGAACG CAACGCGGAC
CGCCACTGGC TCGACGCGAT CGAGCGCGAA ACCGCCGAAC TCGCCGTCGC GGTCGCGGCG
ATGCGCGGCC AGACCGCGGC GCGGCTCGCC GCGATGCTCG ACGCCCGCGG CGCGGCGTCG
GCGTTTCCGT CGGCGAAGAT CATGCTCGAC GGCTGGATGG AAAGCGCGCT GCTGACCGAG
CCGGCGACCG CGGTCGAGGA TCGCTACCGC GCGATCCTGC GCGATGGCCG CCTGCGCGAC
GCCGCCGCTG GCCGTACCCT CGACGGCCCG CATCTCACCG ATCTCCAGGT GATCTACGCG
CCGAAGGCGA TGCCGGCGCG CGACGCCTCC ACCGGCGAGC AGAAGGCGCT GCTGATCGGG
CTGGTGCTCG CCCATGCGCA GCTCGTCTCC GAGATCACCG GCATCACGCC GCTGCTGCTG
CTCGACGAGG TGGTGGCGCA TCTCGACCCC GCCCGCCGCC GCGCGTTGTT TGCGGAACTC
GAGCGGCTTG GCGCGCAGGT CTGGATGACC GGCGCCGATC CGGCGGGCTT CGCCGAGATC
GGCCCCGACG CCGAGATTTT CACCGTCGAG TCGGGCCGGA TCGCGCCGCA AAAATGA
 
Protein sequence
MTASRITRLT LTHFRNYRAA ALHTRGERVV LVGANGAGKT NCLEAISFLS PGRGLRRATL 
DDVSDHQGDG SWAVSAEVEG ALGLATLGTG IDPPRADAAT TRRCRIDREP VGSATAFGDH
LRMVWLTPAM DGLFMGAASE RRRFFDRLVL AIDSQHSSRV SALDRSLRSR NRLLEERNAD
RHWLDAIERE TAELAVAVAA MRGQTAARLA AMLDARGAAS AFPSAKIMLD GWMESALLTE
PATAVEDRYR AILRDGRLRD AAAGRTLDGP HLTDLQVIYA PKAMPARDAS TGEQKALLIG
LVLAHAQLVS EITGITPLLL LDEVVAHLDP ARRRALFAEL ERLGAQVWMT GADPAGFAEI
GPDAEIFTVE SGRIAPQK