Gene RPB_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0004 
SymbolrecF 
ID3910209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4098 
End bp5234 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content72% 
IMG OID637881885 
Productrecombination protein F 
Protein accessionYP_483627 
Protein GI86747131 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.949729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCT CCCGCATCAC CCGGCTGACG TTGACGCATT TCCGCAATTA TCGGGCGGCG 
GTGCTGACGA CGAGTGCCGA GCGCGTGGTG CTGGTCGGCG CCAACGGCGC CGGCAAGACC
AATTGCCTCG AGGCGATCTC GTTTCTGTCG CCGGGGCGGG GGTTGCGGCG GGCGACGCTG
GACGACGTCG CCGACAATGA GGGCGACGGC TCCTGGGCGG TGGCGGCGGA GGTCGAGGGC
GCGCTCGGGC TGGCGACGCT CGGCACCGGG ATCGATCCGC CGCGCGCCGA CGCCGCGACC
TCGCGGCGCT GCCGGATCGA CCGCGAGCCG GTCGGCTCCG CCACCGCATT CGGCGATCAC
TTACGCATGG TGTGGCTGAC GCCGGCGATG GACGGGCTGT TCATGGGCGC GGCCTCGGAA
CGGCGGCGGT TCTTCGACCG GCTGGTGCTG GCGATCGACA GCCAGCATTC GGGCCGGGTC
TCGGCGCTGG ACCGCAGCCT CAGATCGCGG AACCGCCTGC TGGAGGTACG TTACCCCGAC
GCGCATTGGC TCGATGCGAT CGAGCGCGAA ACCGCCGAGC TCGCGGTCGC GGTCGCGGCG
ATGCGCGGCC AGACCGCGAT GCGCCTCGCC GCGATGCTCG ACGCCCGCGG CGCGGCATCG
GCGTTTCCGT CGGCGAAGAT CATGCTCGAC GGCTGGATGG AGAGCGCGCT GCTCACCGAA
CCCGCCACGG CGGTGGAAGA TCGCTACCGC ACCATCCTGC GCGAGGGCCG CCCGCGCGAC
GCCGCCGCCG GCCGCACCCT CGACGGCCCG CATCTGACCG ACCTCGAAGT CGTCTACGCG
CCGAAGGCGA TGCCGGCGCG CGACGCCTCC ACCGGCGAAC AGAAGGCGCT GCTGATCGGG
CTCGTCCTCG CGCATGCGCA GCTCGTCTCG GAGATGACCG GCATCACGCC GCTGCTGCTG
CTCGACGAGG TGGTGGCGCA TCTCGACCCG TCGCGGCGCG CCGCGCTGTT CGAGGAATTG
GCGAAGCTCG GCGCCCAGGT CTGGATGACC GGCGCCGACC CCGCAGCGTT CGCCGAGATC
GGTTCCGGCG CCGAGATATT CACCGTCGAA TCCGGCCGGA TCAGGCCGCA ACAATGA
 
Protein sequence
MTASRITRLT LTHFRNYRAA VLTTSAERVV LVGANGAGKT NCLEAISFLS PGRGLRRATL 
DDVADNEGDG SWAVAAEVEG ALGLATLGTG IDPPRADAAT SRRCRIDREP VGSATAFGDH
LRMVWLTPAM DGLFMGAASE RRRFFDRLVL AIDSQHSGRV SALDRSLRSR NRLLEVRYPD
AHWLDAIERE TAELAVAVAA MRGQTAMRLA AMLDARGAAS AFPSAKIMLD GWMESALLTE
PATAVEDRYR TILREGRPRD AAAGRTLDGP HLTDLEVVYA PKAMPARDAS TGEQKALLIG
LVLAHAQLVS EMTGITPLLL LDEVVAHLDP SRRAALFEEL AKLGAQVWMT GADPAAFAEI
GSGAEIFTVE SGRIRPQQ