Gene ECH74115_5128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5128 
SymbolrecF 
ID6967459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4769485 
End bp4770558 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content55% 
IMG OID643388800 
Productrecombination protein F 
Protein accessionYP_002273226 
Protein GI209397495 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.010515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.212432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCA CCCGCTTGTT GATCCGCGAT TTCCGCAACA TTGAAACCGC GGATCTCGCT 
TTATCTCCCG GCTTTAACTT TCTGGTAGGT GCCAACGGCA GTGGCAAAAC CAGCGTGCTG
GAAGCCATCT ATACGCTCGG CCATGGTCGG GCGTTTCGCA GTTTGCAGAT TGGTCGCGTC
ATTCGCCACG AGCAGGAGGC GTTTGTTCTC CACGGGCGAT TACAGGGCGA AGAGCGCGAG
ACGGCGATTG GCTTAACCAA AGACAAACAG GGCGACAGCA AAGTCCGCAT CGACGGTACT
GACGGGCATA AAGTCGCGGA ACTGGCGCAC CTGATGCCAA TGCAGCTGAT AACGCCAGAA
GGGTTTACTT TACTCAACGG CGGCCCCAAA TACAGAAGAG CATTCCTCGA CTGGGGATGC
TTTCACAACG AACCCGGATT TTTCACCGCC TGGAGCAATC TCAAGCGATT GCTCAAGCAG
CGCAATGCGG CGCTGCGCCA GGTGACACGT TACGAACAGC TACGCCCGTG GGATAAAGAA
CTGATCCCGC TGGCGGAGCA AATCAGTACC TGGCGCGCGG AGTATAGCGC CGGTATCGCG
GCCGATATGG CTGATACCTG TAAGCAATTT CTCCCTGAGT TTTCTCTGAC TTTCTCTTTC
CAGCGCGGCT GGGAGAAAGA GACAGAATAT GCTGAGGTGC TGGAACGTAA TTTTGAACGC
GATCGCCAGC TAACCTACAC CGCGCATGGC CCGCATAAAG CGGACTTACG CATTCGCGCC
GACGGTGCGC CGGTGGAAGA TACCTTATCG CGTGGGCAGC TTAAGCTGTT GATGTGCGCC
TTACGTCTGG CGCAAGGAGA GTTCCTCACC CGTGAAAGCG GGCGGCGGTG TCTCTACCTG
ATAGATGATT TTGCCTCTGA GCTTGATGAT GAGCGTCGCG GGCTGCTTGC CAGCCGCTTA
AAAGCTACGC AATCGCAGGT CTTTGTCAGC GCGATCAGTG CTGAACACGT TATAGACATG
TCGGACGAAA ATTCGAAGAT GTTTACCGTG GAAAAGGGTA AAATAACGGA TTAA
 
Protein sequence
MSLTRLLIRD FRNIETADLA LSPGFNFLVG ANGSGKTSVL EAIYTLGHGR AFRSLQIGRV 
IRHEQEAFVL HGRLQGEERE TAIGLTKDKQ GDSKVRIDGT DGHKVAELAH LMPMQLITPE
GFTLLNGGPK YRRAFLDWGC FHNEPGFFTA WSNLKRLLKQ RNAALRQVTR YEQLRPWDKE
LIPLAEQIST WRAEYSAGIA ADMADTCKQF LPEFSLTFSF QRGWEKETEY AEVLERNFER
DRQLTYTAHG PHKADLRIRA DGAPVEDTLS RGQLKLLMCA LRLAQGEFLT RESGRRCLYL
IDDFASELDD ERRGLLASRL KATQSQVFVS AISAEHVIDM SDENSKMFTV EKGKITD