Gene EcSMS35_4065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4065 
SymbolrecF 
ID6145136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4156816 
End bp4157889 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content55% 
IMG OID641618890 
Productrecombination protein F 
Protein accessionYP_001746028 
Protein GI170684195 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0837895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCA CCCGCTTGTT GATCCGCGAT TTCCGCAATA TCGAAACCGC GGATCTCGCT 
TTATCTCCCG GCTTTAACTT TCTGGTAGGT GCCAACGGTA GTGGCAAAAC CAGCGTGCTG
GAAGCCATCT ATACGCTCGG CCATGGTCGG GCGTTTCGCA GTTTGCAGAT TGGTCGCGTC
ATTCGCCACG AGCAGGAGGC GTTTGTTCTC CACGGGCGAT TACAGGGCGA AGAGCGCGAG
ACAGCGATTG GCTTAACCAA AGACAAACAG GGCGACAGCA AAGTCCGCAT CGACGGTACA
GACGGGCATA AGGTCGCGGA ACTGGCGCAC CTGATGCCAA TGCAGCTGAT AACGCCAGAA
GGGTTTACTT TACTCAACGG CGGCCCCAAA TACAGAAGAG CATTCCTCGA CTGGGGATGC
TTTCACAACG AACCCGGATT TTTCACCGCC TGGAGCAATC TCAAGCGATT GCTCAAGCAG
CGCAATGCGG CGCTGCGCCA GGTGACTCGT TACGAACAGC TACGCCCGTG GGATAAAGAA
CTGATCCCGC TGGCGGAGCA AATCAGCACC TGGCGCGCGG AGTATAGCGC CGGTATCGCA
GCCGATATGG CTGATACCTG TAAGCAATTT CTCCCTGAGT TTTCTCTGAC TTTCTCTTTC
CAGCGCGGCT GGGAGAAAGA GACAGAGTAC GCTGAGGTGC TGGAACGTAA TTTTGAACGC
GATCGCCAGC TAACCTACAC CGCGCACGGC CCGCATAAAG CGGACTTACG CATTCGCGCC
GACGGTGCGC CGGTGGAAGA TACCTTATCG CGTGGGCAGC TTAAGCTGTT GATGTGCGCC
TTACGTCTGG CGCAAGGAGA GTTCCTCACC CGTGAAAGCG GGCGGCGGTG TCTCTACCTG
ATAGATGATT TTGCCTCTGA GCTTGATGAT GAGCGTCGCG GGCTGCTTGC CAGCCGCTTA
AAAGCGACGC AATCACAGGT CTTTGTCAGC GCGATCAGTG CTGAACACGT TATAGACATG
TCGGACGAAA ATTCGAAGAT GTTTACCGTG GAAAAGGGTA AAATAACGGA TTAA
 
Protein sequence
MSLTRLLIRD FRNIETADLA LSPGFNFLVG ANGSGKTSVL EAIYTLGHGR AFRSLQIGRV 
IRHEQEAFVL HGRLQGEERE TAIGLTKDKQ GDSKVRIDGT DGHKVAELAH LMPMQLITPE
GFTLLNGGPK YRRAFLDWGC FHNEPGFFTA WSNLKRLLKQ RNAALRQVTR YEQLRPWDKE
LIPLAEQIST WRAEYSAGIA ADMADTCKQF LPEFSLTFSF QRGWEKETEY AEVLERNFER
DRQLTYTAHG PHKADLRIRA DGAPVEDTLS RGQLKLLMCA LRLAQGEFLT RESGRRCLYL
IDDFASELDD ERRGLLASRL KATQSQVFVS AISAEHVIDM SDENSKMFTV EKGKITD