Gene EcolC_0003 details


Gene Information

Locus tag: EcolC_0003
Symbol: recF
ID: 6068546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2556
End bp: 3629
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 55%
IMG OID: 641599408
Product: recombination protein F
Protein accession: YP_001723018
Protein GI: 170018064
COG category: [L] Replication, recombination and repair
COG ID: [COG1195] Recombinational DNA repair ATPase (RecF pathway)
TIGRFAM ID: [TIGR00611] recF protein
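
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (which is consistent with the values in this record) and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2556, 3629)
print(length, protein_length_aa(length))  # 1074 357
```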


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0566679
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTCA CCCGCTTGTT GATCCGCGAT TTCCGCAATA TCGAAACCGC GGATCTCGCT 
TTATCTCCCG GCTTTAACTT TCTGGTAGGT GCCAACGGCA GTGGCAAAAC CAGCGTGCTG
GAAGCCATCT ATACGCTCGG CCACGGTCGG GCGTTTCGCA GTTTGCAGAT TGGTCGCGTC
ATTCGCCACG AGCAGGAGGC GTTTGTTCTC CACGGGCGTT TACAGGGCGA AGAGCGCGAG
ACGGCGATTG GCTTAACCAA AGACAAACAG GGCGACAGCA AAGTCCGCAT CGACGGTACA
GACGGGCATA AGGTCGCAGA ACTGGCGCAC CTGATGCCAA TGCAGCTGAT AACGCCAGAA
GGGTTTACTT TACTCAACGG CGGCCCCAAA TACAGAAGAG CATTCCTCGA CTGGGGATGC
TTTCACAACG AACCTGGATT TTTCACCGCC TGGAGCAATC TCAAGCGCTT GCTCAAGCAG
CGCAATGCGG CGCTGCGCCA GGTGACTCGT TACGAACAGC TACGCCCGTG GGACAAAGAA
CTGATCCCGC TGGCGGAGCA AATCAGCACC TGGCGCGCGG AGTATAGCGC CGGTATCGCG
GCTGATATGG CTGATACCTG TAAGCAATTT CTCCCTGAGT TTTCTCTGAC TTTCTCTTTC
CAGCGCGGCT GGGAGAAAGA GACAGAATAT GCTGAGGTGC TGGAACGTAA TTTTGAACGC
GATCGCCAGC TAACCTACAC CGCGCACGGC CCGCACAAAG CAGACTTACG CATTCGCGCC
GACGGTGCGC CGGTGGAAGA TACCTTATCG CGTGGGCAGC TTAAGCTGTT GATGTGCGCC
TTACGTCTGG CGCAAGGAGA GTTCCTCACC CGTGAAAGCG GGCGGCGGTG TCTCTACCTG
ATAGATGATT TTGCCTCTGA GCTTGATGAT GAGCGTCGCG GGTTGCTTGC CAGCCGCTTA
AAAGCGACGC AATCACAGGT CTTTGTCAGC GCGATCAGTG CTGAACACGT TATAGACATG
TCGGACGAAA ATTCGAAGAT GTTTACCGTG GAAAAGGGTA AAATAACGGA TTAA
 
Protein sequence
MSLTRLLIRD FRNIETADLA LSPGFNFLVG ANGSGKTSVL EAIYTLGHGR AFRSLQIGRV 
IRHEQEAFVL HGRLQGEERE TAIGLTKDKQ GDSKVRIDGT DGHKVAELAH LMPMQLITPE
GFTLLNGGPK YRRAFLDWGC FHNEPGFFTA WSNLKRLLKQ RNAALRQVTR YEQLRPWDKE
LIPLAEQIST WRAEYSAGIA ADMADTCKQF LPEFSLTFSF QRGWEKETEY AEVLERNFER
DRQLTYTAHG PHKADLRIRA DGAPVEDTLS RGQLKLLMCA LRLAQGEFLT RESGRRCLYL
IDDFASELDD ERRGLLASRL KATQSQVFVS AISAEHVIDM SDENSKMFTV EKGKITD
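
The two listings above are related by translation, and the GC content in the gene information is a property of the gene sequence. The following is a rough sketch of how one might strip the 10-base grouping, compute the GC fraction, and translate the coding sequence. The helper names are hypothetical. The codon table is built from the standard amino-acid assignment string, which translation table 11 shares for elongation codons; alternative start codons permitted by table 11 are not handled here, which is an assumption that happens to be harmless for a gene starting with ATG.

```python
from itertools import product

# Codon order: first base varies slowest, each position cycling T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = clean_sequence("ATGTCCCTCA CCCGCTTGTT GATCCGCGAT")
print(translate(first_bases))  # MSLTRLLIRD
```

Passing the full gene sequence through `clean_sequence` and then `translate` should reproduce the protein listing, and `gc_fraction` on the same cleaned string gives the value that the record reports rounded to a whole percentage.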