Gene EcHS_A3913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3913 
SymbolrecF 
ID5591870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3907319 
End bp3908392 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content55% 
IMG OID640923021 
Productrecombination protein F 
Protein accessionYP_001460498 
Protein GI157163180 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.00221081 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTCA CCCGCTTGTT GATCCGCGAT TTCCGCAACA TTGAAACCGC GGATCTCGCC 
TTATCTCCCG GCTTTAACTT TCTGGTAGGT GCCAACGGCA GTGGCAAAAC CAGCGTGCTG
GAAGCCATCT ATACGCTCGG CCATGGTCGG GCGTTTCGCA GTTTGCAGAT TGGTCGCGTC
ATTCGCCATG AGCAGGAGGC GTTTGTTCTC CACGGGCGAT TACAGGGCGA AGAGCGCGAG
ACAGCGATTG GCTTAACCAA AGACAAACAG GGCGACAGCA AAGTCCGCAT CGACGGTACA
GACGGGCATA AGGTCGCGGA ACTGGCGCAC CTGATGCCAA TGCAGTTGAT AACGCCAGAA
GGGTTTACTT TACTCAACGG CGGCCCCAAA TACAGAAGAG CATTCCTCGA CTGGGGATGC
TTTCACAACG AACCCGGATT TTTCACCGCC TGGAGCAATC TCAAGCGATT GCTCAAGCAG
CGCAATGCGG CGCTGCGCCA GGTGACACGT TACGAACAGC TACGCCCGTG GGATAAAGAG
CTGATCCCGC TGGCGGAGCA AATCAGCACC TGGCGCGCGG AGTATAGCGC CGGTATCGCG
GCTGATATGG CCGATACTTG TAAGCAATTT CTCCCTGAGT TTTCTCTGAC TTTCTCTTTC
CAGCGCGGCT GGGAGAAAGA GACAGAATAT GCTGAGGTGC TGGAACGTAA TTTTGAACGC
GATCGCCAGC TAACCTACAC CGCGCATGGC CCGCATAAAG CGGACTTACG CATTCGCGCC
GACGGTGCGC CGGTGGAAGA TACCTTATCG CGTGGGCAGC TTAAGCTGTT GATGTGCGCC
TTACGTCTGG CGCAAGGAGA GTTCCTCACC CGTGAAAGCG GGCGGCGGTG TCTCTACCTG
ATAGATGATT TTGCCTCTGA GCTTGATGAT GAGCGTCGCG GGCTGCTTGC CAGCCGCTTA
AAAGCGACGC AATCACAGGT CTTTGTCAGC GCGATCAGTG CTGAACACGT TATAGACATG
TCGGACGAAA ATTCGAAGAT GTTTACCGTG GAAAAGGGTA AAATAACGGA TTAA
 
Protein sequence
MSLTRLLIRD FRNIETADLA LSPGFNFLVG ANGSGKTSVL EAIYTLGHGR AFRSLQIGRV 
IRHEQEAFVL HGRLQGEERE TAIGLTKDKQ GDSKVRIDGT DGHKVAELAH LMPMQLITPE
GFTLLNGGPK YRRAFLDWGC FHNEPGFFTA WSNLKRLLKQ RNAALRQVTR YEQLRPWDKE
LIPLAEQIST WRAEYSAGIA ADMADTCKQF LPEFSLTFSF QRGWEKETEY AEVLERNFER
DRQLTYTAHG PHKADLRIRA DGAPVEDTLS RGQLKLLMCA LRLAQGEFLT RESGRRCLYL
IDDFASELDD ERRGLLASRL KATQSQVFVS AISAEHVIDM SDENSKMFTV EKGKITD