Gene ECH74115_1494 details


Gene Information

Locus tag: ECH74115_1494
Symbol: mfd
ID: 6969887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1473234
End bp: 1476680
Gene Length: 3447 bp
Protein Length: 1148 aa
Translation table: 11
GC content: 55%
IMG OID: 643385465
Product: transcription-repair coupling factor
Protein accession: YP_002269959
Protein GI: 209399391
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1197] Transcription-repair coupling factor (superfamily II helicase)
TIGRFAM ID: [TIGR00580] transcription-repair coupling factor (mfd)
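
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1473234
end_bp = 1476680

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 3447
print(remainder)       # 0
print(protein_length)  # 1148
```

Under these assumptions the computed values match the reported Gene Length (3447 bp) and Protein Length (1148 aa).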


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.44461
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.168665
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAC AATATCGTTA TACGCTGCCC GTCAAAGCGG GTGAGCAGCG TCTGCTGGGC 
GAGTTAACCG GCGCTGCCTG CGCGACGCTG GTAGCGGAAA TTGCCGAACG TCACGCCGGA
CCGGTGGTGC TTATTGCGCC AGATATGCAA AATGCCCTGC GTTTGCATGA TGAAATTAGC
CAGTTTACCG ACCAGATGGT GATGAATCTG GCGGACTGGG AAACCCTCCC TTACGACAGT
TTTTCGCCTC ATCAGGACAT TATCTCCTCG CGCCTTTCCA CCCTTTACCA GCTACCGACA
ATGCAGCGTG GCGTACTGAT TGTTCCGGTG AATACACTTA TGCAGCGCGT TTGCCCGCAC
AGTTTTCTCC ACGGTCATGC GCTGGTGATG GAAAAAGGTC AGCGCCTGTC ACGAGATGCA
TTACGAACCC AACTGGATAG CGCCGGTTAT CGCCATGTTG ACCAGGTGAT GGAGCACGGC
GAATACGCCA CGCGCGGCGC GTTGCTGGAT CTCTTCCCGA TGGGGAGTGA GTTGCCTTAT
CGTCTTGATT TCTTTGATGA TGAAATCGAC AGTCTGCGGG TGTTTGACGT CGACAGTCAG
CGCACGCTGG AGGAAGTAGA AGCGATCAAC CTGCTGCCTG CGCACGAATT TCCGACCGAT
AAAGCGGCGA TTGAACTGTT CCGCAGCCAG TGGCGCGATA CCTTCGAAGT GAAGCGCGAT
CCGGAACATA TTTACCAGCA AGTGAGTAAA GGCACATTAC CTGCCGGGAT CGAGTACTGG
CAGCCATTGT TCTTCAGCGA ACCACTGCCG CCGCTGTTCA GTTATTTCCC TGCCAATACC
TTGCTGGTGA ATACTGGCGA TCTGGAAAAC AGTGCCGAAC GTTTCCAGGC TGACACGCTG
GCGCGTTTTG AGAATCGCGG CGTCGATCCG ATGCGCCCAC TGTTGCCACC ACAATCGCTC
TGGCTGCGGG TGGACGAGCT CTTCTCAGAG CTGAAAAACT GGCCCCGAGT GCAGCTAAAA
ACTGAACATT TACCGACAAA AGCCGCGAAT GCCAATTTAG GTTTCCAGAA ACTGCCAGAC
CTGGCCGTTC AGGCACAACA AAAAGCGCCG CTGGATGCGC TGCGTAAGTT CCTCGAGTCT
TTCGACGGTC CGGTGGTGTT CTCGGTAGAA AGTGAAGGCC GCCGTGAAGC GCTGGGTGAA
CTGCTCGCGC GAATTAAAAT TGCTCCGCAA CGCATTATGC GTCTTGATGA AGCCAGCGAC
CGTGGGCGTT ATCTGATGAT TGGCGCTGCA GAGCATGGTT TTGTCGATAC GGTACGTAAT
CTGGCGCTGA TTTGCGAAAG CGATCTGCTC GGTGAACGCG TTGCCCGTCG TCGTCAGGAT
TCTCGCCGCA CCATCAACCC CGATACACTG ATCCGTAACC TCGCGGAACT GCATATTGGT
CAGCCGGTGG TCCATCTGGA GCACGGCGTC GGGCGCTATG CCGGAATGAC CACGCTGGAA
GCAGGCGGCA TTACCGGCGA GTATCTGATG CTCACCTATG CCAACGACGC CAAATTGTAT
GTTCCGGTGT CGTCGCTGCA TCTGATTAGC CGTTACGCGG GTGGTGCGGA AGAAAATGCC
CCGCTGCATA AACTTGGCGG CGATGCGTGG TCACGCGCGC GGCAGAAAGC GGCGGAAAAA
GTGCGTGATG TGGCGGCGGA ATTGCTGGAT ATCTACGCGC AACGCGCCGC CAAAGAGGGC
TTCGCGTTTA AACACGATCG TGAGCAGTAT CAGTTGTTCT GCGACAGCTT CCCGTTTGAA
ACCACGCCGG ATCAGGCGCA GGCCATTAAT GCGGTACTTA GCGACATGTG TCAGCCGCTG
GCAATGGATC GTCTGGTGTG CGGCGATGTT GGCTTTGGTA AAACAGAAGT GGCGATGCGC
GCCGCTTTCC TGGCAGTAGA TAACCACAAG CAGGTGGCGG TGCTGGTGCC TACCACCCTT
CTCGCGCAGC AGCATTACGA CAACTTCCGC GACCGTTTCG CCAACTGGCC GGTACGTATC
GAAATGCTCT CCCGTTTCCG CAGCGCCAAA GAGCAGACGC AAATCCTTGC GGAAGTGGCG
GAAGGGAAAA TCGATATTCT GATCGGTACG CACAAACTGC TGCAAAGTGA CGTCAAGTTT
AAAGATTTAG GCCTGCTGAT TGTCGATGAA GAACACCGCT TCGGGGTGCG TCATAAAGAG
CGCATTAAAG CGATGCGCGC GAACGTGGAT ATTCTGACGC TTACTGCAAC GCCGATCCCA
CGTACGCTGA ATATGGCAAT GAGCGGAATG CGTGACCTAT CGATTATCGC CACGCCGCCC
GCCCGTCGTC TGGCAGTTAA AACCTTTGTC CGTGAGTATG ACAGCCTGGT GGTCCGCGAG
GCGATCCTGC GTGAAATTTT GCGCGGAGGA CAGGTTTATT ATCTCTACAA TGATGTGGAA
AACATTCAGA AAGCCGCCGA ACGGCTGGCA GAACTGGTGC CAGAAGCGCG GATCGCCATC
GGTCACGGGC AGATGCGCGA GCGCGAACTG GAACGGGTGA TGAATGATTT CCATCATCAA
CGTTTCAACG TGCTGGTTTG TACAACCATT ATCGAAACCG GGATCGACAT CCCGACAGCC
AACACCATTA TCATTGAACG CGCGGATCAC TTCGGTCTGG CGCAACTGCA CCAGTTACGC
GGTCGCGTCG GACGTTCGCA TCATCAGGCA TATGCATGGC TGCTGACGCC GCATCCAAAA
GCGATGACTA CCGATGCACA AAAACGTCTT GAAGCGATTG CCTCGCTGGA AGATCTCGGG
GCAGGTTTTG CGCTGGCAAC GCACGATCTG GAGATTCGCG GCGCGGGTGA ACTGCTTGGC
GAAGAACAAA GCGGCTCAAT GGAAACCATC GGTTTCTCGC TGTATATGGA GTTGCTGGAA
AACGCCGTCG ATGCACTGAA AGCCGGACGC GAGCCGTCGC TGGAAGATCT CACCAGCCAG
CAAACAGAAG TCGAGCTGCG GATGCCGTCG CTATTGCCAG ATGATTTCAT TCCTGACGTG
AATACGCGTC TGTCGTTCTA TAAACGTATT GCCAGCGCCA AAACGGAAAA CGAACTGGAA
GAGATCAAAG TCGAGCTTAT CGATCGCTTC GGCCTGCTGC CGGATCCGGC GCGTACCCTG
CTGGATATTG CCCGTCTACG CCAGCAAGCG CAGAAACTGG GGATCAGGAA GCTGGAAGGT
AATGAGAAAG GCGGCGTGAT CGAATTTGCC GAGAAGAATC ACGTTAATCC GGCCTGGTTG
ATTGGTTTGC TGCAAAAACA GCCGCAGCAT TACCGCCTTG ATGGTCCGAC GCGCCTGAAA
TTTATTCAGG ATTTGAGTGA GCGGAAAACG CGTATCGAAT GGGTACGCCA GTTTATGCGT
GAACTGGAAG AAAACGCGAT CGCTTAA
 
Protein sequence
MPEQYRYTLP VKAGEQRLLG ELTGAACATL VAEIAERHAG PVVLIAPDMQ NALRLHDEIS 
QFTDQMVMNL ADWETLPYDS FSPHQDIISS RLSTLYQLPT MQRGVLIVPV NTLMQRVCPH
SFLHGHALVM EKGQRLSRDA LRTQLDSAGY RHVDQVMEHG EYATRGALLD LFPMGSELPY
RLDFFDDEID SLRVFDVDSQ RTLEEVEAIN LLPAHEFPTD KAAIELFRSQ WRDTFEVKRD
PEHIYQQVSK GTLPAGIEYW QPLFFSEPLP PLFSYFPANT LLVNTGDLEN SAERFQADTL
ARFENRGVDP MRPLLPPQSL WLRVDELFSE LKNWPRVQLK TEHLPTKAAN ANLGFQKLPD
LAVQAQQKAP LDALRKFLES FDGPVVFSVE SEGRREALGE LLARIKIAPQ RIMRLDEASD
RGRYLMIGAA EHGFVDTVRN LALICESDLL GERVARRRQD SRRTINPDTL IRNLAELHIG
QPVVHLEHGV GRYAGMTTLE AGGITGEYLM LTYANDAKLY VPVSSLHLIS RYAGGAEENA
PLHKLGGDAW SRARQKAAEK VRDVAAELLD IYAQRAAKEG FAFKHDREQY QLFCDSFPFE
TTPDQAQAIN AVLSDMCQPL AMDRLVCGDV GFGKTEVAMR AAFLAVDNHK QVAVLVPTTL
LAQQHYDNFR DRFANWPVRI EMLSRFRSAK EQTQILAEVA EGKIDILIGT HKLLQSDVKF
KDLGLLIVDE EHRFGVRHKE RIKAMRANVD ILTLTATPIP RTLNMAMSGM RDLSIIATPP
ARRLAVKTFV REYDSLVVRE AILREILRGG QVYYLYNDVE NIQKAAERLA ELVPEARIAI
GHGQMREREL ERVMNDFHHQ RFNVLVCTTI IETGIDIPTA NTIIIERADH FGLAQLHQLR
GRVGRSHHQA YAWLLTPHPK AMTTDAQKRL EAIASLEDLG AGFALATHDL EIRGAGELLG
EEQSGSMETI GFSLYMELLE NAVDALKAGR EPSLEDLTSQ QTEVELRMPS LLPDDFIPDV
NTRLSFYKRI ASAKTENELE EIKVELIDRF GLLPDPARTL LDIARLRQQA QKLGIRKLEG
NEKGGVIEFA EKNHVNPAWL IGLLQKQPQH YRLDGPTRLK FIQDLSERKT RIEWVRQFMR
ELEENAIA
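
Both sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below, in Python, shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string (drops all whitespace)."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCTGAAC AATATCGTTA TACGCTGCCC GTCAAAGCGG GTGAGCAGCG TCTGCTGGGC"
last_line = "GAACTGGAAG AAAACGCGAT CGCTTAA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))               # 60 bases per full line
print(first.startswith("ATG"))  # True: the CDS opens with an ATG start codon
print(last.endswith("TAA"))     # True: the CDS closes with a TAA stop codon
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.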