Gene EcSMS35_2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2012 
Symbolmfd 
ID6142779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2032545 
End bp2035991 
Gene Length3447 bp 
Protein Length1148 aa 
Translation table11 
GC content55% 
IMG OID641616888 
Producttranscription-repair coupling factor 
Protein accessionYP_001744064 
Protein GI170682845 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00267728 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGAAC AATATCGTTA TACGCTGCCC GTCAAAGCGG GTGAGCAGCG TCTGCTGGGC 
GAGTTAACTG GCGCTGCCTG CGCGACGCTG GTAGCGGAAA TTGCCGAACG TCACGCCGGA
CCAGTGGTGC TTATTGCGCC AGATATGCAA AATGCTCTGC GTTTGCATGA TGAAATTAGC
CAGTTTACCG ACCAGATGGT GATGAATCTG GCGGACTGGG AAACTCTTCC CTACGACAGT
TTTTCGCCTC ATCAGGACAT TATCTCCTCG CGCCTTTCCA CCCTTTACCA GCTACCGACG
ATGCAGCGTG GCGTACTGAT TGTTCCGGTG AATACGCTTA TGCAGCGCGT TTGCCCGCAC
AGTTTTCTCC ACGGCCATGC GCTGGTGATG AAAAAAGGTC AGCGCCTGTC ACGAGATGCA
TTACGAACCC AACTGGACAG CGCCGGTTAT CGCCATGTTG ACCAGGTGAT GGAGCACGGC
GAATACGCCA CGCGCGGCGC GTTGCTGGAT CTCTTCCCGA TGGGGAGTGA GCTGCCTTAT
CGTCTTGATT TCTTTGATGA TGAAATCGAC AGCCTGCGGG TGTTTGACGT CGACAGCCAG
CGCACGCTGG AGGAAGTAGA AGCGATCAAT CTGCTGCCCG CGCACGAATT TCCGACCGAT
AAAGCGGCAA TTGAACTGTT CCGCAGCCAG TGGCGCGATA CCTTCGAAGT GAAGCGCGAT
CCGGAACATA TTTACCAGCA AGTGAGTAAA GGCACATTAC CTGCCGGGAT CGAGTACTGG
CAGCCGCTAT TCTTCAGCGA ACCACTGCCG CCGCTGTTCA GTTATTTCCC TGCCAATACC
TTGCTGGTGA ATACTGGCGA TCTGGAAAAC AGTGCCGAAC GTTTCCAGGC TGACACGCTG
GCGCGTTTTG AGAATCGCGG CGTCGATCCG ATGCGCCCAC TGTTGCCACC ACAATCGCTC
TGGCTGCGGG TGGACGAGCT CTTCTCAGAG CTGAAAAACT GGCCCCGAGT GCAGCTAAAA
ACTGAACATT TACCGACAAA AGCCGCGAAT GCCAATTTAG GTTTCCAGAA ACTGCCAGAC
CTGGCCGTTC AGGCACAACA AAAAGCACCG CTGGATGCGC TGCGTAAGTT CCTCGAGTCT
TTCGACGGTC CGGTGGTGTT CTCGGTAGAA AGTGAAGGTC GCCGTGAAGC GCTGGGTGAA
CTGCTCGCGC GAATTAAAAT TGCTCCTCAA CGCATTATGC GTCTTGATGA AGCCAAAGAC
CGTGGGCGTT ATCTGATGAT TGGCGCTGCC GAACATGGTT TTGTCGATAC GGTGCGTAAT
CTGGCGCTGA TTTGCGAAAG CGATCTGCTC GGTGAACGTG TTGCCCGTCG TCGTCAGGAC
TCTCGCCGCA CCATCAACCC CGATACACTG ATCCGTAACC TCGCGGAGCT GCATATTGGT
CAGCCGGTGG TCCATCTGGA GCACGGTGTC GGCCGCTATG CCGGAATGAC CACGCTCGAA
GCGGGCGGCA TTACTGGCGA GTATTTGATG CTCACCTATG CCAACGACGC CAAACTGTAT
GTTCCGGTGT CGTCACTGCA TCTGATTAGC CGTTACGCGG GTGGCGCGGA AGAAAACGCC
CCGCTGCATA AACTTGGCGG CGATGCGTGG TCACGCGCTC GGCAGAAAGC GGCGGAAAAA
GTGCGTGATG TGGCAGCAGA ATTGCTGGAT ATCTACGCGC AACGAGCCGC CAAAGAGGGC
TTCGCGTTTA AACACGATCG TGAGCAGTAT CAGTTGTTCT GCGACAGCTT CCCGTTTGAA
ACCACGCCGG ATCAGGCACA GGCCATTAAT GCGGTACTTA GCGACATGTG TCAGCCGCTG
GCAATGGATC GTCTGGTGTG CGGCGATGTT GGCTTTGGTA AAACAGAAGT GGCGATGCGC
GCCGCTTTCC TGGCAGTAGA TAACCACAAG CAGGTGGCGG TGCTGGTGCC TACCACCCTT
CTCGCGCAGC AGCATTACGA CAACTTCCGC GACCGTTTCG CTAACTGGCC AGTACGCATC
GAAATGATCT CCCGTTTCCG TAGCGCCAAA GAGCAGACGC AAATCCTTGC GGAAGTGGCG
GAAGGGAAAA TCGATATTCT GATCGGTACG CACAAACTGC TGCAAAATGA CGTCAAGTTT
AAAGATTTAG GTCTGCTGAT TGTCGATGAA GAACACCGCT TCGGGGTGCG TCATAAAGAG
CGCATTAAAG CGATGCGGGC GAACGTGGAT ATTCTGACAC TTACCGCAAC GCCGATCCCA
CGTACGCTGA ATATGGCAAT GAGCGGAATG CGTGACCTGT CGATTATCGC CACGCCGCCC
GCCCGTCGTC TGGCAGTTAA AACCTTTGTC CGTGAGTATG ACAGCCTGGT GGTCAGGGAG
GCGATCCTGC GTGAAATTTT GCGCGGGGGG CAGGTTTATT ATCTTTACAA TGATGTGGAA
AACATCCAGA AAGCCGCCGA ACGGCTGGCA GAACTGGTGC CAGAAGCGCG GATCGCCATC
GGTCACGGGC AAATGCGTGA GCGCGAACTG GAACGGGTGA TGAATGATTT CCATCATCAA
CGTTTCAACG TGCTGGTTTG TACCACCATT ATCGAAACCG GGATCGACAT CCCGACAGCC
AACACCATTA TCATTGAACG CGCGGATCAC TTCGGTCTGG CGCAGCTGCA CCAGTTACGC
GGTCGCGTCG GACGTTCGCA TCATCAGGCA TATGCATGGT TGCTGACACC GCATCCAAAA
GCGATGACTA CCGATGCACA AAAACGTCTT GAAGCGATTG CCTCGCTGGA AGATCTCGGT
GCAGGTTTTG CGCTGGCAAC GCACGATCTG GAGATCCGCG GCGCGGGTGA ACTGCTTGGC
GAAGAACAAA GCGGCTCAAT GGAAACCATC GGTTTCTCGC TGTATATGGA GTTGCTGGAA
AACGCCGTCG ATGCGCTGAA AGCCGGACGC GAGCCGTCGC TGGAAGATCT CACCAGCCAG
CAAACAGAAG TCGAGCTGCG GATACCGTCG CTATTGCCAG ATGATTTCAT TCCTGACGTG
AATACGCGTT TGTCGTTCTA TAAACGTATT GCCAGCGCCA AAACGGAAAA CGAACTGGAA
GAGATCAAAG TCGAGCTTAT CGATCGCTTC GGCCTGCTGC CGGATCCGGC GCGTACCCTG
CTGGATATTG CCCGTCTGCG CCAGCAAGCG CAGAAACTGG GGATCAGGAA GCTGGAAGGT
AATGAGAAAG GCGGCGTGAT CGAATTTGCC GAGAAGAATC ACGTTAATCC GGCCTGGTTG
ATTGGTTTGC TGCAAAAACA GCCGCAGCAT TATCGCCTCG ATGGCCCGAC GCGCCTGAAG
TTTATTCAGG ATTTGAGTGA GCGGAAAACG CGTATCGAAT GGGTACGCCA GTTTATGCGT
GAACTGGAAG AGAACGCGAT CGCTTGA
 
Protein sequence
MPEQYRYTLP VKAGEQRLLG ELTGAACATL VAEIAERHAG PVVLIAPDMQ NALRLHDEIS 
QFTDQMVMNL ADWETLPYDS FSPHQDIISS RLSTLYQLPT MQRGVLIVPV NTLMQRVCPH
SFLHGHALVM KKGQRLSRDA LRTQLDSAGY RHVDQVMEHG EYATRGALLD LFPMGSELPY
RLDFFDDEID SLRVFDVDSQ RTLEEVEAIN LLPAHEFPTD KAAIELFRSQ WRDTFEVKRD
PEHIYQQVSK GTLPAGIEYW QPLFFSEPLP PLFSYFPANT LLVNTGDLEN SAERFQADTL
ARFENRGVDP MRPLLPPQSL WLRVDELFSE LKNWPRVQLK TEHLPTKAAN ANLGFQKLPD
LAVQAQQKAP LDALRKFLES FDGPVVFSVE SEGRREALGE LLARIKIAPQ RIMRLDEAKD
RGRYLMIGAA EHGFVDTVRN LALICESDLL GERVARRRQD SRRTINPDTL IRNLAELHIG
QPVVHLEHGV GRYAGMTTLE AGGITGEYLM LTYANDAKLY VPVSSLHLIS RYAGGAEENA
PLHKLGGDAW SRARQKAAEK VRDVAAELLD IYAQRAAKEG FAFKHDREQY QLFCDSFPFE
TTPDQAQAIN AVLSDMCQPL AMDRLVCGDV GFGKTEVAMR AAFLAVDNHK QVAVLVPTTL
LAQQHYDNFR DRFANWPVRI EMISRFRSAK EQTQILAEVA EGKIDILIGT HKLLQNDVKF
KDLGLLIVDE EHRFGVRHKE RIKAMRANVD ILTLTATPIP RTLNMAMSGM RDLSIIATPP
ARRLAVKTFV REYDSLVVRE AILREILRGG QVYYLYNDVE NIQKAAERLA ELVPEARIAI
GHGQMREREL ERVMNDFHHQ RFNVLVCTTI IETGIDIPTA NTIIIERADH FGLAQLHQLR
GRVGRSHHQA YAWLLTPHPK AMTTDAQKRL EAIASLEDLG AGFALATHDL EIRGAGELLG
EEQSGSMETI GFSLYMELLE NAVDALKAGR EPSLEDLTSQ QTEVELRIPS LLPDDFIPDV
NTRLSFYKRI ASAKTENELE EIKVELIDRF GLLPDPARTL LDIARLRQQA QKLGIRKLEG
NEKGGVIEFA EKNHVNPAWL IGLLQKQPQH YRLDGPTRLK FIQDLSERKT RIEWVRQFMR
ELEENAIA