Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2012 |
Symbol | mfd |
ID | 6142779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2032545 |
End bp | 2035991 |
Gene Length | 3447 bp |
Protein Length | 1148 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616888 |
Product | transcription-repair coupling factor |
Protein accession | YP_001744064 |
Protein GI | 170682845 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00267728 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGAAC AATATCGTTA TACGCTGCCC GTCAAAGCGG GTGAGCAGCG TCTGCTGGGC GAGTTAACTG GCGCTGCCTG CGCGACGCTG GTAGCGGAAA TTGCCGAACG TCACGCCGGA CCAGTGGTGC TTATTGCGCC AGATATGCAA AATGCTCTGC GTTTGCATGA TGAAATTAGC CAGTTTACCG ACCAGATGGT GATGAATCTG GCGGACTGGG AAACTCTTCC CTACGACAGT TTTTCGCCTC ATCAGGACAT TATCTCCTCG CGCCTTTCCA CCCTTTACCA GCTACCGACG ATGCAGCGTG GCGTACTGAT TGTTCCGGTG AATACGCTTA TGCAGCGCGT TTGCCCGCAC AGTTTTCTCC ACGGCCATGC GCTGGTGATG AAAAAAGGTC AGCGCCTGTC ACGAGATGCA TTACGAACCC AACTGGACAG CGCCGGTTAT CGCCATGTTG ACCAGGTGAT GGAGCACGGC GAATACGCCA CGCGCGGCGC GTTGCTGGAT CTCTTCCCGA TGGGGAGTGA GCTGCCTTAT CGTCTTGATT TCTTTGATGA TGAAATCGAC AGCCTGCGGG TGTTTGACGT CGACAGCCAG CGCACGCTGG AGGAAGTAGA AGCGATCAAT CTGCTGCCCG CGCACGAATT TCCGACCGAT AAAGCGGCAA TTGAACTGTT CCGCAGCCAG TGGCGCGATA CCTTCGAAGT GAAGCGCGAT CCGGAACATA TTTACCAGCA AGTGAGTAAA GGCACATTAC CTGCCGGGAT CGAGTACTGG CAGCCGCTAT TCTTCAGCGA ACCACTGCCG CCGCTGTTCA GTTATTTCCC TGCCAATACC TTGCTGGTGA ATACTGGCGA TCTGGAAAAC AGTGCCGAAC GTTTCCAGGC TGACACGCTG GCGCGTTTTG AGAATCGCGG CGTCGATCCG ATGCGCCCAC TGTTGCCACC ACAATCGCTC TGGCTGCGGG TGGACGAGCT CTTCTCAGAG CTGAAAAACT GGCCCCGAGT GCAGCTAAAA ACTGAACATT TACCGACAAA AGCCGCGAAT GCCAATTTAG GTTTCCAGAA ACTGCCAGAC CTGGCCGTTC AGGCACAACA AAAAGCACCG CTGGATGCGC TGCGTAAGTT CCTCGAGTCT TTCGACGGTC CGGTGGTGTT CTCGGTAGAA AGTGAAGGTC GCCGTGAAGC GCTGGGTGAA CTGCTCGCGC GAATTAAAAT TGCTCCTCAA CGCATTATGC GTCTTGATGA AGCCAAAGAC CGTGGGCGTT ATCTGATGAT TGGCGCTGCC GAACATGGTT TTGTCGATAC GGTGCGTAAT CTGGCGCTGA TTTGCGAAAG CGATCTGCTC GGTGAACGTG TTGCCCGTCG TCGTCAGGAC TCTCGCCGCA CCATCAACCC CGATACACTG ATCCGTAACC TCGCGGAGCT GCATATTGGT CAGCCGGTGG TCCATCTGGA GCACGGTGTC GGCCGCTATG CCGGAATGAC CACGCTCGAA GCGGGCGGCA TTACTGGCGA GTATTTGATG CTCACCTATG CCAACGACGC CAAACTGTAT GTTCCGGTGT CGTCACTGCA TCTGATTAGC CGTTACGCGG GTGGCGCGGA AGAAAACGCC CCGCTGCATA AACTTGGCGG CGATGCGTGG TCACGCGCTC GGCAGAAAGC GGCGGAAAAA GTGCGTGATG TGGCAGCAGA ATTGCTGGAT ATCTACGCGC AACGAGCCGC CAAAGAGGGC TTCGCGTTTA AACACGATCG TGAGCAGTAT CAGTTGTTCT GCGACAGCTT CCCGTTTGAA ACCACGCCGG ATCAGGCACA GGCCATTAAT GCGGTACTTA GCGACATGTG TCAGCCGCTG GCAATGGATC GTCTGGTGTG CGGCGATGTT GGCTTTGGTA AAACAGAAGT GGCGATGCGC GCCGCTTTCC TGGCAGTAGA TAACCACAAG CAGGTGGCGG TGCTGGTGCC TACCACCCTT CTCGCGCAGC AGCATTACGA CAACTTCCGC GACCGTTTCG CTAACTGGCC AGTACGCATC GAAATGATCT CCCGTTTCCG TAGCGCCAAA GAGCAGACGC AAATCCTTGC GGAAGTGGCG GAAGGGAAAA TCGATATTCT GATCGGTACG CACAAACTGC TGCAAAATGA CGTCAAGTTT AAAGATTTAG GTCTGCTGAT TGTCGATGAA GAACACCGCT TCGGGGTGCG TCATAAAGAG CGCATTAAAG CGATGCGGGC GAACGTGGAT ATTCTGACAC TTACCGCAAC GCCGATCCCA CGTACGCTGA ATATGGCAAT GAGCGGAATG CGTGACCTGT CGATTATCGC CACGCCGCCC GCCCGTCGTC TGGCAGTTAA AACCTTTGTC CGTGAGTATG ACAGCCTGGT GGTCAGGGAG GCGATCCTGC GTGAAATTTT GCGCGGGGGG CAGGTTTATT ATCTTTACAA TGATGTGGAA AACATCCAGA AAGCCGCCGA ACGGCTGGCA GAACTGGTGC CAGAAGCGCG GATCGCCATC GGTCACGGGC AAATGCGTGA GCGCGAACTG GAACGGGTGA TGAATGATTT CCATCATCAA CGTTTCAACG TGCTGGTTTG TACCACCATT ATCGAAACCG GGATCGACAT CCCGACAGCC AACACCATTA TCATTGAACG CGCGGATCAC TTCGGTCTGG CGCAGCTGCA CCAGTTACGC GGTCGCGTCG GACGTTCGCA TCATCAGGCA TATGCATGGT TGCTGACACC GCATCCAAAA GCGATGACTA CCGATGCACA AAAACGTCTT GAAGCGATTG CCTCGCTGGA AGATCTCGGT GCAGGTTTTG CGCTGGCAAC GCACGATCTG GAGATCCGCG GCGCGGGTGA ACTGCTTGGC GAAGAACAAA GCGGCTCAAT GGAAACCATC GGTTTCTCGC TGTATATGGA GTTGCTGGAA AACGCCGTCG ATGCGCTGAA AGCCGGACGC GAGCCGTCGC TGGAAGATCT CACCAGCCAG CAAACAGAAG TCGAGCTGCG GATACCGTCG CTATTGCCAG ATGATTTCAT TCCTGACGTG AATACGCGTT TGTCGTTCTA TAAACGTATT GCCAGCGCCA AAACGGAAAA CGAACTGGAA GAGATCAAAG TCGAGCTTAT CGATCGCTTC GGCCTGCTGC CGGATCCGGC GCGTACCCTG CTGGATATTG CCCGTCTGCG CCAGCAAGCG CAGAAACTGG GGATCAGGAA GCTGGAAGGT AATGAGAAAG GCGGCGTGAT CGAATTTGCC GAGAAGAATC ACGTTAATCC GGCCTGGTTG ATTGGTTTGC TGCAAAAACA GCCGCAGCAT TATCGCCTCG ATGGCCCGAC GCGCCTGAAG TTTATTCAGG ATTTGAGTGA GCGGAAAACG CGTATCGAAT GGGTACGCCA GTTTATGCGT GAACTGGAAG AGAACGCGAT CGCTTGA
|
Protein sequence | MPEQYRYTLP VKAGEQRLLG ELTGAACATL VAEIAERHAG PVVLIAPDMQ NALRLHDEIS QFTDQMVMNL ADWETLPYDS FSPHQDIISS RLSTLYQLPT MQRGVLIVPV NTLMQRVCPH SFLHGHALVM KKGQRLSRDA LRTQLDSAGY RHVDQVMEHG EYATRGALLD LFPMGSELPY RLDFFDDEID SLRVFDVDSQ RTLEEVEAIN LLPAHEFPTD KAAIELFRSQ WRDTFEVKRD PEHIYQQVSK GTLPAGIEYW QPLFFSEPLP PLFSYFPANT LLVNTGDLEN SAERFQADTL ARFENRGVDP MRPLLPPQSL WLRVDELFSE LKNWPRVQLK TEHLPTKAAN ANLGFQKLPD LAVQAQQKAP LDALRKFLES FDGPVVFSVE SEGRREALGE LLARIKIAPQ RIMRLDEAKD RGRYLMIGAA EHGFVDTVRN LALICESDLL GERVARRRQD SRRTINPDTL IRNLAELHIG QPVVHLEHGV GRYAGMTTLE AGGITGEYLM LTYANDAKLY VPVSSLHLIS RYAGGAEENA PLHKLGGDAW SRARQKAAEK VRDVAAELLD IYAQRAAKEG FAFKHDREQY QLFCDSFPFE TTPDQAQAIN AVLSDMCQPL AMDRLVCGDV GFGKTEVAMR AAFLAVDNHK QVAVLVPTTL LAQQHYDNFR DRFANWPVRI EMISRFRSAK EQTQILAEVA EGKIDILIGT HKLLQNDVKF KDLGLLIVDE EHRFGVRHKE RIKAMRANVD ILTLTATPIP RTLNMAMSGM RDLSIIATPP ARRLAVKTFV REYDSLVVRE AILREILRGG QVYYLYNDVE NIQKAAERLA ELVPEARIAI GHGQMREREL ERVMNDFHHQ RFNVLVCTTI IETGIDIPTA NTIIIERADH FGLAQLHQLR GRVGRSHHQA YAWLLTPHPK AMTTDAQKRL EAIASLEDLG AGFALATHDL EIRGAGELLG EEQSGSMETI GFSLYMELLE NAVDALKAGR EPSLEDLTSQ QTEVELRIPS LLPDDFIPDV NTRLSFYKRI ASAKTENELE EIKVELIDRF GLLPDPARTL LDIARLRQQA QKLGIRKLEG NEKGGVIEFA EKNHVNPAWL IGLLQKQPQH YRLDGPTRLK FIQDLSERKT RIEWVRQFMR ELEENAIA
|
| |