Gene EcHS_A1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1237 
Symbolmfd 
ID5592971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1234224 
End bp1237670 
Gene Length3447 bp 
Protein Length1148 aa 
Translation table11 
GC content55% 
IMG OID640920397 
Producttranscription-repair coupling factor 
Protein accessionYP_001457959 
Protein GI157160641 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.792337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGAAC AATATCGTTA TACGCTGCCC GTCAAAGCGG GTGAGCAGCG TCTGCTGGGC 
GAGTTAACCG GCGCAGCCTG TGCAACGCTG GTAGCGGAAA TTGCCGAACG TCACGCCGGT
CCGGTGGTAC TCATTGCACC AGATATGCAA AATGCTCTGC GTTTGCATGA TGAAATCAGC
CAGTTCACCG ATCAGATGGT GATGAATCTG GCGGACTGGG AAACTCTTCC CTACGACAGT
TTTTCGCCTC ATCAGGACAT TATCTCCTCG CGCCTTTCCA CCCTTTACCA GCTACCGACG
ATGCAGCGTG GCGTACTGAT TGTTCCGGTG AATACGCTTA TGCAGCGCGT TTGCCCACAC
AGTTTTCTCC ACGGTCATGC GCTGGTGATG AAAAAAGGCC AGCGCCTGTC ACGAGATGCA
TTACGAACCC AACTGGACAG CGCCGGTTAT CGCCATGTTG ACCAGGTGAT GGAGCACGGC
GAATACGCCA CGCGCGGCGC GTTGCTGGAT CTCTTCCCGA TGGGGAGTGA GCTGCCTTAT
CGTCTTGATT TCTTTGATGA TGAAATCGAC AGCCTGCGGG TGTTTGACGT CGACAGCCAG
CGCACGCTGG AGGAAGTAGA AGCGATCAAT CTGCTGCCCG CGCACGAATT TCCGACCGAT
AAAGCGGCAA TTGAACTGTT CCGCAGCCAG TGGCGCGATA CCTTCGAAGT GAAGCGCGAT
CCGGAACATA TTTACCAGCA AGTGAGTAAA GGCACATTAC CTGCCGGGAT CGAGTACTGG
CAGCCATTGT TCTTCAGCGA ACCACTGCCG CCGCTGTTCA GTTATTTCCC TGCCAATACC
TTGCTGGTGA ATACTGGCGA TCTGGAAACC AGTGCCGAAC GTTTCCAGGC TGACACGCTG
GCGCGTTTTG AGAATCGCGG CGTCGATCCG ATGCGCCCGC TGTTGCCACC ACAATCGCTC
TGGCTGCGGG TGGACGAGCT CTTCTCAGAG CTGAAAAACT GGCCCCGGGT GCAGCTAAAA
ACTGAACATT TACCGACAAA AGCCGCGAAT GCCAATTTAG GTTTCCAGAA ACTGCCAGAC
CTGGCCGTTC AGGCGCAACA AAAAGCGCCG CTGGATGCGC TGCGTAAGTT CCTCGAGACT
TTCGACGGTC CGGTGGTGTT CTCGGTAGAA AGTGAAGGTC GCCGTGAAGC GCTGGGTGAA
CTGCTCGCAC GAATTAAAAT TGCTCCGCAA CGCATTATGC GTCTTGATGA AGCCAGCGAC
CGTGGGCGTT ATCTGATGAT TGGCGCTGCC GAACATGGTT TTGTCGATAC GGTGCGTAAT
CTGGCGCTGA TCTGCGAAAG CGATCTGCTC GGTGAACGTG TTGCCCGCCG TCGTCAGGAT
TCTCGCCGCA CCATCAACCC CGATACACTG ATCCGTAACC TTGCGGAGCT GCATATTGGT
CAGCCGGTGG TCCATCTGGA GCACGGTGTC GGGCGCTATG CCGGAATGAC CACGCTCGAA
GCGGGCGGCA TTACTGGCGA GTATTTGATG CTCACCTATG CCAACGACGC CAAACTGTAT
GTTCCGGTGT CGTCACTGCA TCTGATTAGC CGTTACGCAG GTGGCGCGGA AGAAAACGCC
CCGCTGCATA AACTTGGCGG CGATGCGTGG TCACGCGCGC GGCAGAAAGC GGCGGAAAAA
GTGCGTGATG TGGCGGCGGA ATTGCTGGAT ATCTACGCGC AACGCGCCGC CAAAGAGGGC
TTCGCGTTTA AACACGATCG TGAGCAGTAT CAGTTGTTCT GCGACAGCTT CCCGTTTGAA
ACCACGCCGG ATCAGGCGCA GGCCATTAAT GCGGTACTTA GCGACATGTG TCAGCCGCTG
GCAATGGATC GTCTGGTGTG CGGCGATGTT GGCTTTGGTA AAACAGAAGT GGCGATGCGC
GCCGCTTTCC TGGCAGTAGA TAACCACAAG CAGGTAGCGG TGTTGGTGCC TACCACCCTT
CTTGCGCAGC AGCATTACGA CAACTTCCGC GACCGTTTCG CCAACTGGCC GGTACGTATC
GAAATGATCT CCCGTTTCCG CAGCGCCAAA GAGCAGACGC AAATCCTTGC GGAAGTGGCG
GAAGGGAAAA TCGATATTCT GATCGGTACG CACAAACTGC TGCAAAGTGA CGTCAAGTTT
AAAGATTTAG GCCTGCTGAT TGTCGATGAA GAACACCGCT TCGGGGTGCG TCATAAAGAG
CGCATTAAAG CGATGCGCGC GAACGTGGAT ATTCTGACGC TTACTGCAAC GCCGATCCCA
CGCACGCTGA ATATGGCAAT GAGCGGAATG CGTGACCTGT CGATTATCGC CACGCCGCCC
GCCCGTCGTC TGGCAGTTAA AACCTTTGTC CGTGAGTATG ACAGCCTGGT GGTCCGGGAG
GCGATCCTGC GTGAAATTTT GCGCGGGGGG CAGGTTTATT ATCTCTACAA TGATGTGGAA
AACATCCAGA AAGCTGCCGA ACGGCTGGCA GAACTGGTGC CTGAAGCACG GATTGCCATC
GGTCACGGGC AGATGCGCGA GCGCGAACTG GAACGGGTGA TGAATGATTT CCATCATCAA
CGTTTCAACG TGCTGGTTTG TACCACCATT ATCGAAACCG GGATCGACAT CCCGACAGCC
AACACCATTA TCATTGAACG TGCGGATCAC TTCGGTCTGG CGCAGCTGCA CCAGTTACGC
GGTCGCGTCG GACGTTCACA TCATCAGGCA TATGCATGGC TGCTGACGCC GCATCCAAAA
GCGATGACTA CCGATGCACA AAAACGTCTT GAAGCGATTG CCTCGCTGGA AGATCTCGGT
GCAGGCTTTG CGCTGGCAAC GCACGATCTG GAGATCCGCG GCGCGGGTGA ACTGCTTGGC
GAAGAACAAA GTGGCTCAAT GGAAACCATC GGTTTCTCGC TGTATATGGA GTTGCTGGAA
AACGCCGTCG ATGCACTGAA AGCCGGACGC GAGCCGTCGC TGGAAGATCT CACCAGCCAG
CAAACAGAAG TCGAGCTGCG GATGCCGTCG CTATTGCCAG ATGATTTCAT TCCTGACGTG
AATACGCGTT TGTCGTTCTA TAAACGTATT GCCAGCGCCA AAACGGAAAA CGAACTGGAA
GAGATCAAAG TCGAGCTTAT CGATCGCTTC GGCCTGCTGC CGGATCCGGC GCGTACCCTG
CTGGATGTTG CCCGTCTGCG CCAGCAAGCG CAGAAACTGG GGATCAGGAA GCTGGAAGGT
AATGAGAAAG GCGGGGTGAT CGAATTTGCC GAGAAGAATC ACGTTAATCC GGCCTGGTTG
ATTGGTTTGC TGCAAAAACA GCCGCAGCAT TACCGCCTTG ATGGTCCGAC GCGCCTGAAA
TTTATTCAGG ATTTGAGTGA GCGGAAAACG CGTATCGAAT GGGTACGCCA GTTTATGCGT
GAACTGGAAG AGAACGCGAT CGCTTAA
 
Protein sequence
MPEQYRYTLP VKAGEQRLLG ELTGAACATL VAEIAERHAG PVVLIAPDMQ NALRLHDEIS 
QFTDQMVMNL ADWETLPYDS FSPHQDIISS RLSTLYQLPT MQRGVLIVPV NTLMQRVCPH
SFLHGHALVM KKGQRLSRDA LRTQLDSAGY RHVDQVMEHG EYATRGALLD LFPMGSELPY
RLDFFDDEID SLRVFDVDSQ RTLEEVEAIN LLPAHEFPTD KAAIELFRSQ WRDTFEVKRD
PEHIYQQVSK GTLPAGIEYW QPLFFSEPLP PLFSYFPANT LLVNTGDLET SAERFQADTL
ARFENRGVDP MRPLLPPQSL WLRVDELFSE LKNWPRVQLK TEHLPTKAAN ANLGFQKLPD
LAVQAQQKAP LDALRKFLET FDGPVVFSVE SEGRREALGE LLARIKIAPQ RIMRLDEASD
RGRYLMIGAA EHGFVDTVRN LALICESDLL GERVARRRQD SRRTINPDTL IRNLAELHIG
QPVVHLEHGV GRYAGMTTLE AGGITGEYLM LTYANDAKLY VPVSSLHLIS RYAGGAEENA
PLHKLGGDAW SRARQKAAEK VRDVAAELLD IYAQRAAKEG FAFKHDREQY QLFCDSFPFE
TTPDQAQAIN AVLSDMCQPL AMDRLVCGDV GFGKTEVAMR AAFLAVDNHK QVAVLVPTTL
LAQQHYDNFR DRFANWPVRI EMISRFRSAK EQTQILAEVA EGKIDILIGT HKLLQSDVKF
KDLGLLIVDE EHRFGVRHKE RIKAMRANVD ILTLTATPIP RTLNMAMSGM RDLSIIATPP
ARRLAVKTFV REYDSLVVRE AILREILRGG QVYYLYNDVE NIQKAAERLA ELVPEARIAI
GHGQMREREL ERVMNDFHHQ RFNVLVCTTI IETGIDIPTA NTIIIERADH FGLAQLHQLR
GRVGRSHHQA YAWLLTPHPK AMTTDAQKRL EAIASLEDLG AGFALATHDL EIRGAGELLG
EEQSGSMETI GFSLYMELLE NAVDALKAGR EPSLEDLTSQ QTEVELRMPS LLPDDFIPDV
NTRLSFYKRI ASAKTENELE EIKVELIDRF GLLPDPARTL LDVARLRQQA QKLGIRKLEG
NEKGGVIEFA EKNHVNPAWL IGLLQKQPQH YRLDGPTRLK FIQDLSERKT RIEWVRQFMR
ELEENAIA