Gene ECH74115_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1126 
SymbolhelD 
ID6971402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1157297 
End bp1159351 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content53% 
IMG OID643385131 
ProductDNA helicase IV 
Protein accessionYP_002269630 
Protein GI209400242 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.440253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.367013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTGA AAGCGACAAC GCTTGGAAAA CGTCTGGCAC AGCACCCTTA CGATCGGGCG 
GTGATCCTCA ATGCCGGGAT TAAAGTCTCC GGCGATCGCC ACGAATACCT TATTCCTTTC
AATCAATTAC TGGCGATTCA CTGTAAGCGC GGTCTGGTAT GGGGCGAGCT GGAATTTGTA
CTGCCGGACG AAAAAGTGGT GCGTCTGCAC GGTACCGAAT GGGGCGAGAC GCAGCGTTTT
TACCATCATC TTGATGCTCA CTGGCGGCGG TGGAGTGGCG AGATGAGCGA AATTGCGTCT
GGTGTTTTAC GCCAGCAACT GGATTTGATT GCCACGCGCA CTGGAGAAAA TAAATGGCTG
ACGCGTGAGC AAACCTCTGG CGTTCAGCAA CAAATCCGCC AGGCTTTGTC GGCGTTGCCG
TTGCCGGTTA ACCGACTGGA AGAATTCGAT AACTGCCGTG AGGCGTGGCG TAAATGTCAG
GCCTGGTTGA AAGATATTGA AAGCGCTCGG TTGCAGCATA ACCAGGCGTA TACCGAAGCC
ATGCTTACCG AGTATGCGGA TTTTTTCCGC CAGGTCGAGT CTTCACCGCT GAATCCGGCG
CAGGCCCGGG CAGTCGTTAA TGGCGAGCAT TCTCTGTTAG TGCTGGCAGG TGCAGGAAGC
GGAAAAACGT CGGTGCTGGT GGCCCGTGCA GGCTGGTTGC TGGCGCGTGG TGAAGCGTCC
CCTGAGCAAA TTTTATTGCT GGCGTTTGGT CGCAAAGCCG CTGAAGAGAT GGACGAGCGT
ATTCGCGAAC GGCTACATAC CGAAGACATT ACCGCACGCA CGTTTCATGC GCTGGCGCTG
CATATTATTC AGCAGGGCAG CAAAAAAGTT CCGATAGTCA GCAAACTGGA AAATGATACC
GCTGCCCGTC ATGAACTCTT TATTGCTGAG TGGCGCAAGC AATGCAGCGA AAAGAAAGCG
CAGGCGAAGG GCTGGCGGCA ATGGCTGACG GAAGAAATGC AGTGGTCAGT GCCAGAAGGT
AACTTCTGGG ATGATGAAAA ATTACAGCGT CGTCTTGCCT CACGCCTCGA TCGTTGGGTA
AGTCTGATGC GGATGCACGG TGGTGCACAG GCAGAAATGA TTGCCAGTGC ACCCGAAGAG
ATTCGCGATC TGTTCAGTAA ACGTATCAAG TTGATGGCCC CGTTATTAAA AGCCTGGAAA
GGTGCGCTGA AGGCAGAAAA CGCTGTCGAT TTTTCGGGCC TTATTCATCA GGCGATTGTG
ATTCTGGAGA AAGGTCGCTT TATCAGCCCG TGGAAGCATA TTCTGGTTGA TGAATTTCAG
GATATCTCGC CGCAGCGGGC AGCGTTGTTA GCGGCATTAC GCAAGCAAAA CAGTCAGACG
ACGTTGTTCG CCGTTGGTGA TGACTGGCAG GCGATTTACC GATTCAGCGG TGCGCAAATG
TCGCTCACCA CCGCTTTCCA TGAAAACTTT GGTGAAGGCG AACGCTGTGA TTTAGACACG
ACTTACCGTT TTAACAGTCG TATCGGTGAG GTGGCAAACC GGTTTATTCA GCAGAACCCA
GGCCAGCTGA AAAAGCCGCT AAACAGCTTA ACCAATGGAG ACAAAAAAGC CGTCACGTTA
TTGGATGAGA GTCAACTTGA CGCTTTGCTG GATAAGCTCT CTGGTTATGC CAAACCGGAA
GAGCGCATTC TGATCCTGGC GCGTTACCAT CACATGAGGC CTGCCAGCCT GGAAAAAGCG
GCAACACGCT GGCCGAAGTT GCAAATCGAC TTTATGACCA TTCATGCCAG CAAAGGGCAA
CAGGCGGATT ACGTCCTCAT CGTTGGCTTG CAGGAGGGAA GTGATGGTTT TCCGGCTGCG
GCGCGGGAGT CGATTATGGA AGAGGCGCTA CTGCCACCGG TTGAGGATTT CCCGGACGCT
GAAGAACGGC GGTTAATGTA CGTGGCGCTG ACCCGGGCAC GTCATCGGGT ATGGGCACTG
TTTAACAAAG AGAATCCCTC TCCCTTTGTG GAAATACTGA AAAATCTGGA TGTGCCGGTG
GCGAGAAAAC CGTAA
 
Protein sequence
MELKATTLGK RLAQHPYDRA VILNAGIKVS GDRHEYLIPF NQLLAIHCKR GLVWGELEFV 
LPDEKVVRLH GTEWGETQRF YHHLDAHWRR WSGEMSEIAS GVLRQQLDLI ATRTGENKWL
TREQTSGVQQ QIRQALSALP LPVNRLEEFD NCREAWRKCQ AWLKDIESAR LQHNQAYTEA
MLTEYADFFR QVESSPLNPA QARAVVNGEH SLLVLAGAGS GKTSVLVARA GWLLARGEAS
PEQILLLAFG RKAAEEMDER IRERLHTEDI TARTFHALAL HIIQQGSKKV PIVSKLENDT
AARHELFIAE WRKQCSEKKA QAKGWRQWLT EEMQWSVPEG NFWDDEKLQR RLASRLDRWV
SLMRMHGGAQ AEMIASAPEE IRDLFSKRIK LMAPLLKAWK GALKAENAVD FSGLIHQAIV
ILEKGRFISP WKHILVDEFQ DISPQRAALL AALRKQNSQT TLFAVGDDWQ AIYRFSGAQM
SLTTAFHENF GEGERCDLDT TYRFNSRIGE VANRFIQQNP GQLKKPLNSL TNGDKKAVTL
LDESQLDALL DKLSGYAKPE ERILILARYH HMRPASLEKA ATRWPKLQID FMTIHASKGQ
QADYVLIVGL QEGSDGFPAA ARESIMEEAL LPPVEDFPDA EERRLMYVAL TRARHRVWAL
FNKENPSPFV EILKNLDVPV ARKP