Gene ECH74115_0948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0948 
SymboldinG 
ID6967338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp960737 
End bp963025 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content53% 
IMG OID643384969 
ProductATP-dependent DNA helicase DinG 
Protein accessionYP_002269469 
Protein GI209398717 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCGA ATTTTCAATC AAATATATTG CGAGAGGGCG TGTTTGCGAG CCGCTTTCCA 
GAAACAGAAA AACCATTACC CCTGAAAACC GAAAAATGCC ACAATATTGG CTGTTTATAC
AGTATTTCAG GTTTTCTCAT GGCATTAACC GCCGCGCTTA AAGCGCAAAT TGCCGCCTGG
TATAAGGCGC TTCAGGAACA GATCCCCGAC TTTATTCCCC GTGCGCCGCA GCGGCAGATG
ATTGCGGACG TCGCCAAAAC GCTGGCCGGA GAAGAAGGGC GGCATCTGGC GATTGAAGCC
CCCACTGGCG TTGGGAAAAC GCTCTCCTAT TTGATTCCCG GCATCGCCAT TGCCCGCGAA
GAGCAAAAAA CGCTGGTGGT GAGTACCGCC AACGTGGCAT TGCAGGATCA GATCTACAGC
AAAGATTTGC CACTGCTGAA AAAGATCATT CCCGATCTTA AATTCACTGC CGCTTTTGGG
CGTGGGCGCT ACGTTTGCCC GCGTAATCTG ACGGCGCTTG CCAGCACTGA GCCCACGCAA
CAGGATCTGC TGGCGTTTCT TGACGACGAA CTGACGCCAA ATAATCAGGA AGAGCAAAAA
CGTTGTGCGA AGCTGAAGGG CGATCTCGAC ACTTATAAAT GGGATGGTCT GCGCGATCAT
ACGGATATCG CTATAGATGA CGATCTCTGG CGTCGTTTAA GTACCGACAA AGCCAGCTGC
CTCAACCGCA ACTGTTACTA CTATCGCGAA TGCCCGTTTT TTGTCGCTCG TCGGGAAATT
CAGGAAGCGG AAGTGGTGGT GGCAAACCAT GCGCTGGTGA TGGCGGCGAT GGAAAGCGAA
GCCGTATTGC CTGACCCGAA AAATTTACTG CTGGTGCTGG ACGAAGGCCA TCACCTGCCA
GATGTGGCGC GGGATGCGCT GGAGATGAGT GCCGAAATCA CCGCGCCATG GTATCGGCTA
CAGCTGGACT TGTTCACGAA ACTGGTCGCT ACCTGCATGG AGCAGTTTCG CCCGAAGACC
ATCCCGCCGT TGGCGATCCC TGAACGTTTG AATGCGCATT GTGAAGAGCT GTATGAGCTT
ATCGCCTCAT TAAACAACAT TCTCAATCTC TACATGCCTG CCGGGCAGGA AGCAGAACAC
CGTTTTGCGA TGGGCGAACT GCCTGATGAA GTGCTGGAGA TCTGCCAGCG GCTGGCAAAA
CTCACCGAGA TGCTGCGTGG TCTGGCGGAG TTATTTCTTA ACGATTTAAG TGAGAAAACC
GGCAGCCATG ACATTGTACG TCTGCATCGG TTGATTTTGC AGATGAACCG CGCGTTGGGG
ATGTTCGAGG CGCAAAGCAA ACTCTGGCGG CTGGCTTCGC TGGCGCAATC TTCCGGTGCA
CCGGTGACCA AATGGGCGAC GCGGGAAGAG CGCGAAGGGC AGCTACACCT CTGGTTTCAC
TGCGTGGGAA TACGTGTCAG CGATCAGCTG GAAAGGCTGC TGTGGCGCAG TATTCCGCAC
ATTATTGTCA CCTCCGCAAC CTTGCGTTCG CTGAACAGTT TTTCGCGTTT GCAGGAGATG
AGCGGGCTGA AAGAGAAAGC GGGCGACCGT TTTGTGGCGC TGGATTCCCC CTTTAACCAC
TGCGAACAGG GCAAAATTGT TATTCCCCGG ATGCGCGTTG AGCCTTCCAT CGACAACGAA
GAACAGCATA TTGCTGAAAT GGCGGCCTTT TTCCGTGAGC AGGTGGAGAG CAAAAAACAT
CTCGGTATGT TGGTGCTGTT TGCCAGCGGG CGTGCGATGC AGCGCTTTCT CGACTATGTG
ACGGATTTAC GTCTGATGTT GCTGGTGCAG GGCGATCAGC CGCGTTACCG TTTAGTTGAA
CTGCACCGCA AACGCGTCGC CAACGGTGAG CGTAGTGTGC TGGTGGGCTT ACAGTCATTT
GCCGAAGGGC TTGATTTGAA AGGTGATATG CTCAGCCAGG TGCATATCCA CAAAATCGCT
TTTCCGCCTA TCGACAGCCC GGTGGTGATC ACTGAAGGGG AATGGCTGAA AAGCCTCAAC
CGCTATCCAT TTGAGGTGCA AAGCCTGCCG AGCGCCTCGT TTAACCTGAT TCAGCAGGTT
GGGCGACTGA TTCGAAGTCA CGGTTGCTGG GGCGAAGTGG TGATTTACGA TAAACGCTTG
CTGACCAAAA ATTATGGCAA GCGACTACTG GATGCATTAC CGGTATTTCC GATAGAGCAA
CCGGAAGTCC CTGAAGGTAT AGTTAAAAAG AAAGAAAAAA CGAAATCCCC ACGCCGTCGG
CGGCGTTAA
 
Protein sequence
MPSNFQSNIL REGVFASRFP ETEKPLPLKT EKCHNIGCLY SISGFLMALT AALKAQIAAW 
YKALQEQIPD FIPRAPQRQM IADVAKTLAG EEGRHLAIEA PTGVGKTLSY LIPGIAIARE
EQKTLVVSTA NVALQDQIYS KDLPLLKKII PDLKFTAAFG RGRYVCPRNL TALASTEPTQ
QDLLAFLDDE LTPNNQEEQK RCAKLKGDLD TYKWDGLRDH TDIAIDDDLW RRLSTDKASC
LNRNCYYYRE CPFFVARREI QEAEVVVANH ALVMAAMESE AVLPDPKNLL LVLDEGHHLP
DVARDALEMS AEITAPWYRL QLDLFTKLVA TCMEQFRPKT IPPLAIPERL NAHCEELYEL
IASLNNILNL YMPAGQEAEH RFAMGELPDE VLEICQRLAK LTEMLRGLAE LFLNDLSEKT
GSHDIVRLHR LILQMNRALG MFEAQSKLWR LASLAQSSGA PVTKWATREE REGQLHLWFH
CVGIRVSDQL ERLLWRSIPH IIVTSATLRS LNSFSRLQEM SGLKEKAGDR FVALDSPFNH
CEQGKIVIPR MRVEPSIDNE EQHIAEMAAF FREQVESKKH LGMLVLFASG RAMQRFLDYV
TDLRLMLLVQ GDQPRYRLVE LHRKRVANGE RSVLVGLQSF AEGLDLKGDM LSQVHIHKIA
FPPIDSPVVI TEGEWLKSLN RYPFEVQSLP SASFNLIQQV GRLIRSHGCW GEVVIYDKRL
LTKNYGKRLL DALPVFPIEQ PEVPEGIVKK KEKTKSPRRR RR