Gene ECH74115_0170 details


Gene Information

Locus tag: ECH74115_0170
Symbol: dgt
ID: 6967955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 183417
End bp: 184934
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 48%
IMG OID: 643384246
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_002268769
Protein GI: 209396609
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
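
The coordinate and length fields in this record agree with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_len_bp // 3 - 1


length = gene_length(183417, 184934)
print(length)                  # 1518
print(protein_length(length))  # 505
```

Both results match the Gene Length and Protein Length fields of the record.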


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAGA TTGATTTCCG AAAAAAAATA AACTGGCATC GTCGTTACCG TTCACCGCAG 
GGCGTTAAAA CCGAACATGA GATCCTGCGG ATCTTCGAGA GCGATCGCGG GCGTATCATC
AACTCTCCGG CAATTCGTCG TCTGCAACAA AAGACCCAGG TTTTTCCACT GGAGCGCAAT
GCCGCCGTGC GCACGCGTCT TACCCACTCG ATGGAAGTCC AGCAGGTGGG GCGCTACATC
GCCAAAGAAA TTTTAAGCCG TCTGAAAGAG CTTAAATTGC TGGAAGCATA CGGCCTGGAT
GAACTGACCG GACCTTTTGA AAGCATTGTT GAGATGTCAT GCCTGATGCA CGATATCGGC
AATCCGCCGT TTGGTCATTT TGGCGAAGCG GCGATAAATG ACTGGTTTCG CCAGCGTTTG
CACCCGGAAG ATGCCGAAAG CCAGCCTCTG ACTGACGATC GCTGCAGCGT GGCGGTACTA
CGTTTACGGG ACGGGGAAGA ACCGCTTAAC GAGCTGCGGC GCAAGATTCG TCAGGACTTA
TGTCATTTTG AGGGGAATGC ACAAGGCATT CGTCTGGTGC ATACATTGAT GCGGATGAAT
CTCACCTGGG CGCAGGTTGG CGGTATTTTA AAATATACCC GTCCAGCGTG GTGGCGTGGC
GAAACGCCTG AGACACATCA CTATTTAATG AAAAAGCCGG GTTATTATCT TTCTGAAGAA
GCCTATATTG CCCGGTTGCG TAAAGAACTT AATTTGGCGC TTTACAGTCG TTTTCCATTA
ACGTGGATTA TGGAAGCTGC CGACGACATC TCCTATTGTG TGGCAGATCT TGAAGATGCG
GTAGAGAAAA GAATATTTAC CGTTGAGCAG CTTTATCATC ATTTGCACGA GGCGTGGGGC
CAGCATGAGA AAGGTTCGCT CTTTTCGCTG GTGGTTGAAA ATGCCTGGGA AAAATCACGC
TCAAATAGTT TAAGCCGCAG TACGGAAGAT CAGTTTTTTA TGTATTTACG GGTAAACACC
CTAAATAAAC TGGTACCCTA CGCGGCACAA CGATTTATTG ATAATCTGCC TGCGATTTTC
GCCGGAACGT TTAATCATGC ATTATTGGAA GATGCCAGCG AATGCAGCGA TCTTCTTAAG
CTATATAAAA ATGTCGCTGT AAAACATGTG TTTAGCCATC CAGATGTCGA GCAGCTTGAA
TTGCAGGGCT ATCGGGTCAT TAGCGGATTA TTAGAGATTT ATCGCCCTTT ATTAAGCCTG
TCATTATCAG ACTTTACTGA ACTGGTAGAA AAAGAACGGG TGAAACGTTC CCCTATTGAA
TCTCGCTTAT TCCACAAACT CTCGACGCGC CATCGGCTGG CCTATGTCGA GGCTGTCAGT
AAATTACCGT CAGATTCTCC TGAGTTTCCG CTATGGGAAT ATTATTACCG TTGCCGCCTG
CTGCAGGATT ATATCAGCGG TATGACCGAC CTCTATGCCT GGGATGAATA CCGACGTCTG
ATGGCCGTAG AACAATAA
 
Protein sequence
MAQIDFRKKI NWHRRYRSPQ GVKTEHEILR IFESDRGRII NSPAIRRLQQ KTQVFPLERN 
AAVRTRLTHS MEVQQVGRYI AKEILSRLKE LKLLEAYGLD ELTGPFESIV EMSCLMHDIG
NPPFGHFGEA AINDWFRQRL HPEDAESQPL TDDRCSVAVL RLRDGEEPLN ELRRKIRQDL
CHFEGNAQGI RLVHTLMRMN LTWAQVGGIL KYTRPAWWRG ETPETHHYLM KKPGYYLSEE
AYIARLRKEL NLALYSRFPL TWIMEAADDI SYCVADLEDA VEKRIFTVEQ LYHHLHEAWG
QHEKGSLFSL VVENAWEKSR SNSLSRSTED QFFMYLRVNT LNKLVPYAAQ RFIDNLPAIF
AGTFNHALLE DASECSDLLK LYKNVAVKHV FSHPDVEQLE LQGYRVISGL LEIYRPLLSL
SLSDFTELVE KERVKRSPIE SRLFHKLSTR HRLAYVEAVS KLPSDSPEFP LWEYYYRCRL
LQDYISGMTD LYAWDEYRRL MAVEQ
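
The sequences above are printed in space-separated blocks of ten characters. Before a length or GC content check, the blocks need to be joined into one contiguous string. The sketch below shows one way to do that in plain Python. The helper names `join_blocks` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence. How the record rounds its GC content field is not stated, so no rounding rule is assumed here.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCACAGA TTGATTTCCG AAAAAAAATA AACTGGCATC GTCGTTACCG TTCACCGCAG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence gives values that can be compared against the Gene Length and GC content fields.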