Gene EcDH1_3442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3442 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3690623 
End bp3692140 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content48% 
IMG OID 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionACX41057 
Protein GI260450635 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAGA TTGATTTCCG AAAAAAAATA AACTGGCATC GTCGTTACCG TTCACCGCAG 
GGCGTTAAAA CCGAACATGA GATCCTGCGG ATCTTCGAGA GCGATCGCGG GCGTATCATC
AACTCTCCGG CAATTCGTCG TCTGCAACAA AAGACCCAGG TTTTTCCACT GGAGCGCAAT
GCCGCCGTGC GCACGCGTCT TACCCACTCG ATGGAAGTCC AGCAGGTGGG GCGCTACATC
GCCAAAGAAA TTTTAAGCCG TCTGAAAGAG CTTAAATTAC TGGAAGCATA CGGCCTGGAT
GAACTGACCG GTCCCTTTGA AAGCATTGTT GAGATGTCAT GCCTGATGCA CGATATCGGC
AATCCGCCGT TTGGTCATTT TGGCGAAGCG GCGATAAATG ACTGGTTTCG CCAACGTTTG
CACCCGGAAG ATGCCGAAAG CCAGCCTCTG ACTGACGATC GCTGCAGCGT GGCGGCACTA
CGTTTACGGG ACGGGGAAGA ACCGCTTAAC GAGCTGCGGC GCAAGATTCG TCAGGACTTA
TGTCATTTTG AGGGGAATGC ACAAGGCATT CGCCTGGTGC ATACATTGAT GCGGATGAAT
CTCACCTGGG CACAGGTTGG CGGTATTTTA AAATATACCC GTCCGGCGTG GTGGCGTGGC
GAAACGCCTG AGACACATCA CTATTTAATG AAAAAGCCGG GTTATTATCT TTCTGAAGAA
GCCTATATTG CCCGGTTGCG TAAAGAACTT AATTTGGCGC TTTACAGTCG TTTTCCATTA
ACGTGGATTA TGGAAGCTGC CGACGACATC TCCTATTGTG TGGCAGACCT TGAAGATGCG
GTAGAGAAAA GAATATTTAC CGTTGAGCAG CTTTATCATC ATTTGCACGA AGCGTGGGGC
CAGCATGAGA AAGGTTCGCT CTTTTCGCTG GTGGTTGAAA ATGCCTGGGA AAAATCACGC
TCAAATAGTT TAAGCCGCAG TACGGAAGAT CAGTTTTTTA TGTATTTACG GGTAAACACC
CTAAATAAAC TGGTACCCTA CGCGGCACAA CGATTTATTG ATAATCTGCC TGCGATTTTC
GCCGGAACGT TTAATCATGC ATTATTGGAA GATGCCAGCG AATGCAGCGA TCTTCTTAAG
CTATATAAAA ATGTCGCTGT AAAACATGTG TTTAGCCATC CAGATGTCGA GCGGCTTGAA
TTGCAGGGCT ATCGGGTCAT TAGCGGATTA TTAGAGATTT ATCGTCCTTT ATTAAGCCTG
TCGTTATCAG ACTTTACTGA ACTGGTAGAA AAAGAACGGG TGAAACGTTT CCCTATTGAA
TCGCGCTTAT TCCACAAACT CTCGACGCGC CATCGGCTGG CCTATGTCGA GGCTGTCAGT
AAATTACCGT CAGATTCTCC TGAGTTTCCG CTATGGGAAT ATTATTACCG TTGCCGCCTG
CTGCAGGATT ATATCAGCGG TATGACCGAC CTCTATGCGT GGGATGAATA CCGACGTCTG
ATGGCCGTAG AACAATAA
 
Protein sequence
MAQIDFRKKI NWHRRYRSPQ GVKTEHEILR IFESDRGRII NSPAIRRLQQ KTQVFPLERN 
AAVRTRLTHS MEVQQVGRYI AKEILSRLKE LKLLEAYGLD ELTGPFESIV EMSCLMHDIG
NPPFGHFGEA AINDWFRQRL HPEDAESQPL TDDRCSVAAL RLRDGEEPLN ELRRKIRQDL
CHFEGNAQGI RLVHTLMRMN LTWAQVGGIL KYTRPAWWRG ETPETHHYLM KKPGYYLSEE
AYIARLRKEL NLALYSRFPL TWIMEAADDI SYCVADLEDA VEKRIFTVEQ LYHHLHEAWG
QHEKGSLFSL VVENAWEKSR SNSLSRSTED QFFMYLRVNT LNKLVPYAAQ RFIDNLPAIF
AGTFNHALLE DASECSDLLK LYKNVAVKHV FSHPDVERLE LQGYRVISGL LEIYRPLLSL
SLSDFTELVE KERVKRFPIE SRLFHKLSTR HRLAYVEAVS KLPSDSPEFP LWEYYYRCRL
LQDYISGMTD LYAWDEYRRL MAVEQ