Gene EcE24377A_0165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0165 
Symboldgt 
ID5585949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp180580 
End bp182097 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content48% 
IMG OID640923894 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_001461331 
Protein GI157155851 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAGA TTGATTTCCG AAAAAAAATA AACTGGCATC GTCGTTACCG TTCACCGCAG 
GGCGTTAAAA CCGAACATGA GATCCTGCGG ATCTTCGAGA GCGATCGCGG GCGTATCATC
AACTCTCCGG CAATTCGTCG TCTGCAACAA AAGACCCAGG TTTTTCCACT GGAGCGCAAT
GCCGCCGTGC GCACGCGTCT TACCCACTCG ATGGAAGTCC AGCAGGTGGG GCGCTACATC
GCCAAAGAAA TTTTAAGCCG TCTGAAAGAG CTTAAATTGC TGGAAGCATA CGGCCTGGAT
GAACTGACCG GACCTTTTGA AAGCATTGTT GAGATGTCAT GCCTGATGCA CGATATCGGC
AATCCGCCGT TTGGTCATTT TGGCGAAGCG GCGATAAATG ACTGGTTTCG CCAGCGTTTA
TACCCGGAAG ATGCCGAAAG CCAGCCTCTG ACTGACGATC GCTGCAGCGT GGCGGCACTA
CGTTTACGGG ACGGGGAAGA ACCGCTTAAC GAGCTGCGGC GCAAGATTCG TCAGGACTTA
TGTCATTTTG AGGGGAATGC ACAAGGCATT CGTCTGGTGC ATACATTGAT GAGGATGAAT
CTCACCTGGG CACAGGTTGG CGGTATTTTA AAATATACCC GTCCGGCGTG GTGGCGTGGC
GAAACGCCTG AGACACATCA CTATTTAATG AAAAAGCCGG GTTATTATCT TTCTGAAGAA
GCCTATATTG CCCGGTTGCG TAAAGAACTT AATTTGGCGC TTTACAGTCG TTTTCCATTA
ACGTGGATTA TGGAAGCTGC CGACGACATC TCCTATTGTG TGGCAGACCT TGAAGATGCG
GTAGAGAAAA GAATATTTAC CGTTGAGCAG CTTTATCATC ATTTGCACGA GGCGTGGGGT
CAGCATGAGA AAGGTTCGCT CTTTTCGCTA GTGGTTGAAA ATGCCTGGGA AAAATCACGC
TCAAATAGTT TAAGCCGCAG TACGGAAGAT CAGTTTTTTA TGTATTTACG GGTAAACACC
CTAAATAAAC TGGTACCATA CGCGGCACAA CGATTTATTG ATAATCTGCC TGCGATTTTC
GCCGGAACGT TTAATCATGC ATTATTGGAA GATGCCAGCG AATGCAGCGA TCTTCTTAAG
CTATATAAAA ATGTCGCTGT AAAACATGTG TTTAGCCATC CGGATGTCGA GCAGCTTGAA
TTGCAGGGCT ATCGGGTCAT TAGCGGATTA TTAGAGATTT ATCGTCCTTT ATTAAGCCTG
TCGTTATCAG ACTTTACTGA ACTGGTAGAA AAAGAACGGG TGAAACGTTT CCCTATTGAA
TCGCGCTTAT TCCACAAACT CTCGACGCGC CATCGGCTGG CCTATGTCGA GGCTGTCAGT
AAATTACCGT CAGATTCTCC TGAGTTTCCG CTATGGGAAT ATTATTACCG TTGCCGCCTG
CTGCAGGATT ATATCAGCGG TATGACCGAC CTCTATGCGT GGGATGAATA CCGACGTCTG
ATGGCCGTAG AACAATAA
 
Protein sequence
MAQIDFRKKI NWHRRYRSPQ GVKTEHEILR IFESDRGRII NSPAIRRLQQ KTQVFPLERN 
AAVRTRLTHS MEVQQVGRYI AKEILSRLKE LKLLEAYGLD ELTGPFESIV EMSCLMHDIG
NPPFGHFGEA AINDWFRQRL YPEDAESQPL TDDRCSVAAL RLRDGEEPLN ELRRKIRQDL
CHFEGNAQGI RLVHTLMRMN LTWAQVGGIL KYTRPAWWRG ETPETHHYLM KKPGYYLSEE
AYIARLRKEL NLALYSRFPL TWIMEAADDI SYCVADLEDA VEKRIFTVEQ LYHHLHEAWG
QHEKGSLFSL VVENAWEKSR SNSLSRSTED QFFMYLRVNT LNKLVPYAAQ RFIDNLPAIF
AGTFNHALLE DASECSDLLK LYKNVAVKHV FSHPDVEQLE LQGYRVISGL LEIYRPLLSL
SLSDFTELVE KERVKRFPIE SRLFHKLSTR HRLAYVEAVS KLPSDSPEFP LWEYYYRCRL
LQDYISGMTD LYAWDEYRRL MAVEQ