Gene ECH74115_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3902 
Symbol 
ID6967721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3616405 
End bp3617673 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content58% 
IMG OID643387676 
Producthydroxyglutarate oxidase 
Protein accessionYP_002272124 
Protein GI209400455 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATT TTGTGATTAT TGGCGGCGGC ATCATCGGCA TGTCGACCGC CATGCAACTG 
ATTGATGTCT ATCCGGATGC CCGCATTGCG TTGCTGGAAA AAGAGTCCGG CCCGGCCTGT
CATCAGACGG GCCACAACAG CGGCGTGATC CATGCCGGGG TTTATTACAC GCCCGGTAGC
CTGAAGGCGC AGTTTTGCCT GGCGGGAAAC CGCGCCACTA AAGCCTTTTG CGATCAAAAC
GGCATTCGCT ATGACAACTG CGGCAAGATG CTGGTCGCCA CCTCTGAACT CGAAATGGAA
CGGATGCGCG CGTTGTGGGA ACGCACGGCG GCAAACGGTA TCGAGCGCGA GTGGTTAAAC
GCCGATGAAC TGCGCGAGCG CGAACCGAAT ATCACCGGGC TTGGCGGTAT TTTTGTGCCG
TCCAGCGGCA TTGTCAGCTA CCGCGAAGTC ACGGCGGCGA TGGCAAAAAT TTTCCAGGCC
AGAGGCGGCG AGATTATCTA TAACGCCGAA GTCAGCGGCC TTAGTGAGCA TAAAAGCGGC
GTGGTGATAC GTACCCGTCA GGGCAGCGAC TATGAAGCAT CGACGCTGAT TAGCTGTTCC
GGGCTGATGG CTGACCGGCT GGTGAAAATG CTCGGCCTCG AACCGGGCTT TATTATCTGC
CCGTTCCGCG GCGAGTATTT CCGCCTGGCG CCGGAGCATA ACCAGATTGT TAACCATCTG
ATTTACCCCA TTCCCGACCC GGCAATGCCG TTTTTGGGCG TGCATCTCAC CCGCATGATC
GACGGCAGCG TCACCGTCGG GCCAAACGCG GTGCTGGCTT TCAAACGCGA AGGCTATCGC
AAGCGCGACT TCTCATTTAG CGACACGCTG GAGATTTTGG GCTCGTCGGG GATTCGCCGG
GTGCTGCAAA ACCATCTACG CTCAGGACTG GGCGAGATGA AAAACTCGCT GTGCAAAAGC
GGCTATCTGC GGCTGGTGCA AAAGTATTGT CCCCGGCTTT CGTTAAGCGA TCTCCAGCCC
TGGCCCGCCG GTGTGCGGGC GCAGGCGGTA TCGCCGGACG GCAAGCTGAT TGACGATTTT
CTGTTTGTCA CCACCCCGCG CACGATCCAC ACCTGCAATG CGCCCTCCCC GGCAGCGACA
TCAGCAATTC CTATTGGTGC GCATATTGTC AGCAAGGTAC AAACGCTGTT GGCAAGCCAG
AGTAACCCCG GACGCACGCT GCGAGCGGCA CGTAGTGTGG ATGCCTTACA CGCCGCGTTT
AATCAATAA
 
Protein sequence
MYDFVIIGGG IIGMSTAMQL IDVYPDARIA LLEKESGPAC HQTGHNSGVI HAGVYYTPGS 
LKAQFCLAGN RATKAFCDQN GIRYDNCGKM LVATSELEME RMRALWERTA ANGIEREWLN
ADELREREPN ITGLGGIFVP SSGIVSYREV TAAMAKIFQA RGGEIIYNAE VSGLSEHKSG
VVIRTRQGSD YEASTLISCS GLMADRLVKM LGLEPGFIIC PFRGEYFRLA PEHNQIVNHL
IYPIPDPAMP FLGVHLTRMI DGSVTVGPNA VLAFKREGYR KRDFSFSDTL EILGSSGIRR
VLQNHLRSGL GEMKNSLCKS GYLRLVQKYC PRLSLSDLQP WPAGVRAQAV SPDGKLIDDF
LFVTTPRTIH TCNAPSPAAT SAIPIGAHIV SKVQTLLASQ SNPGRTLRAA RSVDALHAAF
NQ