Gene ECH74115_4757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4757 
Symbol 
ID6971599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4402102 
End bp4403139 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content51% 
IMG OID643388454 
Productputative dehydrogenase 
Protein accessionYP_002272882 
Protein GI209400110 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.523048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.406172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATCA ACTGCGCCTT TATTGGCTTC GGCAAAAGCA CCACCCGTTA CCATCTGCCG 
TATGTACTTA ACCGCAAGGA TAGCTGGCAT GTCGCGCATA TTTTTCGCCG CCATGCAAAG
CCGGAAGAAC AAGCTCCCAT TTACTCTCAT ATCCATTTCA CCAGCGATCT CAACGAAGTG
CTAAACGATC CCGATGTTAA GCTGGTTGTT GTCTGCACCC ATGCGGACAG CCACTTCGAG
TACGCGAAGC GCGCGATGGA AGCCGGGAAA AATGTGCTGG TCGAAAAACC GTTCACTCCG
ACACTTGCGC AGGCGAAAGA GCTGTTTGCA CTGGCGAAAA GCAAAGGGCT GACCGTCACG
CCATATCAGA ATCGTCGCTT TGATTCCTGC TTCCTGACGG CGAAAAAAGC GATTGAAAGC
GGCAAGCTGG GAGAGATTGT CGAAGTGGAA AGCCATTTTG ACTATTACCG CCCGGTGGCA
GAAACCAAAC CTGGGCTGCC GCAGGATGGC GCGTTCTATG GCCTTGGTGT GCATACGATG
GACCAGATTA TTTCTCTGTT CGGTCGCCCG GATCACGTCG CTTATGACAT CCGCAGCCTG
CGCAATAAAG CCAATCCTGA CGATACTTTC GAAGCGCAAC TGTTTTATGG CGATCTAAAA
GCCATCGTCA AAACCAGCCA TCTGGTGAAA ATCGATTATC CGAAATTTAT CGTTCACGGT
AAGAAAGGTT CGTTTATTAA ATATGGTATC GACCAGCAGG AAACCAGCCT GAAGGCTAAT
ATTATGCCGG GCGAACCGGG ATTCGCAGCG GATGATTCGG TCGGTGTGCT GGAGTATGTC
AATGACGAGG GCGTGACGGT CAGAGAAGAG ATGAAGCCGG AGATGGGCGA TTACGGGCGC
GTTTATGATG CGTTGTATCA AACCATCACC CACGGTGCGC CAAATTACGT CAAGGAATCT
GAAGTTCTTA CGAATCTGGA AATACTTGAA CGTGGATTCG AGCAAGCCTC TCCCTCCACA
GTGACTCTCG CGAAGTAA
 
Protein sequence
MVINCAFIGF GKSTTRYHLP YVLNRKDSWH VAHIFRRHAK PEEQAPIYSH IHFTSDLNEV 
LNDPDVKLVV VCTHADSHFE YAKRAMEAGK NVLVEKPFTP TLAQAKELFA LAKSKGLTVT
PYQNRRFDSC FLTAKKAIES GKLGEIVEVE SHFDYYRPVA ETKPGLPQDG AFYGLGVHTM
DQIISLFGRP DHVAYDIRSL RNKANPDDTF EAQLFYGDLK AIVKTSHLVK IDYPKFIVHG
KKGSFIKYGI DQQETSLKAN IMPGEPGFAA DDSVGVLEYV NDEGVTVREE MKPEMGDYGR
VYDALYQTIT HGAPNYVKES EVLTNLEILE RGFEQASPST VTLAK