Gene ECH74115_2336 details


Gene Information

Locus tag: ECH74115_2336
Symbol:
ID: 6966707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2209194
End bp: 2210234
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 53%
IMG OID: 643386210
Product: putative oxidoreductase
Protein accession: YP_002270694
Protein GI: 209400913
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
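
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2209194
end_bp = 2210234
stop_codon_bp = 3  # assumption: one terminal stop codon, not translated

# Assumption: both coordinates are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Each amino acid is encoded by one 3-base codon.
protein_length = (gene_length - stop_codon_bp) // 3

print(gene_length)     # 1041
print(protein_length)  # 346
```

Both results agree with the Gene Length and Protein Length fields listed in the record.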


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA ACATCCGTGT TGGGTTGATT GGGTATGGTT ATGCGAGCAA AACCTTCCAT 
GCGCCCCTGA TTGCGGGCAC ACCCGGGCTG GAACTGGCGG TAATCTCCAG CAGTGATGAA
ACAAAAGTAA AAGCCGACTG GCCAACGGTT ACGGTTGTCT CTGAGCCGAA GCATCTGTTT
AACGATCCCA ACATAGACCT GATTGTCATT CCTACACCCA ACGATACCCA TTTCCCGTTA
GCCAAAGCAG CGCTTGAGGC GGGTAAACAT GTGGTCGTTG ATAAACCCTT TACCGTGACA
CTGTCACAAG CGCGAGAGCT GGATGCGCTG GCAAAAAGCC TGGGGCGTGT TCTGTCTGTA
TTCCATAACC GTCGTTGGGA TAGCGATTTC TTGACGCTAA AAGGTTTGCT CGCGGAAGGC
GTGCTGGGTG AAGTTGCTTA CTTTGAGTCT CATTTTGACC GCTTCCGTCC GCAGGTGCGC
GATCGTTGGC GTGAACAGGG CGGCCCAGGC AGCGGTATCT GGTACGATTT AGCACCACAT
CTTCTTGATC AGGCCATTAT GCTATTTGGT TTACCGGTCA GCATGACGGT AGATTTGGCA
CAGTTACGGC CCGGAGCGCA GTCGACCGAT TATTTCCACG CCATCTTGTC CTATCCACAG
CGGCGAGTCA TTTTACACGG TACCATGCTG GCAGCCGCTG AGTCAGCACG GTATATCGTG
CATGGATCCC GAGGCAGTTA TGTGAAATAT GGCCTCGATC CACAGGAAGA ACGTCTGAAA
AATGGCGAGC GTCTGCCGCA GGAAGACTGG GGCTACGATA TGCGTGATGG CGTACTTACC
CGCGTGGAAG GTGAGGAACG TGTCGAAGAA ACGCTGTTGA CGGTGCCTGG GAATTATCCG
GCTTACTATG CGGCGATTCG TGATGCGTTA AATGGCGATG GTGAAAATCC GGTTCCGGCA
AGCCAGGCAA TCCAGGTAAT GGAGTTGATT GAGCTGGGCA TCGAATCCGC CAAACATCGC
GCGACTTTGT GCCTTGCATG A
 
Protein sequence
MSDNIRVGLI GYGYASKTFH APLIAGTPGL ELAVISSSDE TKVKADWPTV TVVSEPKHLF 
NDPNIDLIVI PTPNDTHFPL AKAALEAGKH VVVDKPFTVT LSQARELDAL AKSLGRVLSV
FHNRRWDSDF LTLKGLLAEG VLGEVAYFES HFDRFRPQVR DRWREQGGPG SGIWYDLAPH
LLDQAIMLFG LPVSMTVDLA QLRPGAQSTD YFHAILSYPQ RRVILHGTML AAAESARYIV
HGSRGSYVKY GLDPQEERLK NGERLPQEDW GYDMRDGVLT RVEGEERVEE TLLTVPGNYP
AYYAAIRDAL NGDGENPVPA SQAIQVMELI ELGIESAKHR ATLCLA
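
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the blocks might be joined, the GC fraction computed, and the DNA translated. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool. The codon table is the standard genetic code, which has the same codon-to-amino-acid assignments as translation table 11 (the two differ only in permitted start codons). Only the first three blocks of the gene sequence are used here; ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = "ATGAGCGACA ACATCCGTGT TGGGTTGATT"
dna = clean_sequence(fragment)

print(len(dna))         # 30
print(translate(dna))   # MSDNIRVGLI
print(gc_fraction(dna))
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence through the same functions would allow the GC content and Protein Length fields to be checked in the same way.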