Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4313 |
Symbol | |
ID | 6967272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3991336 |
End bp | 3992376 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388042 |
Product | aldo-keto reductase |
Protein accession | YP_002272480 |
Protein GI | 209399849 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTGGT TAGCGAATCC CGAACGTTAC GGGCAGATGC AATACCGCTA TTGCGGAAAA AGCGGTTTAC GCCTGCCCGC GTTATCGCTC GGTTTATGGC ACAATTTCGG TCACGTTAAC GCGCTGGAAT CACAGCGTGC GATCCTGCGT AAAGCGTTTG ATTTAGGCAT TACGCACTTT GATTTAGCCA ACAATTACGG GCCGCCTCCA GGAAGCGCAG AAGAGAACTT TGGTCGCCTG CTGCGGGAGG ATTTTGCCGC TTATCGCGAT GAACTGATTA TCTCTACCAA AGCTGGCTAC GATATGTGGC CCGGCCCTTA CGGCTCTGGC GGTTCACGTA AATACCTGCT CGCCAGCCTC GACCAAAGCC TGAAGCGTAT GGGGCTTGAG TATGTCGATA TCTTTTACTC TCATCGCGTC GATGAAAATA CGCCGATGGA AGAAACCGCC TCTGCGCTGG CTCATGCGGT ACAAAGCGGT AAGGCGCTGT ATGTCGGGAT CTCCTCTTAC TCACCAGAGC GGACGCAAAA AATGGTCGAG TTGCTGCACG AGTGGAAAAT TCCGCTGTTA ATTCATCAAC CTTCGTACAA TTTACTAAAC CGCTGGGTGG ATAAAAGCGG CCTGCTGGAT ACCCTGCAAA ATAACGGCGT GGGCTGCATT GCCTTTACTC CTCTGGCTCA GGGATTGCTG ACCGGAAAAT ATCTCAACGG CATTCCTGAA GATTCACGGA TGCATCGTGA AGGGAATAAA GTTCGTGGTC TGACGCCGAA AATGCTCACC GAAGCCAACC TCAACAGCCT ACGCTTATTG AATGAAATGG CACAGCAGCG TGGACAATCA ATGGCGCAAA TGGCGTTAAG CTGGTTGCTG AAAGATGAGC GAGTGACGTC GGTATTGGTT GGTGCCAGCC GCGCGGAGCA ACTTGAGGAG AACGTGCAGG CGCTGAATAA TCTGACATTT AGCACCGAGG AGCTGGCGCA GATCGATCAG CATATCGCCG ATGGCGAGCT GAATCTGTGG CAGGCGTCTT CCGATAAATG A
|
Protein sequence | MVWLANPERY GQMQYRYCGK SGLRLPALSL GLWHNFGHVN ALESQRAILR KAFDLGITHF DLANNYGPPP GSAEENFGRL LREDFAAYRD ELIISTKAGY DMWPGPYGSG GSRKYLLASL DQSLKRMGLE YVDIFYSHRV DENTPMEETA SALAHAVQSG KALYVGISSY SPERTQKMVE LLHEWKIPLL IHQPSYNLLN RWVDKSGLLD TLQNNGVGCI AFTPLAQGLL TGKYLNGIPE DSRMHREGNK VRGLTPKMLT EANLNSLRLL NEMAQQRGQS MAQMALSWLL KDERVTSVLV GASRAEQLEE NVQALNNLTF STEELAQIDQ HIADGELNLW QASSDK
|
| |