Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5329 |
Symbol | |
ID | 6969401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4972358 |
End bp | 4973254 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388990 |
Product | 3-hydroxyisobutyrate dehydrogenase family protein |
Protein accession | YP_002273399 |
Protein GI | 209399372 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2084] 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.478364 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCAA TCGCGTTTAT CGGTTTAGGA CAAATGGGTT CGCCAATGGC GAGCAATTTA TTGCAGCAAG GGCACCAACT TCGCGTCTTT GATGTGAATG CCGAGGCTGT GCGGCATCTG GTAGACAAAG GCGCGACTCC CGCCGCCAAC CCGGCGCAGG CCGCTAAAGA TGCCGAATTT ATCATTACCA TGTTGCCGAA TGGCGATCTG GTGCGCAGCG TGTTGTTCGG TGAAAACGGC GTTTGCGAAA GCTTATCTAC CGATGCGCTG GTCATTGATA TGTCGACCAT CCATCCGCTG CAAACCGATA AATTGATTGC CGATATGCAA GCCAAAGGCT TCAACATGAT GGATGTTCCG GTAGGCCGTA CTTCTGCAAA TGCCATTACC GGTACTCTGT TACTGCTGGC TGGCGGCACC GCTGAACAAG TTGAACGTGC CACGCCGATC CTGATGGCGA TGGGCAGTGA GTTGATCAAC GCAGGCGGTC CGGGCATGGG GATCCGCGTT AAGCTCATCA ACAACTATAT GAGCATCGCG CTCAATGCGC TTTCGGCAGA AGCTGCCGTT TTGTGCGAAG CCCTGAATCT TCCCTTCGAT GTTGCCGTCA AAGTGATGAG CGGTACCGCC GCCGGTAAAG GCCACTTCAC CACTTCCTGG CCGAACAAAG TCCTCAGCGG GGATCTTTCT CCCGCCTTCA TGATCGATCT TGCCCATAAG GATCTTGGCA TCGCCCTTGA TGTCGCCAAC CAGCTGCATG TGCCAATGCC GCTGGGGGCC GCCTCACGGG AGGTTTATAG CCAGGCGCGC GCAGCGGGTC GCGGTCGCCA GGACTGGTCC GCCATTCTGG AACAGGTCCG TGTCAGTGCC GGGATGACTG CCAAAGTAAA AATGTAA
|
Protein sequence | MAAIAFIGLG QMGSPMASNL LQQGHQLRVF DVNAEAVRHL VDKGATPAAN PAQAAKDAEF IITMLPNGDL VRSVLFGENG VCESLSTDAL VIDMSTIHPL QTDKLIADMQ AKGFNMMDVP VGRTSANAIT GTLLLLAGGT AEQVERATPI LMAMGSELIN AGGPGMGIRV KLINNYMSIA LNALSAEAAV LCEALNLPFD VAVKVMSGTA AGKGHFTTSW PNKVLSGDLS PAFMIDLAHK DLGIALDVAN QLHVPMPLGA ASREVYSQAR AAGRGRQDWS AILEQVRVSA GMTAKVKM
|
| |