Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2498 |
Symbol | gutB |
ID | 6966950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2365686 |
End bp | 2366729 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643386367 |
Product | sorbitol dehydrogenase |
Protein accession | YP_002270849 |
Protein GI | 209398191 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.503095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATT CAAAAGCAAT ATTGCAGGTG CCGGGCACAA TGAAAATTAT TTCAGCAGAA ATACCAGTGC CTAAAGAAGA TGAAGTTTTG ATTAAAGTAG AATATGTCGG TATTTGTGGT TCAGATGTAC ATGGTTTTGA ATCAGGTCCG TTTATTCCGC CTAAAGACCC AAATCAAGAA ATTGGCCTGG GTCATGAATG CGCCGGGACG GTTGTGGCTG TGGGAAGCCG TGTGCGCAAA TTTAAACCGG GGGATCGGGT AAATATCGAA CCTGGCGTTC CTTGCGGTCA CTGTCGTTAC TGTCTGGAAG GCAAATATAA CATCTGCCCG GACGTTGATT TTATGGCGAC ACAACCCAAC TACCGCGGCG CATTAACGCA CTATCTGTGT CATCCGGAGA GCTTTACTTA CAAACTGCCC GACAATATGG ACACGATGGA AGGGACGCTG GTGGAGCCTG CCGCAGTCGG GATGCATGCC GCGATGCTGG CAGATGTTAA ACCGGGTAAG AAGATAATTA TTCTGGGAGC AGGTTGTATT GGTTTGATGA CGTTGCAAGC GTGCAAATGC CTGGGAGCAA CGGAAATTGC CGTCGTTGAT GTGCTGGAAA AACGTCTGGC AATGGCGGAA CAGCTTGGTG CGACAGTGGT TATTAACGGC GCAAAAGAAG ACACTATTGC ACGCTGTCAG CAATTTACCG AAGACATGGG CGCAGATATT GTTTTCGAAA CAGCGGGTTC TGCGGTCACC GTTAAACAGG CACCTTATCT GGTAATGCGT GGCGGTAAAA TTATGATTGT TGGTACTGTA CCCGGCGCTT CGGCAATCAA TTTCCTCAAA ATCAATCGCG AAGTCACTAT CCAGACGGTA TTCCGCTATG CCAATCGTTA TCCGGTCACG ATTGAAGCAA TTTCTTCAGG GCGATTCGAT GTGAAATCGA TGGTGACGCA TATTTACGAT TATCGGGATG TACAACAGGC ATTTGAAGAG TCAGTTAACA ACAAACGCGA CATTATTAAA GGCGTTATTA AGATTAGCGA TTAA
|
Protein sequence | MKNSKAILQV PGTMKIISAE IPVPKEDEVL IKVEYVGICG SDVHGFESGP FIPPKDPNQE IGLGHECAGT VVAVGSRVRK FKPGDRVNIE PGVPCGHCRY CLEGKYNICP DVDFMATQPN YRGALTHYLC HPESFTYKLP DNMDTMEGTL VEPAAVGMHA AMLADVKPGK KIIILGAGCI GLMTLQACKC LGATEIAVVD VLEKRLAMAE QLGATVVING AKEDTIARCQ QFTEDMGADI VFETAGSAVT VKQAPYLVMR GGKIMIVGTV PGASAINFLK INREVTIQTV FRYANRYPVT IEAISSGRFD VKSMVTHIYD YRDVQQAFEE SVNNKRDIIK GVIKISD
|
| |