Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5498 |
Symbol | |
ID | 6968326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5145929 |
End bp | 5146735 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643389142 |
Product | sorbitol-6-phosphate 2-dehydrogenase |
Protein accession | YP_002273539 |
Protein GI | 209395780 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACGT GGTTAAATTT GCAGGATAAA ATCATTATCG TCACCGGCGG CGCATCCGGT ATTGGTCTGG CGATTGTAGA GGAATTATTA GCACAAGGCG CAAACGTACA GATGGTAGAT ATTCACGGTG GCGATGGACA ATATGAAGGT CATAAAGGCT ATCAGTTTTG GCCGACCGAT ATTTCCAGCG CCAAAGAGGT AAATCATACG GTAGCGGAAA TTATCCAGCG TTTTGGTCGC ATCGACGGTC TGGTCAATAA CGCCGGGGTC AATTTCCCGC GTCTGCTGGT CGATGAGAAA GCGCCTGCCG GGCAGTATGA ACTCAACGAA GCTGCATTCG AAAAAATGGT CAATATCAAC CAGAAAGGCG TTTTTCTGAT GTCGCAGGCG GTGGCGCGAC AGATGGTCAA ACAACATGAT GGCGTGATTG TGAATGTTTC CTCAGAAAGT GGGCTGGAAG GCTCAGAAGG CCAAAGCTGT TACGCCGCTA CCAAAGCCGC GCTCAATAGC TTCACGCGCT CCTGGTCGAA AGAGCTGGGT AAACACGGTA TCCGTGTGGT CGGTATCGCG CCGGGGATTC TGGAAAAAAC CGGGCTACGC ACGCCGGAAT ATGAAGAGGC GCTGGCGTGG ACGCGCAATA TCACCGTCGA GCAGCTGCGT GAAGGCTATA CCAAAAACGC CATTCCTATT GGGCGCGCCG GAAGATTAGC AGAAATCGCT GATTTTGTTT GTTATCTGCT GTCTGAACGC GCCAGCTATA TCACCGGAGT AACCACTAAC ATTGCGGGCG GCAAAACGCG CGGCTAA
|
Protein sequence | MQTWLNLQDK IIIVTGGASG IGLAIVEELL AQGANVQMVD IHGGDGQYEG HKGYQFWPTD ISSAKEVNHT VAEIIQRFGR IDGLVNNAGV NFPRLLVDEK APAGQYELNE AAFEKMVNIN QKGVFLMSQA VARQMVKQHD GVIVNVSSES GLEGSEGQSC YAATKAALNS FTRSWSKELG KHGIRVVGIA PGILEKTGLR TPEYEEALAW TRNITVEQLR EGYTKNAIPI GRAGRLAEIA DFVCYLLSER ASYITGVTTN IAGGKTRG
|
| |