Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4481 |
Symbol | |
ID | 6145264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4577862 |
End bp | 4578668 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619297 |
Product | sorbitol-6-phosphate 2-dehydrogenase |
Protein accession | YP_001746409 |
Protein GI | 170681378 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.628006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACGT GGTTAAATTT GCAGGATAAA ATCATTATTG TCACCGGCGG CGCATCCGGT ATTGGTCTGG CGATTGTGGA TGAATTATTA GCACAAGGCG CGAATGTACA GATGGTCGAT ATTCACGGTG GCGATGGTCA ATATGAAAAC CATAAAGGTT ATCAGTTCTG GCCGACCGAT ATTTCCAGCG CCAAAGAGGT AAATCATACG GTAGCAGAAA TTATCCAGCG TTTTGGTCGC ATCGACGGTC TGGTCAATAA CGCCGGGGTC AATTTCCCGC GTCTGCTGGT CGATGAGAAA GCGCCTGCCG GGCAGTATGA ACTCAACGAA GCGGCATTCG AAAAAATGGT CAATATCAAC CAGAAAGGCG TTTTTCTGAT GTCGCAGGCG GTGGCGCGAC AGATGGTCAA ACAACATGAT GGCGTGATTG TGAATGTTTC CTCAGAAAGT GGGCTGGAAG GCTCAGAAGG CCAAAGCTGT TACGCCGCGA CCAAAGCCGC GCTCAATAGC TTCACGCGCT CCTGGTCGAA AGAGCTGGGT AAGCACGGTA TCCGTGTGGT CGGTATCGCG CCGGGGATTC TGGAAAAAAC AGGACTGCGT ACGCCGGAAT ATGAAGAAGC GCTGGCGTGG ACGCGCAATA TCACCGTCGA GCAGCTGCGT GAAGGCTATA CCAAAAACGC CATTCCTATT GGGCGCGCCG GAAGATTAGC AGAAGTGGCT GATTTTGTTT GTTATCTGCT GTCTGAACGC GCCAGCTATA TCACCGGAGT AACCACTAAC ATTGCGGGCG GCAAAACGCG CGGCTAA
|
Protein sequence | MQTWLNLQDK IIIVTGGASG IGLAIVDELL AQGANVQMVD IHGGDGQYEN HKGYQFWPTD ISSAKEVNHT VAEIIQRFGR IDGLVNNAGV NFPRLLVDEK APAGQYELNE AAFEKMVNIN QKGVFLMSQA VARQMVKQHD GVIVNVSSES GLEGSEGQSC YAATKAALNS FTRSWSKELG KHGIRVVGIA PGILEKTGLR TPEYEEALAW TRNITVEQLR EGYTKNAIPI GRAGRLAEVA DFVCYLLSER ASYITGVTTN IAGGKTRG
|
| |