Gene Information |
Locus tag | EcSMS35_0779 |
Symbol | galM |
ID | 6145327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 779807 |
End bp | 780847 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615667 |
Product | aldose 1-epimerase |
Protein accession | YP_001742859 |
Protein GI | 170681719 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2017] Galactose mutarotase and related enzymes |
TIGRFAM ID | [TIGR02636] galactose mutarotase |
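The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 779807
END_BP = 780847
GENE_LENGTH_BP = 1041
PROTEIN_LENGTH_AA = 346


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                  # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under these assumptions, 780847 - 779807 + 1 gives 1041 bp, and 1041 / 3 - 1 gives 346 aa, matching the Gene Length and Protein Length fields.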
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.184805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGAACG AAACTCCCGC ACTGGCACCC GATGGTCAGC CGTACCGACT GTTAACTTTG CGTAACAACG CAGGGATGGT AGTCACGCTG ATGGACTGGG GTGCGACTTT ACTTTCCGCC CGTATTCCGC TTTCCGATGG CAGCGTCCGC GAGGCGCTGC TTGGCTGCGC CAGCCCGGAA TGCTATCAGG ATCAGGCCGC ATTTCTGGGG GCCTCTATTG GTCGTTATGC CAACCGCATC GCCAATAGCC GTTATACCTT TGACGGTGAA ACCGTGACGC TTTCGCCAAG TCAGGGCGTT AACCAGCTGC ACGGCGGCCC GGAAGGGTTC GACAAACGTC GCTGGCAGAT TGTGAACCAG AATGATCGTC AGGTGCTATT TGCCCTGAGT TCAGATGATG GCGATCAGGG CTTCCCGGGT AATCTCGGCG CGACGGTGCA ATATCGTCTG ACCGACGATA ACCGTATCTC CATTACTTAT CGCGCCACAG TTGATAAACC TTGCCCGGTG AATATGACTA ATCACGTCTA TTTCAACCTC GACGGCGAGC AGTCTGACGT GCGCAATCAC AAGTTGCAGA TTCTGGCGGA CGAATATCTG CCGGTTGATG AAGGCGGCAT TCCGCACGAC GGTCTGAAAT CTGTCGCCGG AACATCTTTT GATTTCCGCA GCGCCAAAAT CATCGCCAGT GAGTTTCTTG CCGACGACGA TCAGCGCAAA GTGAAAGGTT ACGATCACGC GTTCTTGTTA CAGGCCAAAG GCGATGGCAA GAAAGTGGCG GCGCATGTCT GGTCAGCAGA TGAAAAATTG CAGCTGAAGG TCTACACCAC CGCTCCGGCT CTGCAATTCT ACTCCGGCAA CTTCCTCGGC GGCACACCGT CGCGGGGAAC CGAACCTTAC GCCGACTGGC AAGGGCTGGC TCTGGAAAGC GAATTTCTGC CGGACAGTCC GAACCACCCT GAATGGCCGC AACCGGACTG CTTCCTGCGT CCTGGCGAAG AGTATTCCAG CCTGACGGAA TATCAGTTTA TTGCTGAGTA A
|
Protein sequence | MLNETPALAP DGQPYRLLTL RNNAGMVVTL MDWGATLLSA RIPLSDGSVR EALLGCASPE CYQDQAAFLG ASIGRYANRI ANSRYTFDGE TVTLSPSQGV NQLHGGPEGF DKRRWQIVNQ NDRQVLFALS SDDGDQGFPG NLGATVQYRL TDDNRISITY RATVDKPCPV NMTNHVYFNL DGEQSDVRNH KLQILADEYL PVDEGGIPHD GLKSVAGTSF DFRSAKIIAS EFLADDDQRK VKGYDHAFLL QAKGDGKKVA AHVWSADEKL QLKVYTTAPA LQFYSGNFLG GTPSRGTEPY ADWQGLALES EFLPDSPNHP EWPQPDCFLR PGEEYSSLTE YQFIAE
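The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial code listed in the record), GTG is an accepted alternative start codon and is read as methionine at the initiation position, even though it encodes valine elsewhere. The sketch below illustrates this on the first 30 bases only. The codon dictionary is deliberately partial (just the codons in that prefix), the start-codon set is a subset of those table 11 allows, and `translate_prefix` is a hypothetical helper, not part of any library.

```python
# Partial codon table: only the codons present in the first 30 bases of the gene.
CODON_TO_AA = {
    "GTG": "V", "CTG": "L", "AAC": "N", "GAA": "E",
    "ACT": "T", "CCC": "P", "GCA": "A",
}

# Subset of the start codons permitted by translation table 11 (assumption:
# only the most common ones are listed here).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short coding-strand prefix, reading a valid start codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as methionine
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First three 10-base groups of the gene sequence, as printed in the record.
print(translate_prefix("GTGCTGAACG AAACTCCCGC ACTGGCACCC"))  # MLNETPALAP
```

The result matches the first ten residues of the listed protein sequence. For full-length translation, a complete table 11 implementation from an established library would be the safer choice.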
|
| |