Gene Information |
Locus tag | EcSMS35_2401 |
Symbol | |
ID | 6146910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2449589 |
End bp | 2450794 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617274 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_001744446 |
Protein GI | 170683394 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
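The Gene Length and Protein Length fields follow from the coordinates listed above. A minimal Python sketch of that arithmetic, assuming Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (both assumptions are consistent with the numbers in this record but are not stated by it); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2449589, 2450794)
print(length_bp, protein_length(length_bp))  # 1206 401
```

With the values from this record, the sketch reproduces the listed 1206 bp and 401 aa.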
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTAC CAAAAATCAA ACAGGTTCGC GCCTGGTTTA CTGGCGGTGC GACAGCAGAA AAAGGCGCTG GCGGTGGTGA TTATCACGAC CAGGGGGCGA ATCACTGGAT TGACGATCAT ATTGCCACCC CGATGAGTAA ATACCGCGAT TACGAGCAGT CACGCCAGTC ATTTGGCATT AACGTTCTTG GCACGTTGAT TGTTGAAGTC GAAGCAGAAA ACGGCCAGAC CGGATTCGCC GTTTCGACAG CCGGTGAAAT GGGCTGTTTT ATTGTCGAAA AACACCTTAA CCGTTTCATT GAGGGTAAAT GTGTCAGTGA TATCAAACTG ATCCACGATC AAATGCTCAA TGCCACCCTG TATTACTCCG GCTCTGGTGG CCTGGTGATG AATACGATTT CCTGTGTCGA TCTGGCTCTG TGGGATCTGT TCGGCAAAGT GGTCGGGCTT CCGGTTTATA AACTTTTAGG CGGCGCAGTT CGTGATGAGA TTCAGTTCTA CGCCACAGGT GCGCGTCCGG ATCTGGCAAA AGAGATGGGC TTTATCGGTG GCAAAATGCC GACGCACTGG GGGCCACATG ATGGCGATGC GGGGATCCGC AAAGATGCCG CTATGGTCGC GGATATGCGT GAAAAATGCG GTGAGGATTT CTGGTTAATG CTCGACTGCT GGATGAGTCA GGACGTGAAC TATGCCACCA AACTGGCCCA CGCTTGCGCG CCCTATAACC TGAAATGGAT CGAAGAGTGC CTGCCGCCAC AGCAGTATGA AGGTTATCGC GAACTGAAAC GCAACGCACC AGCGGGGATG ATGGTCACCA GCGGTGAGCA CCACGGCACA TTGCAATCTT TCCGTACGCT TTCAGAAACC GGTATCGACA TTATGCAGCC GGATGTTGGC TGGTGCGGTG GCTTAACCAC GCTGGTGGAA ATTGCCGCAA TCGCCAAATC CAGGGGGCAA CTGGTGGTGC CGCACGGTTC GTCTGTTTAC TCCCACCATG CGGTGATCAC CTTCACCAAT ACGCCATTCA GTGAATTCCT GATGACCAGC CCGGATTGTT CAACGATGCG TCCGCAATTT GATCCGATTC TGCTTAATGA GCCGGTTCCG GTGAATGGTC GTATTCATAA ATCAGTGCTT GATAAACCCG GTTTCGGCGT TGAACTCAAT CGTGACTGCA ATTTGAAACG CCCCTACAGC CACTAA
|
Protein sequence | MTLPKIKQVR AWFTGGATAE KGAGGGDYHD QGANHWIDDH IATPMSKYRD YEQSRQSFGI NVLGTLIVEV EAENGQTGFA VSTAGEMGCF IVEKHLNRFI EGKCVSDIKL IHDQMLNATL YYSGSGGLVM NTISCVDLAL WDLFGKVVGL PVYKLLGGAV RDEIQFYATG ARPDLAKEMG FIGGKMPTHW GPHDGDAGIR KDAAMVADMR EKCGEDFWLM LDCWMSQDVN YATKLAHACA PYNLKWIEEC LPPQQYEGYR ELKRNAPAGM MVTSGEHHGT LQSFRTLSET GIDIMQPDVG WCGGLTTLVE IAAIAKSRGQ LVVPHGSSVY SHHAVITFTN TPFSEFLMTS PDCSTMRPQF DPILLNEPVP VNGRIHKSVL DKPGFGVELN RDCNLKRPYS H
|
| |
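The Gene sequence and Protein sequence fields can be cross-checked against each other, and the GC content field can be recomputed from the gene sequence. A minimal Python sketch, assuming the gene sequence is shown 5' to 3' on the coding strand (the record places the gene on the minus strand, so this presumes the displayed sequence is already reverse-complemented, which its ATG start and TAA stop suggest). It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch does not handle ambiguous bases such as N.

```python
# Standard genetic code, shared by translation table 11 for amino-acid
# assignments. Codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces used to group residues in blocks of ten; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGACCCTAC CAAAAATCAA ACAGGTTCGC"))  # MTLPKIKQVR
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field checks the two fields against each other; `gc_percent` on the same input can be compared with the GC content field.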