Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2380 |
Symbol | |
ID | 6146157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2423517 |
End bp | 2424719 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641617253 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_001744425 |
Protein GI | 170684206 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.162609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.98097 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTA CATCTGTAGA AATTTTTGAC TGCGAACTCA AAAAAAGAGA TCAAACTATG GCCTCTTATA ATCCAGTACT TATTCGGGTT AATACCGATG CTGGAATAAG TGGTGTTGGC GAAGTAGGAT TAGCCTATGG TGCAGGTGCA AAATCTGGTG TTGGTATTAT CAGAGATTTA GCCCCTCTCA TCATTGGTGA AGATCCGTTG AACATTGAGA AGATTTGGGA ATTCCTCTTC AGGAAAACAT TTTGGGGAAT GGGCGGTGGT AATGTTTTTT ACGCAGGTAT GAGCGCTATT GATATTGCCT TATGGGATAT TAAAGGTAAA TATTTAAACG TGCCGGTTTA TCAACTACTG GGGGGAAAGA CTAACGATAA ATTGAGAACT TACGCCAGTC AGTTACAATT TGGTTGGGGC GAAAAAAGGC AAATATTAGT AACGCCTGAA GAGTATGCAG AGGCAGCGCG TGCCGCTATT GCAGAAGGTT ATAATGCGAT AAAAGTTGAT CCGCTTGAAA TCGATCTGCA TGGCGGCGAT TGTGTCTTTC AGAACAAAAA TAGAAATTAC TCTGGCTTAT TACTGGCCGA ACAATTAAAA ATGGGCGAGG CAAGAATTGC CGCTATGCGT GAAGCAATGG GCGACGATGC TGATATTATT GTAGAGATTC ATTCTCTTCT AGGAACAAAT TCAGCGATCC AGTTTGCCAA AGCAATAGAG AAATATCGTA TTTTTCTCTA TGAGGAGCCA ATTCATCCAT TAAATTCGGA TAATATGCAG AAAGTTGCCC GTTCAACATC GATTCCGATA GCAACAGGTG AGCGTTCTTA TACTCGCTGG GGATATCGTG ATTTACTGGA AAAACAATCA ATTGCGGTAG CTCAACCTGA TTTGTGTCTT TGTGGTGGCA TTACAGAGGG AAAGAAAATT TGTGATTATG CCAATATATA TGACACAACC GTTCAGGTAC ATGTTTGTGG CGGGCCTGTT TCAACAGTAG CGGCGCTGCA TATGGAAACT GCAATACCTA ACTTTATTAT TCATGAACAT CATACCAATG CGATGAAAGC ATGTATTCGG GAACTTTGTA CTTACGATTA TCAACCCGAG AATGGCTATT ATGTCGCGCC AGAGTTGCCT GGGCTTGGTC AGGAATTAAA CGATGAAATC GTACAACAAT ATCTTGCCTA TGTGATCAAA TAA
|
Protein sequence | MKITSVEIFD CELKKRDQTM ASYNPVLIRV NTDAGISGVG EVGLAYGAGA KSGVGIIRDL APLIIGEDPL NIEKIWEFLF RKTFWGMGGG NVFYAGMSAI DIALWDIKGK YLNVPVYQLL GGKTNDKLRT YASQLQFGWG EKRQILVTPE EYAEAARAAI AEGYNAIKVD PLEIDLHGGD CVFQNKNRNY SGLLLAEQLK MGEARIAAMR EAMGDDADII VEIHSLLGTN SAIQFAKAIE KYRIFLYEEP IHPLNSDNMQ KVARSTSIPI ATGERSYTRW GYRDLLEKQS IAVAQPDLCL CGGITEGKKI CDYANIYDTT VQVHVCGGPV STVAALHMET AIPNFIIHEH HTNAMKACIR ELCTYDYQPE NGYYVAPELP GLGQELNDEI VQQYLAYVIK
|
| |