Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd1591_3561 |
Symbol | |
ID | 8120025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | - |
Start bp | 4039548 |
End bp | 4040738 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644853936 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003005849 |
Protein GI | 251791128 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTAA GTGCGAATTC AGATGCCGTT TCCTATGCCA GCGTTACCGG CGTTAAAACG GCGGCGGAAA CCGGCGACCG TATTGAGTGG GTTAAGCTTT CGCTGGCGTT TTTGCCGCTG GCAACGCCGG TGAGTGATGC CAAGGTATTG ACCGGGCGTC AGAAACCGCT GACCGAAGTG GCGATTATTA TCGCAGAGAT TCGTTCCCGT GACGGGTTTG AAGGCGTGGG GTTCAGCTAC TCCAAGCGCG CGGGCGGCCA GGGGATTTAT GCGCACGCGA AAGAGATTGC CGATAATTTG CTGGGGGAAG ACCCAAACGA TATCGATAAA ATCTACAGCA AGCTGCTGTG GGCCGGTGCC TCGGTGGGGC GTAGCGGCAT GGCCGTGCAG GCTATTTCTC CTATCGACAT CGCGCTTTGG GATATGAAAG CCAAACGCGC CGGATTGCCG CTGGCGAAAT TACTGGGCGC GCACCGGGAT TCGGTCCGAT GCTACAACAC CTCCGGCGGT TTTTTGCATA CTCCGCTGGA ACAGGTGCTG AAAAACGTGG CGCTCTCGCG GGAAAGCGGT ATCGGCGGCA TCAAGCTGAA AGTCGGGCAA CCCAACACCG CAGAGGATAT TCGCCGTCTG ACCGCGGTGC GTGAAGCGCT GGGCGATGAT TTCCCGCTGA TGGTGGATGC CAACCAGCAA TGGGATCGGG AAACGGCTAT CCGCATGGGG CGTAAGATGG AGTCATTCAA TCTGGTCTGG ATTGAAGAGC CGCTGGACGC CTACGATGTG GAAGGCCACG CGCAGCTTGC CGCGGCACTC GACACGCCGA TCGCCACCGG CGAAATGCTC ACCAGTTTCC GCGAACATGA ACAGTTGATT CTGGGCAACG CCAGTGACGT TGTACAGCCG GATGCACCGC GCGTCGGCGG GATTTCCCCG TTCCTGAAAA TTATGGATTT GGCGGCTAAA CATGGCCGCA CGCTGGCGCC GCATTTTGCC ATGGAAGTGC ATTTGCACCT GGCCGCCGCT TACCCGCTGG AACCTTGGCT GGAGCACTTT GAATGGCTCA ACCCGCTGTT TAACGAACAG CTCGAGTTGC GCGACGGGCG TATGTGGGTG TCGGATCGCC ACGGTCTGGG CTTTACCCTG AGCGAACAGG CTCGCCGATG GACGCAGCTC AGCTGTGAAT ATGGCAAATA G
|
Protein sequence | MSLSANSDAV SYASVTGVKT AAETGDRIEW VKLSLAFLPL ATPVSDAKVL TGRQKPLTEV AIIIAEIRSR DGFEGVGFSY SKRAGGQGIY AHAKEIADNL LGEDPNDIDK IYSKLLWAGA SVGRSGMAVQ AISPIDIALW DMKAKRAGLP LAKLLGAHRD SVRCYNTSGG FLHTPLEQVL KNVALSRESG IGGIKLKVGQ PNTAEDIRRL TAVREALGDD FPLMVDANQQ WDRETAIRMG RKMESFNLVW IEEPLDAYDV EGHAQLAAAL DTPIATGEML TSFREHEQLI LGNASDVVQP DAPRVGGISP FLKIMDLAAK HGRTLAPHFA MEVHLHLAAA YPLEPWLEHF EWLNPLFNEQ LELRDGRMWV SDRHGLGFTL SEQARRWTQL SCEYGK
|
| |