Gene Information |
Locus tag | Smed_4969 |
Symbol | |
ID | 5318032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1483388 |
End bp | 1484665 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776751 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001313683 |
Protein GI | 150377087 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
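The Gene Length and Protein Length fields can be cross-checked against Start bp and End bp. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1483388, 1484665)
print(length, protein_length(length))  # prints: 1278 425
```

Both results agree with the values listed in the record (1278 bp and 425 aa).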
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.428522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0904039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCA TAACAGACCT GCGCGTCTTC GACCTGCGCT TCCCGACCTC CGCAAGCCTT GATGGCTCGG ATGCCATGAA CCCGGACCCG GACTATTCGG CGGCCTACGT CATCCTCGAT ACCGATCGCC CAGGGCTGGC GGGGCACGGC CTCACCTTCA CCATCGGGCG CGGCAATGAC GTCTGCTGCA TGGCTATCCA GGCCATGCGG CATCTCGTCG TCGGTCAGGA CATTGGAGAT ATTCTGAAAC ATCCCGGTCG CTTCTGGCGG CACCTGACCA GCGACAGCCA GTTGCGCTGG ATCGGACCCG AAAAGGGCGC AATCCATCTT GCGACCGGGG CAATCGTCAA CGCGATCTGG GACCTCCTTG CGAAGGACGC CGGCAAGCCG GTATGGCGGC TCGTCAGCGA GATGTCGGCG GAAGAGATCG CCGACATCGT CGACTACCGT TACATCACGG ATGTGCTTAC CCGCGACGAG GCGATCGAGA TTTTGCGGCG GGCCGAAACC GGCAAGGCCG ACCGGATCGC CGCGCTCGAG AAGGAGGGCT ATCCCTGCTA CACGACCTCG GCGGGTTGGC TCGGTTATGA CGACGAGAAG CTCCGGCGGC TTGCGCAGGA GGCAGTCGAC GCAGGCTTCA ACCATATCAA GATGAAAGTC GGCCGCGATC TCGCCGACGA TATGCGCCGG TTGAGGATCG CCCGTGAAGT GATCGGCCCC GACAGATATC TGATGATCGA CGCGAACCAG GTCTGGGAAG TCGGCGAGGC GATCGACTGG GTGCAGAAGC TCGCTTTCGC CAAACCCTTC TTCATCGAGG AGCCGACGAG CCCCGACGAC GTAGCCGGGC ATCGCAAGAT CCGCGAGGCG ATCGGGCCGG TCAAGGTCGC AACCGGTGAG ATGTGCCAGA ACCGTATCAT GTTCAAGCAG TTCATCGCCG AGGGTGCGAT CGACATCGTG CAGATCGATT CCTGCCGGAT GGGTGGGCTC AACGAGGTTC TGGCCGTGCT TCTGATCGCT GCCAAATACG GGCTTCCCGT ATGGCCGCAC GCCGGTGGGG TCGGTCTCTG CGAATATGTG CAGCACCTGT CGATGATCGA CTACGTCGCC GTTTCCGGTA CGAAAGACGG CCGGGTCATC GAATATGTCG ACCACCTGCA CGAGCACTTC CTGGATCCGT GCGTCATCAG CGACGCTGCC TATATGCCGC CGTCCCGACC CGGATTCTCG ATTGAGATGA AGGAAAAGTC GATTGAGGAT TATACGTTTC GCGGCTGA
|
Protein sequence | MTRITDLRVF DLRFPTSASL DGSDAMNPDP DYSAAYVILD TDRPGLAGHG LTFTIGRGND VCCMAIQAMR HLVVGQDIGD ILKHPGRFWR HLTSDSQLRW IGPEKGAIHL ATGAIVNAIW DLLAKDAGKP VWRLVSEMSA EEIADIVDYR YITDVLTRDE AIEILRRAET GKADRIAALE KEGYPCYTTS AGWLGYDDEK LRRLAQEAVD AGFNHIKMKV GRDLADDMRR LRIAREVIGP DRYLMIDANQ VWEVGEAIDW VQKLAFAKPF FIEEPTSPDD VAGHRKIREA IGPVKVATGE MCQNRIMFKQ FIAEGAIDIV QIDSCRMGGL NEVLAVLLIA AKYGLPVWPH AGGVGLCEYV QHLSMIDYVA VSGTKDGRVI EYVDHLHEHF LDPCVISDAA YMPPSRPGFS IEMKEKSIED YTFRG
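The sequences above are displayed in space-separated blocks of 10 characters. As a hedged sketch (the helper names are hypothetical and not part of the record), the blocks can be joined into a contiguous string and a GC fraction computed from it:

```python
def clean_sequence(displayed: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, used only as a short example.
blocks = "ATGACCCGCA TAACAGACCT"
seq = clean_sequence(blocks)
print(len(seq))  # prints: 20
print(round(gc_fraction(seq) * 100), "% GC in this short example")
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.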