Gene Information |
Locus tag | Saro_3675 |
Symbol | |
ID | 5077823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 308174 |
End bp | 309382 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481398 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001166060 |
Protein GI | 146275900 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
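The coordinates and lengths in the table above agree with one another, and a minimal sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length_bp = gene_length(308174, 309382)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1209 402
```

The strand does not affect this check, because the length depends only on the two coordinates.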
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.468042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA CCGCCGCCCG CGTCATCATC ACCTGTCCGG GCCGCAACTT CGTGACGCTC AAGATCGAGA CCGATCAGGG CGTCTACGGT ATCGGCGATG CCACGCTCAA CGGGCGCGAA CTCTCGGTCG TCGCCTACCT GCAGGAACAC GTCGCGCCTT GCCTCATCGG CATGGACCCG CGCCGCATCG AGGACATCTG GCAGTATGTC TATCGCGGCG CCTACTGGCG GCGCGGCCCG GTGACGATGC GCGCGATCGC CGCGGTCGAC ATGGCACTGT GGGACATCAA GGCCAAGATG GCCGGCATGC CGCTGTACCA GCTTCTGGGC GGACGCAGCC GCGACGGGAT CATGGTCTAC GGCCACGCCA ACGGCTCGGA CATCGCAGAG ACGGTGGAGG CCGTCGGCCA CTACATCGAC ATGGGCTACA AGGCGATCCG CGCCCAGACC GGCGTGCCCG GCATCAAGGA CGCCTATGGC GTGGGTCGCG GCAAGCTCTA CTACGAACCG GCCGATGCCA GCCTCCCCTC GGTCACCGGC TGGGACACGC GCAAGGCACT GAATTACGTG CCAAAGTTGT TCGAGGAACT GCGCAAGACC TACGGCTTCG ACCACCACCT CCTGCACGAT GGCCACCACC GCTACACCCC GCAGGAAGCC GCCAACCTCG GCAAGATGCT CGAACCCTAC CAGCTGTTCT GGCTGGAGGA CTGCACCCCT GCCGAAAACC AGGAAGCTTT CAGGCTCGTC CGCCAGCACA CCGTCACCCC GCTCGCCGTG GGCGAGATCT TCAACACGAT CTGGGACGCC AAGGACCTGA TCCAGAACCA GCTCATCGAC TACATCCGCG CCACCGTCGT CGGCGCGGGC GGCCTCACCC ACCTGCGCCG CATCGCCGAC CTTGCCAGCC TCTACCAGGT CCGCACCGGC TGCCACGGCG CGACCGACCT TTCGCCGGTG ACGATGGGCT GCGCGCTTCA CTTCGACACC TGGGTGCCCA ACTTCGGCAT CCAGGAATAC ATGCGCCACA CCGAGGAGAC CGACGCGGTG TTCCCGCACG ACTACTGGTT CGAGAAGGGT GAGCTGTTCG TCGGCGAAAC CCCCGGCCAC GGCGTCGACA TCGACGAGGA ACTGGCTGCC AAATACCCTT ACAAGCCCGC CTACCTGCCG GTTGCCCGGC TGGAAGACGG CACGATGTGG AACTGGTAA
|
Protein sequence | MKITAARVII TCPGRNFVTL KIETDQGVYG IGDATLNGRE LSVVAYLQEH VAPCLIGMDP RRIEDIWQYV YRGAYWRRGP VTMRAIAAVD MALWDIKAKM AGMPLYQLLG GRSRDGIMVY GHANGSDIAE TVEAVGHYID MGYKAIRAQT GVPGIKDAYG VGRGKLYYEP ADASLPSVTG WDTRKALNYV PKLFEELRKT YGFDHHLLHD GHHRYTPQEA ANLGKMLEPY QLFWLEDCTP AENQEAFRLV RQHTVTPLAV GEIFNTIWDA KDLIQNQLID YIRATVVGAG GLTHLRRIAD LASLYQVRTG CHGATDLSPV TMGCALHFDT WVPNFGIQEY MRHTEETDAV FPHDYWFEKG ELFVGETPGH GVDIDEELAA KYPYKPAYLP VARLEDGTMW NW
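The sequences above are printed in space-separated blocks of 10 letters, so the spaces have to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the 65% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence in this record (illustration only).
example = "ATGAAAATCA CCGCCGCCCG"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the full Gene sequence field to `gc_percent` gives the value to compare against the GC content row.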