Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0846 |
Symbol | |
ID | 4076021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 897499 |
End bp | 898674 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006144 |
Product | mandelate racemase/muconate lactonizing-like protein |
Protein accession | YP_612841 |
Protein GI | 99080687 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.122674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC CTTATTCCAC GTCCGGCGCA ATGCCGGTGA CACATGCGCG CGGTGCCGAC GACACTGGCC CGCGCATCAG CAAGATCGAA ACCTTCTGCA CGCCGCTCAT TGGCTTTGTG CGGGTGACTG CGGAGGATGG CAGCCAAGGC TGGGGGCAGG TCTCCACCTA CAACAGCGAT CTGACCTCGG AGATCCTGCA TCGGCAGGTC GCGCCCTGGG CACTCGGGCG GGGTATGGAT GCGCTGGAAG AGGTGATTGC CGAAATTCCC CTGCGCGAAC ATAAATTCCC CGGCACCTAT CTGCGCCGCG CCATGGCCGG TCTCGACACC GCCGTGTGGG ACTGGCGCGG CAAAGCTCAG GGCAAGCCGG TGGCAGAGCT TTTGGGTGGG TCTGCAGGGC CGGTCCGGGC GTATGCCTCC TCCATGCGCC GCGACATCAC CCCAGAGGCC GAGGCCGAGC GTATGCAGAG GCTGCGCGAC GCGCATGGGT TTGACGCCTT CAAAGTCCGT GTAGGGGCTG AATGCGGTCA GGACCGTGAC GAATGGGAGG GTCGCACCGA GGCGATCATT CCCGCCATGT GCAAGGCGAT GGGAGCGCAA GCTGCCTTGC TGGTCGACGG CAATTCTGGC TTCAGCCCCG CCCGCGCGAT CGAGGTCGGC AGAATGCTGG AGGCCAACGG ATACGAGCAT TTTGAAGAAC CCTGTCCCTA TTGGGAGCAG GAGCAGACCC GCGAGGTCAC CCAAGCGCTT GGGATCGATG TGGCGGGGGG CGAGCAGGAC TGCGATCTGC AGCACTGGAA GCGTATGATC GAGAACCGTG TGGTCGACAT CATCCAACCG GATATCCTCT ATCTCGGCGG GATGGTGCGC AGCATGGAAG TGGCCCGCAT GGGCCATGCG GCAGGGCTGC CCTGTACGCC CCATGCGGCG AATCTTTCGC TGGTGACGCT CTTCACCATG CACCTGATGC GTGCGCTGCC CAATCCCGGT CGCTATCTGG AGTTCTCCAT CGAAGGCGAC GACTACTATC CCTGGCAGCG CACCTTGTTC GCAAATGATC CCTTTCAGAT CACCAATGGC CAGGCGCTTG TCACCGACGC CCCCGGTTGG GGCGTGGAGA TTTGCCCCGA GTGGCTGGCC AAATCCACCT ACAAATGCAG CGAGGACGAC CAATGA
|
Protein sequence | MSTPYSTSGA MPVTHARGAD DTGPRISKIE TFCTPLIGFV RVTAEDGSQG WGQVSTYNSD LTSEILHRQV APWALGRGMD ALEEVIAEIP LREHKFPGTY LRRAMAGLDT AVWDWRGKAQ GKPVAELLGG SAGPVRAYAS SMRRDITPEA EAERMQRLRD AHGFDAFKVR VGAECGQDRD EWEGRTEAII PAMCKAMGAQ AALLVDGNSG FSPARAIEVG RMLEANGYEH FEEPCPYWEQ EQTREVTQAL GIDVAGGEQD CDLQHWKRMI ENRVVDIIQP DILYLGGMVR SMEVARMGHA AGLPCTPHAA NLSLVTLFTM HLMRALPNPG RYLEFSIEGD DYYPWQRTLF ANDPFQITNG QALVTDAPGW GVEICPEWLA KSTYKCSEDD Q
|
| |