Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0479 |
Symbol | |
ID | 4447034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 509813 |
End bp | 511102 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639688276 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_829978 |
Protein GI | 116669045 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.663029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCA GTACCACTTT CCAGGCCATC ACCACTCCCA CAGCCGCCGA CCTCAAGATC ACCGACGTCA CCATCACCCC CATCGCCTTC TCCGATCCAC CGCTGCTCAA CGCCGTCGGG GTCCATGAAC CGCTGGTCCA CCGGGTGGTG ATTGAGATTC GCACCGCCAA CGGCCTGCTC GGGCTGGGCG AGTGCGCCGG CGGGCAAAGC CGGCTGGCAA ACCTGGCCGT CGGGGCCCGG GCCATCAGGG GCGTGAGCGT CTTCGATACC ACCCTGATGG AGCTGCTCAT CAATGAAGCG CTCGCCGGCG AGCCCTCCGT ATTTGAACGG GCCGCCGTGT TCTCAGCCTT CGAAGTGGCA GCGCTGGACA TCCAGGGCCA TGCGACGGGC AGGAGTGTCA GCGAACTTCT GGGCGGCACG GTCCGGGACG AGGTTCCGTT CAGCGCCTAC CTTTTCTACA AGTGGGCGGA ACATCCGGCC TTGGACGGAA AGCCCGCCAT TTCCGATGAA TGGGGTGAAG CCCTGGACCC GGAGGGCATA GTCCGGCAGG CCCGCAAGAT GATCTCCGAG TACGGCTTCA AATCGATCAA GCTCAAGGGC GGCGTATTTC CGCCCGCCCA GGAAATCGAA GCAATCAAGG CGCTTCGCCA GGCCTTCCCC GGGCTGCCGC TGCGGCTCGA CCCCAACACG GCGTGGACCG TGGAAACCTC TCGCTGGGTA GCCCAGGAAA CGTCGGGGCT GCTTGAATAC CTCGAGGACC CCACTCCGGG CCTTGAGGGC ATGGCGGCGG TGGCCACGAC TGCCGCCATG CCGCTGGCCA CCAACATGTG CGTGGTGGCT TTTGACCACA TCAAGCGCGG CGTTGAACTC GGCGCGGTGC AGGTCATCCT CGGTGACCAC CATTATTGGG GAGGACTGCG GCACACCCGT GAACTCGGAG CGATCTGCCA GACGTTCGGG ATCGGCCTGT CGATGCATTC CAACTCGCAC CTGGGAATCA GTCTGGCAGC GATGGTGCAC GTCGCCGCCT CGACCCCGGC GCTTACCTAC GCCTGCGATA CCCACTATCC GTGGAACGGC CACAACGACG TCGTGAAACC GGGCGCCCTG CGGTTTGTTG ACGGCAGCGT CAAGGTTCCG GCCGGTCCCG GACTGGGCGT TCAGCTGGAC CGGGAGAAGC TGGCCGAACT GCACCAGCAA TACCTTGACG CCGGCATGAC GGCGAGGGAC GACACCGGCT ACATGCAGAA GTTCGTCCCG GACTACACAG CGGACCTTCC GCGCTGGTGA
|
Protein sequence | MNTSTTFQAI TTPTAADLKI TDVTITPIAF SDPPLLNAVG VHEPLVHRVV IEIRTANGLL GLGECAGGQS RLANLAVGAR AIRGVSVFDT TLMELLINEA LAGEPSVFER AAVFSAFEVA ALDIQGHATG RSVSELLGGT VRDEVPFSAY LFYKWAEHPA LDGKPAISDE WGEALDPEGI VRQARKMISE YGFKSIKLKG GVFPPAQEIE AIKALRQAFP GLPLRLDPNT AWTVETSRWV AQETSGLLEY LEDPTPGLEG MAAVATTAAM PLATNMCVVA FDHIKRGVEL GAVQVILGDH HYWGGLRHTR ELGAICQTFG IGLSMHSNSH LGISLAAMVH VAASTPALTY ACDTHYPWNG HNDVVKPGAL RFVDGSVKVP AGPGLGVQLD REKLAELHQQ YLDAGMTARD DTGYMQKFVP DYTADLPRW
|
| |