Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1931 |
Symbol | |
ID | 4445550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2175623 |
End bp | 2176723 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689741 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_831413 |
Protein GI | 116670480 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.132079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGATCA CACGCATCGA GGCCATCCCC TATGCCATCC CGTACTCGCG GCCGCTGAAG TTCGCCAGCG GAGAGGTGAG CACCGCGGAA CATGTGCTGG TCCGGATCCA CACGGATGCG GGTATCTGCG GAGTGGCAGA CACTCCTCCG CGGCCCTATA CATACGGCGA AACCCAGGAT TCGATCGTGT CGGTGGTGAC CAAGGTCTTT GCCCCGCAGC TCATCGGGAT GGACCCGATG GACCGCTCCA AAGTCCAGCA GTTGCTCGGG CGCACGGTCA ATAACCCCAC GGCCAAGGGG GCCCTGGACA TCGCACTCTG GGATGTCATC GGGATTTCGC TGGGCACCCC GGTGCACAAG CTCCTGGGTG GCTTCAGCGA CAGCATGCGG GTCTCGCACA TGCTGGGCTT CAAGGCCGCC GCAGAGCTCC TCGAGGAAGC GCTGCGGTTC CGTGAAACGT ACGGCATCGA CACCTTCAAG CTCAAGGTTG GCCGGCGGCC GCTCTCCCTG GACGTCGAGG CCTGCCACGT GCTGCGGGAA GGCCTCGGTG CGGACACCGA GATCTACCTC GATGCCAACC GCGGGTGGAC GGCGAACGAG GCCATGGAGG TGCTCCGCCG GACCGAAGGC CTGGGATTGT CAATGCTGGA GGAGCCGTGC GATGCCGCCG AGGCGATGGG ACGGCGCCGG CTGGTCCAGC ACTCGAGCAT CCCGATCGTG GGCGACGAAA GCGTCCCCAA CCTCGGGGAC GTTTCCCGGG AACTGCTCTC CGGCGGAAGC AACGCGATCT GCATTAAGAC AGCGCGCAGC GGTTTCACCG AGGCCCAGCA GATCCTCGGC CTCTGCGAGG GCCTTGGCGT GGACGTCACG ATGGGCAACC AGATCGACAC ACAGGTCGGC AGTCTCGCCA CGGTCACCTT CGGCGCGGCC TTCGAGGCCA GCTCCCGGCG GGCCGGGGAG CTCTCCAACT ACCTGGACAT GACGGATGAC CTGCTTGCGG AGCCGCTCGA AATTACCGAC GGGGCCATCC GGGTCCGCAA GGCCCCCGGC GTCGGGGCAG CCATCGACGC CGACAAGCTG CAGAAGTACC GTCAGGACTA G
|
Protein sequence | MKITRIEAIP YAIPYSRPLK FASGEVSTAE HVLVRIHTDA GICGVADTPP RPYTYGETQD SIVSVVTKVF APQLIGMDPM DRSKVQQLLG RTVNNPTAKG ALDIALWDVI GISLGTPVHK LLGGFSDSMR VSHMLGFKAA AELLEEALRF RETYGIDTFK LKVGRRPLSL DVEACHVLRE GLGADTEIYL DANRGWTANE AMEVLRRTEG LGLSMLEEPC DAAEAMGRRR LVQHSSIPIV GDESVPNLGD VSRELLSGGS NAICIKTARS GFTEAQQILG LCEGLGVDVT MGNQIDTQVG SLATVTFGAA FEASSRRAGE LSNYLDMTDD LLAEPLEITD GAIRVRKAPG VGAAIDADKL QKYRQD
|
| |