Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3153 |
Symbol | |
ID | 4444213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3541310 |
End bp | 3542368 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639690979 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_832631 |
Protein GI | 116671698 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01927] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCG ACGCACCCCG CCAGCCCCGG AACCTTCCTG CGGTGAACCT CCCTGCGGTG AACCTTCCTT CCGTGGAGCA GCTTGTGGCC TCCGCGCACG TGGTGAGCCT GCCCATGCGC GTCAAGTTCC GCGGCATTAC ACAGCGCGAA GCCCTGCTGC TGGACGGCAC GCTGGGCTGG GGGGAATTCT GCCCCTTCCC CGAGTACGGG GACGCCGAGG CGTCCCGCTG GCTTGCCTCC GCCGTCGAAG CCGGCTGGCA AGGATTTCCG CCTCCGCTGC GGTCAGTGAT TCCGGTCAAC GCCACCGTTC CTGCCATCCC CGCGGAGCGG GTTCCGGAAG TCCTGGCCCG CTTTGGCCGC GTGGACGCCG TCAAAATCAA GGTGGCCGAA CGCGGCCAGA CGCTCGACGA CGACGCCGCC CGGGTTGCGG CCGTGCGCCG CGCCCTCCCG GACGCCGCCA TTCGGGTGGA CGCGAACGGC GGCTGGGACG TGCCCGCCGC CGTCGAGGCA CTGACCCGGC TGTCCGCCGT CGGGCTCGAA TACGCCGAAC AGCCGGTCCC GGACATCGAA GGCCTGGCCG AGGTCCGCCG CCGGCTGCAG GCCGCCGGCA TCCCCGTGCT GATCGCTGCG GATGAAAGCG TGCGCAAGGA AGACGACCCC CTCCGCGTCG CGCGGGCAGG CGCGGCCGAT CTCATCGTGG TGAAAGTGGC GCCGCTCGGG GGAGTGCGGC GTGCCCTGGA CATCGTGGCG CAGGCCGGAC TGCCCGCCGT CGTCAGTTCC GCGCTGGACA CCTCAGTTGG GATCCGCGCC GGGCTGGCCC TGGCGGCGGC GCTCCCGGAA CTGCCCTACG CGTGCGGGCT GGGGACAGTC TCGCTGTTTG AGCAGGACAT CACGCTGGAT CCGCTGGTGG CCGACGACGG CGCCATCCGC GTCCGCGACG TCACCGCCGA CGCCGGGCTG CTGGAGCGAT ATGCGGCACC CGCCGAGCGC CGGGACTGGT GGCTGGACCG GCTCCGCCGC GTCCACGCAG TGCTGTTCAC CGGATCTTCA CCTGGCTGA
|
Protein sequence | MPADAPRQPR NLPAVNLPAV NLPSVEQLVA SAHVVSLPMR VKFRGITQRE ALLLDGTLGW GEFCPFPEYG DAEASRWLAS AVEAGWQGFP PPLRSVIPVN ATVPAIPAER VPEVLARFGR VDAVKIKVAE RGQTLDDDAA RVAAVRRALP DAAIRVDANG GWDVPAAVEA LTRLSAVGLE YAEQPVPDIE GLAEVRRRLQ AAGIPVLIAA DESVRKEDDP LRVARAGAAD LIVVKVAPLG GVRRALDIVA QAGLPAVVSS ALDTSVGIRA GLALAAALPE LPYACGLGTV SLFEQDITLD PLVADDGAIR VRDVTADAGL LERYAAPAER RDWWLDRLRR VHAVLFTGSS PG
|
| |