Gene Arth_0479 details

Gene Information

Locus tag: Arth_0479
Symbol:
ID: 4447034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 509813
End bp: 511102
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 64%
IMG OID: 639688276
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_829978
Protein GI: 116669045
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
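
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 509813
END_BP = 511102
GENE_LENGTH_BP = 1290
PROTEIN_LENGTH_AA = 429


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1290
print(coding_codons(GENE_LENGTH_BP))   # 429
```

Both results match the Gene Length and Protein Length fields listed in the record.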


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.663029
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA GTACCACTTT CCAGGCCATC ACCACTCCCA CAGCCGCCGA CCTCAAGATC 
ACCGACGTCA CCATCACCCC CATCGCCTTC TCCGATCCAC CGCTGCTCAA CGCCGTCGGG
GTCCATGAAC CGCTGGTCCA CCGGGTGGTG ATTGAGATTC GCACCGCCAA CGGCCTGCTC
GGGCTGGGCG AGTGCGCCGG CGGGCAAAGC CGGCTGGCAA ACCTGGCCGT CGGGGCCCGG
GCCATCAGGG GCGTGAGCGT CTTCGATACC ACCCTGATGG AGCTGCTCAT CAATGAAGCG
CTCGCCGGCG AGCCCTCCGT ATTTGAACGG GCCGCCGTGT TCTCAGCCTT CGAAGTGGCA
GCGCTGGACA TCCAGGGCCA TGCGACGGGC AGGAGTGTCA GCGAACTTCT GGGCGGCACG
GTCCGGGACG AGGTTCCGTT CAGCGCCTAC CTTTTCTACA AGTGGGCGGA ACATCCGGCC
TTGGACGGAA AGCCCGCCAT TTCCGATGAA TGGGGTGAAG CCCTGGACCC GGAGGGCATA
GTCCGGCAGG CCCGCAAGAT GATCTCCGAG TACGGCTTCA AATCGATCAA GCTCAAGGGC
GGCGTATTTC CGCCCGCCCA GGAAATCGAA GCAATCAAGG CGCTTCGCCA GGCCTTCCCC
GGGCTGCCGC TGCGGCTCGA CCCCAACACG GCGTGGACCG TGGAAACCTC TCGCTGGGTA
GCCCAGGAAA CGTCGGGGCT GCTTGAATAC CTCGAGGACC CCACTCCGGG CCTTGAGGGC
ATGGCGGCGG TGGCCACGAC TGCCGCCATG CCGCTGGCCA CCAACATGTG CGTGGTGGCT
TTTGACCACA TCAAGCGCGG CGTTGAACTC GGCGCGGTGC AGGTCATCCT CGGTGACCAC
CATTATTGGG GAGGACTGCG GCACACCCGT GAACTCGGAG CGATCTGCCA GACGTTCGGG
ATCGGCCTGT CGATGCATTC CAACTCGCAC CTGGGAATCA GTCTGGCAGC GATGGTGCAC
GTCGCCGCCT CGACCCCGGC GCTTACCTAC GCCTGCGATA CCCACTATCC GTGGAACGGC
CACAACGACG TCGTGAAACC GGGCGCCCTG CGGTTTGTTG ACGGCAGCGT CAAGGTTCCG
GCCGGTCCCG GACTGGGCGT TCAGCTGGAC CGGGAGAAGC TGGCCGAACT GCACCAGCAA
TACCTTGACG CCGGCATGAC GGCGAGGGAC GACACCGGCT ACATGCAGAA GTTCGTCCCG
GACTACACAG CGGACCTTCC GCGCTGGTGA
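
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch of how a GC content figure such as the one in the Gene Information fields could be recomputed, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full block-formatted gene sequence into a string.
# Only the first row of this record is shown here for brevity.
first_row = "ATGAACACCA GTACCACTTT CCAGGCCATC ACCACTCCCA CAGCCGCCGA CCTCAAGATC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```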
 
Protein sequence
MNTSTTFQAI TTPTAADLKI TDVTITPIAF SDPPLLNAVG VHEPLVHRVV IEIRTANGLL 
GLGECAGGQS RLANLAVGAR AIRGVSVFDT TLMELLINEA LAGEPSVFER AAVFSAFEVA
ALDIQGHATG RSVSELLGGT VRDEVPFSAY LFYKWAEHPA LDGKPAISDE WGEALDPEGI
VRQARKMISE YGFKSIKLKG GVFPPAQEIE AIKALRQAFP GLPLRLDPNT AWTVETSRWV
AQETSGLLEY LEDPTPGLEG MAAVATTAAM PLATNMCVVA FDHIKRGVEL GAVQVILGDH
HYWGGLRHTR ELGAICQTFG IGLSMHSNSH LGISLAAMVH VAASTPALTY ACDTHYPWNG
HNDVVKPGAL RFVDGSVKVP AGPGLGVQLD REKLAELHQQ YLDAGMTARD DTGYMQKFVP
DYTADLPRW
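
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows this with a hand-built codon table. It assumes the amino-acid assignments of translation table 11 are the same as the standard code (the two differ in permitted start codons, which are not modelled here), and the `translate` helper is illustrative rather than a library function.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence in this record.
print(translate("ATGAACACCA GTACCACTTT CCAGGCCATC"))  # MNTSTTFQAI
print(translate("GACTACACAG CGGACCTTCC GCGCTGGTGA"))  # DYTADLPRW
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.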