Gene Achl_2097 details


Gene Information

Locus tag: Achl_2097
Symbol:
ID: 7293558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2362253
End bp: 2363542
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 65%
IMG OID: 643590496
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_002488155
Protein GI: 220912846
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
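
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on replicon NC_011886.
START_BP = 2362253
END_BP = 2363542


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1290 429
```

Both values agree with the Gene Length and Protein Length fields of the record.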


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0000368984
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGCCA GCACCACCGT TCCGGCACCT TCGACGCCCA CGGCTACAGA CCTCGCAATC 
ACAGACATCA CCATTACCCC CATCGCCTTC TCCGATCCGC CGCTGCTGAA CGCTGTGGGC
GTCCACGAAC CCCTGGTCCA CCGGGTGGTC ATCGAAGTGC GGACCGCCAA CGGCCTCCTC
GGCCTGGGCG AATGCTCGGG AGGCAGCACC CGCCTCAAGA ACCTGGCGGC GGGGGCCAAC
GCCATCAAGG GTGTCAGCAT CTTTGAAACG TCGCGGATGG AACAACTCAT CAACCAGGCC
CTGGACCCGG GGCTCGGCGC ATTTGAACGC GCCGCGGTGT TCTCCGCCTT CGAGGTGGCC
GCACTGGACA TCCAGGGGCA CGCCACAGGC CGGACCGTCA GTGAACTGCT CGGCGGCACG
GTCCGCGACG AGGTCCCCTT CAGCGCCTAC CTCTTCTACA AATGGGCAGA ACACCCCGCG
CTGGACGGCA AGCCCGCCAT CACCGATGAC TGGGGCGAAG CCCTGGATCC GGCAGGCATC
GTCCGGCAGG CGCAGAAGAT GATTTCCGAG TACGGTTTCA AATCCATCAA GCTCAAAGGC
GGCGTCTTCC CACCCGCGCA GGAAATCGAA GCGATCCAGG CGCTGCGGGA CGCGTTTCCC
GGGATGCCGC TGCGGCTCGA TCCGAACACC GCGTGGACAG TGGAGACATC CCGATGGGTT
GCCCGCGAAA CGGAAGGCCT GCTCGAGTAC CTGGAGGATC CCACTCCCGG CCTTGAAGGG
ATGGGCGAGG TAGCTGCCAC GGCAGCCATG CCACTGGCCA CCAACATGTG CGTGGTGGCC
TTCGAACATA TCCGGCGCGG TGTGGAACTT GGCTCCGTCC AAGTCATTCT CGGTGACCAC
CACTACTGGG GCGGCCTCCG CCATACCAGG GAACTGGGCG CCATCTGCGA AACCTTTGGC
CTGGGACTGT CCATGCATTC CAACTCCCAC CTGGGCATCA GCCTCGCCGC GATGGTCCAC
GTCGCCGCCG CCACCCCCGC CCTCACCTAC GCCTGCGACA CCCACTACCC GTGGAACGGG
CACAACGACG TGGTCAAGCC AGGCACGTTG CGCTTCGTGG ACGGCAGCGT CCGCGTCCCC
ACCGGCACCG GACTTGGCAT TGAACTGGAC CGGGAAAAGC TCGCCGAACT GCACCGGCAG
TATCTGGATG CCCGCATGAC GGCGAGGGAC GACACCGGGT ACATGCAACG GTTCGTCCCC
GAATACACGG CGGACCTGCC CCGGTGGTAG
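
The GC content field of the record can be checked against the gene sequence. The sketch below first strips the spaces and line breaks used in the listing above, then computes the fraction of G and C bases. The helper names are hypothetical, and the example input is only the first row of the listing (60 bases), not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGACCGCCA GCACCACCGT TCCGGCACCT TCGACGCCCA CGGCTACAGA CCTCGCAATC"
print(f"{gc_content(first_row):.1%}")
```

To check the full gene, pass all rows of the listing as one string; `clean_sequence` ignores the whitespace between blocks.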
 
Protein sequence
MTASTTVPAP STPTATDLAI TDITITPIAF SDPPLLNAVG VHEPLVHRVV IEVRTANGLL 
GLGECSGGST RLKNLAAGAN AIKGVSIFET SRMEQLINQA LDPGLGAFER AAVFSAFEVA
ALDIQGHATG RTVSELLGGT VRDEVPFSAY LFYKWAEHPA LDGKPAITDD WGEALDPAGI
VRQAQKMISE YGFKSIKLKG GVFPPAQEIE AIQALRDAFP GMPLRLDPNT AWTVETSRWV
ARETEGLLEY LEDPTPGLEG MGEVAATAAM PLATNMCVVA FEHIRRGVEL GSVQVILGDH
HYWGGLRHTR ELGAICETFG LGLSMHSNSH LGISLAAMVH VAAATPALTY ACDTHYPWNG
HNDVVKPGTL RFVDGSVRVP TGTGLGIELD REKLAELHRQ YLDARMTARD DTGYMQRFVP
EYTADLPRW
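
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a minimal stand-alone translator, not the tool the database used. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, no special handling of alternative starts is included. All names are illustrative.

```python
# Codon table in the usual NCBI ordering (bases T, C, A, G at each position).
# Assumption: internal codon assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGACCGCCA GCACCACCGT T"))  # MTASTTV
```

The output matches the first seven residues of the protein sequence, and the final codons of the gene (CGG TGG TAG) translate to the closing residues RW followed by a stop.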