Gene Arth_3336 details


Gene Information

Locus tag: Arth_3336
Symbol:
ID: 4444065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3749706
End bp: 3750851
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 68%
IMG OID: 639691159
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_832811
Protein GI: 116671878
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
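
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 3749706
END_BP = 3750851


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1146 381
```

Both results match the Gene Length (1146 bp) and Protein Length (381 aa) fields, which supports the inclusive-coordinate assumption for this record.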


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATCGC CAGTAGCCGA AGCGCCGGTC GCCGCGGCGG TGCGGACGCT GCCCCGGATC 
ACCGGGCTGT CGAGCCGCCT GCTGACTGTT CCGCTCCTGC GGAGCTGGGG TGCGGAGGCA
CCGGAAAACC ATGTGATCGT CACCGAAGTC CGGACGGACG ACGGCGGCAC GGGTCACGGG
TTTTCGTGGA CCCCCACCAT CGGCCCGCAG GCGGTCAAAG CGCTCCTGGA CTTCGACGTC
GCGCCTTTCA TTACCGGGCT CGAAGCGAAC CCCGAGGGTG TGTGGGACCA GCTGTGGAAG
CGGCTGCACG AAGCCGGGGG CGGGGGACTG ACCACCATCG CCATGGCCGG CGCGGATCTT
GCCCTGTGGG ACCTGCAGGC GCGCCGCGCC GGAACCTCGG TCACCGGGCT TCTGGGCCAG
CGCCAGGACT CCGTGGAGGT GTACGGTTCC GGCGTGAACC TGCACTACTC CATCGGGGAG
CTCGTGGCGC AGGTGGAACG CTGGGTGGCT GCCGGGCACA ACGCCGTCAA GATCAAGGTG
GGCAAGCCGG ACATCCGCGA AGACGTGGAA CGCGTGGCCG CTGTCCGGAA GGTCCTGGGC
CCCCACCGGA AGCTGATGAT CGACGCCAAC CAGCGCTGGG ACCTGCCCGC CACATTCCGC
GCCCTGGAGG TGCTGGGGGA GTTCGGCCTC GAATGGCTGG AAGAACCCCT CCGGGCTGAC
GACCTCTGGG CCTACCGCAG GCTGCGCCAA CACTCGCCGG TACCCATCGC GCTGGGCGAG
AACCTGCACA ACATCTACCG CTTCCGCGAC TTCATCGAGG CGGGAGCGGT GGACATCATC
CAACCCAACA TCATCCGTGT CGGCGGCATC ACCCCGTTCC GCCGCATCGT GGAACTCGCC
CGCACGCACA GCATCAGGGT CATGCCGCAC CTGCTGCCGG AACTCTCCGG CCAGCTCGCG
CTGACCATGG CCGAGCCCAC CCTGGTGGAG GACGTCGAGG ACGCCTCCTT CGAACAACTT
GGCGTGCTGG ACGCGCCCTC ACCCGTGCAG GTCGGCAACA GCCGGCTCAC ACTCACTGGC
AGGCCCGGGC TGGGGTTCGT CTTCTCCGGG GCATCCGCCG ATCACCAGAA CAGGAACTCA
CTGTGA
 
Protein sequence
MGSPVAEAPV AAAVRTLPRI TGLSSRLLTV PLLRSWGAEA PENHVIVTEV RTDDGGTGHG 
FSWTPTIGPQ AVKALLDFDV APFITGLEAN PEGVWDQLWK RLHEAGGGGL TTIAMAGADL
ALWDLQARRA GTSVTGLLGQ RQDSVEVYGS GVNLHYSIGE LVAQVERWVA AGHNAVKIKV
GKPDIREDVE RVAAVRKVLG PHRKLMIDAN QRWDLPATFR ALEVLGEFGL EWLEEPLRAD
DLWAYRRLRQ HSPVPIALGE NLHNIYRFRD FIEAGAVDII QPNIIRVGGI TPFRRIVELA
RTHSIRVMPH LLPELSGQLA LTMAEPTLVE DVEDASFEQL GVLDAPSPVQ VGNSRLTLTG
RPGLGFVFSG ASADHQNRNS L
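
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. As a minimal sketch (not the database's own tooling), the following shows how the space-separated 10-base blocks could be cleaned, translated, and scored for GC content. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the example uses only the first 30 bases of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# Table 11 shares these assignments with the standard code; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGGATCGC CAGTAGCCGA AGCGCCGGTC"
print(translate(first_blocks))  # prints: MGSPVAEAPV
```

The output matches the first 10 residues of the protein sequence. Pasting the full gene sequence into `translate` and `gc_content` would allow the same comparison against the Protein Length and GC content fields.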