Gene Arth_3153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3153 
Symbol 
ID4444213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3541310 
End bp3542368 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content72% 
IMG OID639690979 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_832631 
Protein GI116671698 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCCG ACGCACCCCG CCAGCCCCGG AACCTTCCTG CGGTGAACCT CCCTGCGGTG 
AACCTTCCTT CCGTGGAGCA GCTTGTGGCC TCCGCGCACG TGGTGAGCCT GCCCATGCGC
GTCAAGTTCC GCGGCATTAC ACAGCGCGAA GCCCTGCTGC TGGACGGCAC GCTGGGCTGG
GGGGAATTCT GCCCCTTCCC CGAGTACGGG GACGCCGAGG CGTCCCGCTG GCTTGCCTCC
GCCGTCGAAG CCGGCTGGCA AGGATTTCCG CCTCCGCTGC GGTCAGTGAT TCCGGTCAAC
GCCACCGTTC CTGCCATCCC CGCGGAGCGG GTTCCGGAAG TCCTGGCCCG CTTTGGCCGC
GTGGACGCCG TCAAAATCAA GGTGGCCGAA CGCGGCCAGA CGCTCGACGA CGACGCCGCC
CGGGTTGCGG CCGTGCGCCG CGCCCTCCCG GACGCCGCCA TTCGGGTGGA CGCGAACGGC
GGCTGGGACG TGCCCGCCGC CGTCGAGGCA CTGACCCGGC TGTCCGCCGT CGGGCTCGAA
TACGCCGAAC AGCCGGTCCC GGACATCGAA GGCCTGGCCG AGGTCCGCCG CCGGCTGCAG
GCCGCCGGCA TCCCCGTGCT GATCGCTGCG GATGAAAGCG TGCGCAAGGA AGACGACCCC
CTCCGCGTCG CGCGGGCAGG CGCGGCCGAT CTCATCGTGG TGAAAGTGGC GCCGCTCGGG
GGAGTGCGGC GTGCCCTGGA CATCGTGGCG CAGGCCGGAC TGCCCGCCGT CGTCAGTTCC
GCGCTGGACA CCTCAGTTGG GATCCGCGCC GGGCTGGCCC TGGCGGCGGC GCTCCCGGAA
CTGCCCTACG CGTGCGGGCT GGGGACAGTC TCGCTGTTTG AGCAGGACAT CACGCTGGAT
CCGCTGGTGG CCGACGACGG CGCCATCCGC GTCCGCGACG TCACCGCCGA CGCCGGGCTG
CTGGAGCGAT ATGCGGCACC CGCCGAGCGC CGGGACTGGT GGCTGGACCG GCTCCGCCGC
GTCCACGCAG TGCTGTTCAC CGGATCTTCA CCTGGCTGA
 
Protein sequence
MPADAPRQPR NLPAVNLPAV NLPSVEQLVA SAHVVSLPMR VKFRGITQRE ALLLDGTLGW 
GEFCPFPEYG DAEASRWLAS AVEAGWQGFP PPLRSVIPVN ATVPAIPAER VPEVLARFGR
VDAVKIKVAE RGQTLDDDAA RVAAVRRALP DAAIRVDANG GWDVPAAVEA LTRLSAVGLE
YAEQPVPDIE GLAEVRRRLQ AAGIPVLIAA DESVRKEDDP LRVARAGAAD LIVVKVAPLG
GVRRALDIVA QAGLPAVVSS ALDTSVGIRA GLALAAALPE LPYACGLGTV SLFEQDITLD
PLVADDGAIR VRDVTADAGL LERYAAPAER RDWWLDRLRR VHAVLFTGSS PG