Gene Arth_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0234 
Symbol 
ID4447325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp246708 
End bp248054 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content67% 
IMG OID639688030 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_829735 
Protein GI116668802 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCCA TCACATCCAT CACCACGCAG GACGTCCGGT TTCCCACGTC CCTGGAACTC 
GATGGGTCCG ACGCAGTCAA TGTTGACCCC GACTACTCCG CCGCCTACGT CGTCATCCGC
ACCGATGCGG GCGATGAAGG CCACGGCTTT GTGTTCAGCT GCGGCCGCGG CAGCGAAATC
CTCACGGCGG CCATCAACTC CTACGCGGAG CTGCTGCGGG GCCGGGACAT CGAGGAACTG
ATCTACGACC TCGGGAGCGC CTCCAAGCGC CTCATCCACG ACTCGCAGCT CCGCTGGCTC
GGCCCGGAGA AAGGTGTCAC CCAGATGGCG GCCGGCGCGC TGGTCAGCGC GCTCTGGGAC
ATCCGCGCCC GCCGCGAAAA CAAGCCGCTC TGGCTGCTCC TGAGCGAAAT GTCCCCGGAA
GAGATCGTTG ACGTCGTCGA CTTCACCCAC ATCCGTGACG CCCTGAATCC GCAGCAAGCC
CTGGACATCC TGCGCGCAGG CCAGGACGGC AAGGCGGCCC GCATCGCAAG CCTCAGGGCG
GACGGCTACC CCGCCTACAC CACGTCGCCG GGCTGGCTGG GCTACAGCGA CGAGAAGCTG
GTCCGGCTCA GCAAGGAGGC CGCCGCAGCG GGCTTCTCCA TGATCAAGCT CAAGGTCGGC
GGCGACCTCG CCGACGATCG CCGCCGCATG GCCCTCGCCC GGCAGGCCGT GGGCAACCTG
CCCATCGCCA TCGACGCCAA CCAGCGCTGG GAAGTGTCCG AGGCGATTGA ATGGGTCAAC
CAGCTGGCCG AGTTCAATCC CTACTGGATC GAAGAGCCCA CCAGCACCGA TGACATCCTG
GGCCATGCGG ACATCCGGAA GGGAGTAGCC CCGGTCCGCG TCGCCACAGG CGAGGCGGTA
GCCAGCCGTA TTGTGTTCAA GCAGCTGCTT CAGGCAGGGG CCATCGACGT CCTGCAGCTG
GATTCCACCC GGGTGGGCGG CGTCAACGAG AACATCGCCA ACCTGCTGCT GGCCGCCAAG
TTCGGCGTCC CGGTCTGCCC GCATGCCGGA GGCGTTGGCC TGTGCGAGCT GGTCCAGCAC
TTCTCCTTCT TCGACTACGC CGCCATCACC GGCAGCCAGG ACGGCCGCAT GATCGAATAC
GTGGACCACC TGCACGAACA CTTCGCCGAA CCGGTGCGGA TCGTTGGCGG ACGCTATGCC
GCCCCGGAAC GCCCGGGCAC CGGCGCCGAG ATGCTCAGTG CCTCACGGAC GCGCTGGGAA
TTCCCCTCCG GCGCAGGGTG GCTTGAAGTG GGCAACCGCG CCGCCGTCAC CGGTGCGAGC
CTTGCACCTG CCGGAGCCGG CCGATGA
 
Protein sequence
MPSITSITTQ DVRFPTSLEL DGSDAVNVDP DYSAAYVVIR TDAGDEGHGF VFSCGRGSEI 
LTAAINSYAE LLRGRDIEEL IYDLGSASKR LIHDSQLRWL GPEKGVTQMA AGALVSALWD
IRARRENKPL WLLLSEMSPE EIVDVVDFTH IRDALNPQQA LDILRAGQDG KAARIASLRA
DGYPAYTTSP GWLGYSDEKL VRLSKEAAAA GFSMIKLKVG GDLADDRRRM ALARQAVGNL
PIAIDANQRW EVSEAIEWVN QLAEFNPYWI EEPTSTDDIL GHADIRKGVA PVRVATGEAV
ASRIVFKQLL QAGAIDVLQL DSTRVGGVNE NIANLLLAAK FGVPVCPHAG GVGLCELVQH
FSFFDYAAIT GSQDGRMIEY VDHLHEHFAE PVRIVGGRYA APERPGTGAE MLSASRTRWE
FPSGAGWLEV GNRAAVTGAS LAPAGAGR