Gene Arth_1931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1931 
Symbol 
ID4445550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2175623 
End bp2176723 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID639689741 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_831413 
Protein GI116670480 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.132079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGATCA CACGCATCGA GGCCATCCCC TATGCCATCC CGTACTCGCG GCCGCTGAAG 
TTCGCCAGCG GAGAGGTGAG CACCGCGGAA CATGTGCTGG TCCGGATCCA CACGGATGCG
GGTATCTGCG GAGTGGCAGA CACTCCTCCG CGGCCCTATA CATACGGCGA AACCCAGGAT
TCGATCGTGT CGGTGGTGAC CAAGGTCTTT GCCCCGCAGC TCATCGGGAT GGACCCGATG
GACCGCTCCA AAGTCCAGCA GTTGCTCGGG CGCACGGTCA ATAACCCCAC GGCCAAGGGG
GCCCTGGACA TCGCACTCTG GGATGTCATC GGGATTTCGC TGGGCACCCC GGTGCACAAG
CTCCTGGGTG GCTTCAGCGA CAGCATGCGG GTCTCGCACA TGCTGGGCTT CAAGGCCGCC
GCAGAGCTCC TCGAGGAAGC GCTGCGGTTC CGTGAAACGT ACGGCATCGA CACCTTCAAG
CTCAAGGTTG GCCGGCGGCC GCTCTCCCTG GACGTCGAGG CCTGCCACGT GCTGCGGGAA
GGCCTCGGTG CGGACACCGA GATCTACCTC GATGCCAACC GCGGGTGGAC GGCGAACGAG
GCCATGGAGG TGCTCCGCCG GACCGAAGGC CTGGGATTGT CAATGCTGGA GGAGCCGTGC
GATGCCGCCG AGGCGATGGG ACGGCGCCGG CTGGTCCAGC ACTCGAGCAT CCCGATCGTG
GGCGACGAAA GCGTCCCCAA CCTCGGGGAC GTTTCCCGGG AACTGCTCTC CGGCGGAAGC
AACGCGATCT GCATTAAGAC AGCGCGCAGC GGTTTCACCG AGGCCCAGCA GATCCTCGGC
CTCTGCGAGG GCCTTGGCGT GGACGTCACG ATGGGCAACC AGATCGACAC ACAGGTCGGC
AGTCTCGCCA CGGTCACCTT CGGCGCGGCC TTCGAGGCCA GCTCCCGGCG GGCCGGGGAG
CTCTCCAACT ACCTGGACAT GACGGATGAC CTGCTTGCGG AGCCGCTCGA AATTACCGAC
GGGGCCATCC GGGTCCGCAA GGCCCCCGGC GTCGGGGCAG CCATCGACGC CGACAAGCTG
CAGAAGTACC GTCAGGACTA G
 
Protein sequence
MKITRIEAIP YAIPYSRPLK FASGEVSTAE HVLVRIHTDA GICGVADTPP RPYTYGETQD 
SIVSVVTKVF APQLIGMDPM DRSKVQQLLG RTVNNPTAKG ALDIALWDVI GISLGTPVHK
LLGGFSDSMR VSHMLGFKAA AELLEEALRF RETYGIDTFK LKVGRRPLSL DVEACHVLRE
GLGADTEIYL DANRGWTANE AMEVLRRTEG LGLSMLEEPC DAAEAMGRRR LVQHSSIPIV
GDESVPNLGD VSRELLSGGS NAICIKTARS GFTEAQQILG LCEGLGVDVT MGNQIDTQVG
SLATVTFGAA FEASSRRAGE LSNYLDMTDD LLAEPLEITD GAIRVRKAPG VGAAIDADKL
QKYRQD