Gene Arth_0032 details


Gene Information

Locus tag: Arth_0032
Symbol:
ID: 4447502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 37965
End bp: 39092
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 66%
IMG OID: 639687826
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_829533
Protein GI: 116668600
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
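
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(37965, 39092)
print(length, protein_length(length))  # prints "1128 375"
```

Both values match the record: 1128 bp and 375 aa.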


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG TTGACCTCAT TCGCCATGTG AAACTTTCCA CAGCGAGGCT TCCCCTCGCC 
GTGCCGATCA GTGATGCCAA GGTATTCACC GGCCGCCAGA AGCCCATGAC CGAAGTGGTG
TTCCTGTTCG CTGAAATCAC CACCGAACAG GGCCACAGCG GCATCGGCTT CAGCTACTCC
AAGCGCGCCG GCGGACCGGC CCAGTACGCG CATGCTAAAG AGGTGGCCGA AGGAATCATC
GGCGAGGACC CAAACGACAT CGGCAAGATC TACACGAAGC TGCTCTGGGC CGGCGCCTCC
GTGGGCCGCT CGGGCGTGGC CACTCAGGCG CTGGCCGCCA TCGACATCGC CCTCTACGAC
CTCAAGGCAA AGCGCGCCGG GCTTCCCCTG GCCAAGCTCC TGGGCTCCTA TCGCGACTCG
GTCCAGACGT ACAACACGTC CGGTGGCTTC CTGAATGCCT CCCTGGATGA GGTCAAGGCC
CGCGCCACCC AGTCCATCGA CGACGGAATC GGCGGCATCA AGATCAAGGT TGGCCTCCCC
GACAGCAAGG AGGACCTGCG CCGCGTGGCC GGAATCCGCG AACACATCGG TTGGGACGTG
CCGCTCATGG TGGACGCCAA CCAGCAGTGG GACCGCGCCA CTGCCCTGCG GATGGGCCGG
CAGCTCGAGG AATTCAACCT CATCTGGATT GAAGAGCCGC TGGATGCCTA CGACTTCGAG
GGCCATGCCC ACCTGGCCAG CGTCCTGGAC ACCCCCATCG CCACCGGTGA GATGCTGGCC
TCCGTGGCGG AGCACAAGGG CCTGATCGAC GCCAGCGGCT GCGACATCAT CCAGCCTGAT
GCGCCGCGCG TCGGCGGCAT CACCCAGTTC CTGCGCCTGG CTGCCCTGGC GGACGAGCGG
GGCCTGGGCC TCGCACCGCA CTTCGCCATG GAAATCCACC TCCATCTCGC GGCCGCCTAC
CCCCGCGAAC CGTGGGTGGA GCACTTCGAC TGGCTCGACC CGCTGTTCAA TGAGCGCCTC
GAAACCAAGA ACGGCCGCAT GCTGGTTCCG GACCGCCCGG GCCTCGGCGT GTCCCTCAGC
GACCAGTCCC GCGCCTGGAC CACCGAGTCC GTGGAGTTCG GCGCGTAA
 
Protein sequence
MSTVDLIRHV KLSTARLPLA VPISDAKVFT GRQKPMTEVV FLFAEITTEQ GHSGIGFSYS 
KRAGGPAQYA HAKEVAEGII GEDPNDIGKI YTKLLWAGAS VGRSGVATQA LAAIDIALYD
LKAKRAGLPL AKLLGSYRDS VQTYNTSGGF LNASLDEVKA RATQSIDDGI GGIKIKVGLP
DSKEDLRRVA GIREHIGWDV PLMVDANQQW DRATALRMGR QLEEFNLIWI EEPLDAYDFE
GHAHLASVLD TPIATGEMLA SVAEHKGLID ASGCDIIQPD APRVGGITQF LRLAALADER
GLGLAPHFAM EIHLHLAAAY PREPWVEHFD WLDPLFNERL ETKNGRMLVP DRPGLGVSLS
DQSRAWTTES VEFGA
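
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be cleaned before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch, assuming plain DNA letters with no ambiguity codes. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCACCG TTGACCTCAT TCGCCATGTG AAACTTTCCA CAGCGAGGCT TCCCCTCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.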