Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0032 |
Symbol | |
ID | 4447502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 37965 |
End bp | 39092 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639687826 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_829533 |
Protein GI | 116668600 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG TTGACCTCAT TCGCCATGTG AAACTTTCCA CAGCGAGGCT TCCCCTCGCC GTGCCGATCA GTGATGCCAA GGTATTCACC GGCCGCCAGA AGCCCATGAC CGAAGTGGTG TTCCTGTTCG CTGAAATCAC CACCGAACAG GGCCACAGCG GCATCGGCTT CAGCTACTCC AAGCGCGCCG GCGGACCGGC CCAGTACGCG CATGCTAAAG AGGTGGCCGA AGGAATCATC GGCGAGGACC CAAACGACAT CGGCAAGATC TACACGAAGC TGCTCTGGGC CGGCGCCTCC GTGGGCCGCT CGGGCGTGGC CACTCAGGCG CTGGCCGCCA TCGACATCGC CCTCTACGAC CTCAAGGCAA AGCGCGCCGG GCTTCCCCTG GCCAAGCTCC TGGGCTCCTA TCGCGACTCG GTCCAGACGT ACAACACGTC CGGTGGCTTC CTGAATGCCT CCCTGGATGA GGTCAAGGCC CGCGCCACCC AGTCCATCGA CGACGGAATC GGCGGCATCA AGATCAAGGT TGGCCTCCCC GACAGCAAGG AGGACCTGCG CCGCGTGGCC GGAATCCGCG AACACATCGG TTGGGACGTG CCGCTCATGG TGGACGCCAA CCAGCAGTGG GACCGCGCCA CTGCCCTGCG GATGGGCCGG CAGCTCGAGG AATTCAACCT CATCTGGATT GAAGAGCCGC TGGATGCCTA CGACTTCGAG GGCCATGCCC ACCTGGCCAG CGTCCTGGAC ACCCCCATCG CCACCGGTGA GATGCTGGCC TCCGTGGCGG AGCACAAGGG CCTGATCGAC GCCAGCGGCT GCGACATCAT CCAGCCTGAT GCGCCGCGCG TCGGCGGCAT CACCCAGTTC CTGCGCCTGG CTGCCCTGGC GGACGAGCGG GGCCTGGGCC TCGCACCGCA CTTCGCCATG GAAATCCACC TCCATCTCGC GGCCGCCTAC CCCCGCGAAC CGTGGGTGGA GCACTTCGAC TGGCTCGACC CGCTGTTCAA TGAGCGCCTC GAAACCAAGA ACGGCCGCAT GCTGGTTCCG GACCGCCCGG GCCTCGGCGT GTCCCTCAGC GACCAGTCCC GCGCCTGGAC CACCGAGTCC GTGGAGTTCG GCGCGTAA
|
Protein sequence | MSTVDLIRHV KLSTARLPLA VPISDAKVFT GRQKPMTEVV FLFAEITTEQ GHSGIGFSYS KRAGGPAQYA HAKEVAEGII GEDPNDIGKI YTKLLWAGAS VGRSGVATQA LAAIDIALYD LKAKRAGLPL AKLLGSYRDS VQTYNTSGGF LNASLDEVKA RATQSIDDGI GGIKIKVGLP DSKEDLRRVA GIREHIGWDV PLMVDANQQW DRATALRMGR QLEEFNLIWI EEPLDAYDFE GHAHLASVLD TPIATGEMLA SVAEHKGLID ASGCDIIQPD APRVGGITQF LRLAALADER GLGLAPHFAM EIHLHLAAAY PREPWVEHFD WLDPLFNERL ETKNGRMLVP DRPGLGVSLS DQSRAWTTES VEFGA
|
| |