Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0941 |
Symbol | |
ID | 8410457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 903025 |
End bp | 904194 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645019276 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003176777 |
Protein GI | 257387004 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.748604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.464463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATCA CCGACGTGAC AGCGACGACT GTCGACGTAC CGCTGGTGGA CCTCGACGAA CGTCTGGGCA TCGGCCCGTA CGTGACGAAC CACGGACGCG TCGAATCGAT GGAACGTGTC CTCGTCCGCG TGGACACCGA CGAGGGCATC GCCGGCTGGG GCGAGATGCG GACCTTCCTC TCACCGGCGG CCACGGAGTC GATCATCGAA GACGGAATCG GCCCGCTGAT CGAAGGCCAG TCGCCCTTCG AGGTCGAACG GCTCCGCCGA CAGGTGTTCA TCGAGTACAC CAACATCGAA CTGTTCTTCG CCGCCGTCGA GACTGCCTGC TGGGACATCG CCGGCAAGGC GCTCGACAAG CCGGTCTTCG AACTGCTCGG CGGGGCGACC GCACCGTACC AGACGACCGC GATGAACGCC GCCGCTGGCG GTGCCGACGC CGCCGACGAC CGCGCGGTCG AGTTCGCCTT CTGTCTGGGT ATCCTCTCTC CGGAAGAGTC CCGCGTGAAG GCCCAGGAAG CCCTCGACGC CGGCTTCTCC GTCCTCAAGA CCAAGGCCGG CCGTGACTGG CAACAGGACG TGGCCCGCAT CCAGGCGATG CACGACGAGG TCGACGGGCA ACTGGAGTTC CGGCTGGACC CCAATCAGGG GTGGTCGCTC GATCAGGCGG TTCGCGTCGG GGCCGCCCTC GAAGAGTCCG GCATCTTCCT GCAGTACATG GAACAGCCCA TCCGCGTCAA CGCCCACGAC TCGCTGGCGA CACTGCGCCA GCGTCTCCGC CAGCCGATCG CTCCCAACGA GGACACGTAC ATCCCCAACA ACCTCCGCTC GCTCGTCGAG GCCGGTGCGA TGGACGTGGC GGTCCTCGAT CTGACGCCAG CGGGCGGCAT CGCGGGACTG CGACAGCAGG CGGCCATCGT CGAGGACGCC GGCGTTCCCT TCACCCACCA CTGCGCGTTC GATCTCGGGA TCCGGACGGC CGCAATCTTG CACGCGGTCC ACGGGATCCC CGGATTTTCC CTCCCGCCGG ACACGACCTA CTACGGCTGG GAAGACGACG TCATCGCGGA CCCCTTCACG GTCGAGGAGG GCCGGATGAC CGTGCCGGAC GAACCCGGCC TCGGAATCGA CGTGGACCTC GACACCGTCG AGGAGTACCG CGTCCGATAG
|
Protein sequence | MEITDVTATT VDVPLVDLDE RLGIGPYVTN HGRVESMERV LVRVDTDEGI AGWGEMRTFL SPAATESIIE DGIGPLIEGQ SPFEVERLRR QVFIEYTNIE LFFAAVETAC WDIAGKALDK PVFELLGGAT APYQTTAMNA AAGGADAADD RAVEFAFCLG ILSPEESRVK AQEALDAGFS VLKTKAGRDW QQDVARIQAM HDEVDGQLEF RLDPNQGWSL DQAVRVGAAL EESGIFLQYM EQPIRVNAHD SLATLRQRLR QPIAPNEDTY IPNNLRSLVE AGAMDVAVLD LTPAGGIAGL RQQAAIVEDA GVPFTHHCAF DLGIRTAAIL HAVHGIPGFS LPPDTTYYGW EDDVIADPFT VEEGRMTVPD EPGLGIDVDL DTVEEYRVR
|
| |