Gene Information |
Locus tag | Hlac_1300 |
Symbol | |
ID | 7399395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1309950 |
End bp | 1311188 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643708364 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002565962 |
Protein GI | 222479725 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAA ACTACGAGTC GCTCCACGAT CCGAACGCGG AGTACACGAT GCGAGAACTC TCCGCCGAGA CGATGGGAAC AACGGGCTCG CGCGGCGGCG GGCGCGACGT CGAGATCACG GACGTGCAGA CGACGATGGT CGACGGGAAC TTCCCGTGGA CGCTGGTTCG CGTGTACACG GACGCGGGCG TCGTGGGTAC CGGCGAGGCG TACTGGGGCG CCGGCGTGCC GGAGCTCATC GAGCGCATGA AGCCGTTCGT GATCGGCGAG AACCCGCTCG ACATCGACCG GCTGTACGAA CACCTCATCC AGAAGATGTC GGGCGAGGGC TCCGTCGAGG GCGTCACTGT CACCGCGATC TCCGGGATCG AGGTGGCGCT GCACGACGTG GCCGGCAAGA TCCTCGACGT GCCCGCCTAC CAGCTGCTCG GTGGCAAGTA CCGCGACAAG ATGCGCGTCT ACTGCGACTG CCACACCGAG GAGGAGGCTG ACCCCGAGGC GTGCGCCGAC GAGGCCGAGC GCGTCGTCGA CGAACTCGGC TACGACGCCC TGAAGTTCGA CCTCGACGTG CCGAGCGGCT TCGAGAAGGA CCGCGCGAAC CGCCACCTGC GCCCCGGTGA GATCCGCCAC AAGGCGGAGA TCGTCGAGAC GGTCACCGAG CGCGTGAAGG ACAAGGCCGA CGTGGCCTTC GACTGCCACT GGACGTTCTC GGGCGGCTCC GCGAAGCGCC TCGCGGACGC CATCGAGGAG TACGACGTGT GGTGGCTCGA AGACCCCGTG CCGCCGGAGA ACCTCGAAGT GCAGGAGGAA GTGACGAAGA ACACGGTCAC GCCGATCGCG GTCGGCGAGA ACCGCTACCG CGTCACGGAG CTTCGCCGCC TCATCGAGAA TCAGGCGGTC GACATCGTCG CGCCCGACCT GCCGAAGGTC GGCGGGATGC GCGAAACCCG CAAGATCGCG GACGTGGCGA ACCAGTACTA CGTCCCGGTC GCGATGCACA ACGTCGCCTC CCCGATCGCG ACGATGGCGG CGACCCACGT CGGCGCCGCG ATCCCGAACT CGCTCGCGAT CGAGTACCAT TCCTACGAGC TCGGCTGGTG GGAGGACCTC GTCGAGGAGG ACGTCATCGA GGACGGCTAC ATCGAGGTGC CGGAGAAGCC CGGTATCGGC CTCACACTCG ACATGGACAC CGTCGAGGAG CATATGGTCG AAGGCGAGAC GCTGTTCGAT CCGGCGTAA
|
Protein sequence | MSRNYESLHD PNAEYTMREL SAETMGTTGS RGGGRDVEIT DVQTTMVDGN FPWTLVRVYT DAGVVGTGEA YWGAGVPELI ERMKPFVIGE NPLDIDRLYE HLIQKMSGEG SVEGVTVTAI SGIEVALHDV AGKILDVPAY QLLGGKYRDK MRVYCDCHTE EEADPEACAD EAERVVDELG YDALKFDLDV PSGFEKDRAN RHLRPGEIRH KAEIVETVTE RVKDKADVAF DCHWTFSGGS AKRLADAIEE YDVWWLEDPV PPENLEVQEE VTKNTVTPIA VGENRYRVTE LRRLIENQAV DIVAPDLPKV GGMRETRKIA DVANQYYVPV AMHNVASPIA TMAATHVGAA IPNSLAIEYH SYELGWWEDL VEEDVIEDGY IEVPEKPGIG LTLDMDTVEE HMVEGETLFD PA
|
| |
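The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends in a stop codon (both assumptions are consistent with the values listed above).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a sequence; whitespace between 10-base blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the record for Hlac_1300.
length = gene_length(1309950, 1311188)
protein = expected_protein_length(length)
print(length, protein)  # prints: 1239 412
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare with the reported GC content; the blocks of ten bases can be pasted as-is because whitespace is stripped first.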