Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1622 |
Symbol | |
ID | 7399571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1642325 |
End bp | 1643374 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708688 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002566277 |
Protein GI | 222480040 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01928] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.624827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTCG AAACGGAGTT CGAGCGGGTC TCGCTTCCCT TGGAGAACCC GTTCACGATC TTCAGGGGCA CCCAGACGGA CGCCGAGAAC GTGATCGTGA AGATCGCGGA CGAGGCCGGA ATGACCGGCG TCGGCGGCGC GGCTCCCTCG GCTCACTACG GCGAGACCGC GGACACCGTC GAGGCCGTGC TCCCGGATCT GCTCGACGCC GTCGAGCGCG TCGGCGACCC CCACGCGCTC CACGAGATCG AGGCCGAACT GGCGGCCGTC GTCAACGGCA ACCCCGCCGC CCGCGCCGCC GTCTCGATCG CGGTCCACGA CCTCGCCGCG AAGCGGCTCG GCGTCCCCCT CCACCGTCTC TGGGGGCTCG ACCCCACGGC CGCGCCCGCG ACCTCCTACA CGATCGGACT CGACGAGACG GAGCGCGTCC GTGAGAAGGC CGAGGCCGCG GTCGATGCCG GCTACCCGAT CCTCAAGATC AAGCTCGGGA CCGACCGAGA CCGCGAGCTG ATCGACGCGG TCCGCGAGGC CGCGCCCGAC GCCCGGCTCC GGGTCGACGC GAACGAGGCG TGGACGCCCC GCGAGGCGGT CCGGAAGTGC GAGTGGCTCG CCGATCGCGA CGTGGAATTC GTCGAGCAAC CGGTGCCCGC CGAGGACCCG GAGGGGCTCC GGTTCGTCTA CGAGCGGTCG GCGCTCCCCG TCGCCGCCGA CGAGTCCTGC GTGACGCTCT CCGACATCCC CGCGATCGCC GACCGGTGTG ACATCGCGAA CCTGAAGCTG ATGAAGACCG GCGGCCTGCT GGAGGCGAAA CGGATGATCG CCGCCGCGCG CGCTCACGGG CTGGAAGTGA TGTGCGGCTG CATGATCGAG TCGAACGCCT CGATCGCTGC GGCCGCGCAG CTCGCGCCCC TACTCGACTA CGCCGACCTC GACGGGTCGC TGCTGCTCGC CGAGGACCAG TACGACGGGA TCGAAATGGG AGGCGGCGAG ATCCGGCTCG GGGACCAGGA GCGGGCGGGG ACCGGCGCCC GCCCGAGCGC GGAGCAGTAG
|
Protein sequence | MTLETEFERV SLPLENPFTI FRGTQTDAEN VIVKIADEAG MTGVGGAAPS AHYGETADTV EAVLPDLLDA VERVGDPHAL HEIEAELAAV VNGNPAARAA VSIAVHDLAA KRLGVPLHRL WGLDPTAAPA TSYTIGLDET ERVREKAEAA VDAGYPILKI KLGTDRDREL IDAVREAAPD ARLRVDANEA WTPREAVRKC EWLADRDVEF VEQPVPAEDP EGLRFVYERS ALPVAADESC VTLSDIPAIA DRCDIANLKL MKTGGLLEAK RMIAAARAHG LEVMCGCMIE SNASIAAAAQ LAPLLDYADL DGSLLLAEDQ YDGIEMGGGE IRLGDQERAG TGARPSAEQ
|
| |