Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1174 |
Symbol | |
ID | 7400983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1180484 |
End bp | 1181593 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643708239 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002565838 |
Protein GI | 222479601 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01927] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00378756 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGCTCC GCCCCTTCTC GCTCGCGCTG GCGAGTCCCC TCGAAACGGC GCGCGGCTCC ATCGATCGGC GCGAGGGATT TCTCGTCGCC GTCGACCCCG GAGCCGACGG CGAGTCGGTT CCCGCGCCCG GCCTCGGCGA GGCGACGCCG CTCCCCGGCT GGACCGAGTC GCGGTCGGCC TGCGAGGCGG CGCTTCGCGG GGCCGAAGAC GAGAACGATG GGAAGGCGCT CGCGACGGAT GCGCTCGACC GACTCGACCC TACCGAGACG CCTGCCGCGC GCCACGGCCT CGCGCTCGCG CTCGCGGATG CGACCGCTCG CGACGCCGGG CAGTCGCTGG CCGAGCGCCT CGCAGAGAAC GAGAACCTGC CCGCGCCAAC CGAGACGGTC CCGGTCAACG CGACGATCGG TGACGTCGAC TCGGAGGACA CCGTCGCCGC GGCCGAGAAC GCGGTCGAGA AGGGATTCGA CTGCCTGAAG GTGAAGGTCG GCGCGCGCGG CCTCGATGCC GACATCGAGC GCCTTCGAGC CGTTCGACGG GCAGTCGGCG GCGATGTCTC CCTCCGAGCC GACGCCAACG GTGCGTGGGA CCGGGAGACC GCCCGGGAGG CGGTCGAGCG ACTTGCACCG CTCGACCTCG CGTACCTCGA ACAGCCGCTG CCGGCCGACG ACCTCGACGG GGCGGCCGCC CTCAGAACGG TCGGGAGCGG TGTCGATACC GACACCGACC GCGATCCCCC GGTCCCGATC GCCCTCGACG AGTCGCTCGC GACCCGCGGG CTCGATGCGG TCCTCGATGC CGACGCCGCC GACGCCGTCG TCTTGAAACC GATGGCGCTC GGAGGGCCGG ACCGAGCGCT GGCGGCGGCG AGACGGGCGC GGGAGGCCGG CGTCGAGCCG GTCGTCACCA CCACGATCGA CGCGGTCGTC GCGCGCACCG CCGCGGTCCA CGTCGCCGCC GCTATCCCAG ACGTATCCCC CTGCGGGCTC GCCACCGGCT CCCTGCTCGA CACGGACCTC GCTCCGGATC CTTGCCCGAT CTCGGACGGC GCGGTGACGG TGCCGACCGA TCCCGGTCTG GCCGGCGACG CCTTTGACGA CCTCCTGTAG
|
Protein sequence | MRLRPFSLAL ASPLETARGS IDRREGFLVA VDPGADGESV PAPGLGEATP LPGWTESRSA CEAALRGAED ENDGKALATD ALDRLDPTET PAARHGLALA LADATARDAG QSLAERLAEN ENLPAPTETV PVNATIGDVD SEDTVAAAEN AVEKGFDCLK VKVGARGLDA DIERLRAVRR AVGGDVSLRA DANGAWDRET AREAVERLAP LDLAYLEQPL PADDLDGAAA LRTVGSGVDT DTDRDPPVPI ALDESLATRG LDAVLDADAA DAVVLKPMAL GGPDRALAAA RRAREAGVEP VVTTTIDAVV ARTAAVHVAA AIPDVSPCGL ATGSLLDTDL APDPCPISDG AVTVPTDPGL AGDAFDDLL
|
| |