Gene Hlac_1174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1174 
Symbol 
ID7400983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1180484 
End bp1181593 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content74% 
IMG OID643708239 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_002565838 
Protein GI222479601 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00378756 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGCTCC GCCCCTTCTC GCTCGCGCTG GCGAGTCCCC TCGAAACGGC GCGCGGCTCC 
ATCGATCGGC GCGAGGGATT TCTCGTCGCC GTCGACCCCG GAGCCGACGG CGAGTCGGTT
CCCGCGCCCG GCCTCGGCGA GGCGACGCCG CTCCCCGGCT GGACCGAGTC GCGGTCGGCC
TGCGAGGCGG CGCTTCGCGG GGCCGAAGAC GAGAACGATG GGAAGGCGCT CGCGACGGAT
GCGCTCGACC GACTCGACCC TACCGAGACG CCTGCCGCGC GCCACGGCCT CGCGCTCGCG
CTCGCGGATG CGACCGCTCG CGACGCCGGG CAGTCGCTGG CCGAGCGCCT CGCAGAGAAC
GAGAACCTGC CCGCGCCAAC CGAGACGGTC CCGGTCAACG CGACGATCGG TGACGTCGAC
TCGGAGGACA CCGTCGCCGC GGCCGAGAAC GCGGTCGAGA AGGGATTCGA CTGCCTGAAG
GTGAAGGTCG GCGCGCGCGG CCTCGATGCC GACATCGAGC GCCTTCGAGC CGTTCGACGG
GCAGTCGGCG GCGATGTCTC CCTCCGAGCC GACGCCAACG GTGCGTGGGA CCGGGAGACC
GCCCGGGAGG CGGTCGAGCG ACTTGCACCG CTCGACCTCG CGTACCTCGA ACAGCCGCTG
CCGGCCGACG ACCTCGACGG GGCGGCCGCC CTCAGAACGG TCGGGAGCGG TGTCGATACC
GACACCGACC GCGATCCCCC GGTCCCGATC GCCCTCGACG AGTCGCTCGC GACCCGCGGG
CTCGATGCGG TCCTCGATGC CGACGCCGCC GACGCCGTCG TCTTGAAACC GATGGCGCTC
GGAGGGCCGG ACCGAGCGCT GGCGGCGGCG AGACGGGCGC GGGAGGCCGG CGTCGAGCCG
GTCGTCACCA CCACGATCGA CGCGGTCGTC GCGCGCACCG CCGCGGTCCA CGTCGCCGCC
GCTATCCCAG ACGTATCCCC CTGCGGGCTC GCCACCGGCT CCCTGCTCGA CACGGACCTC
GCTCCGGATC CTTGCCCGAT CTCGGACGGC GCGGTGACGG TGCCGACCGA TCCCGGTCTG
GCCGGCGACG CCTTTGACGA CCTCCTGTAG
 
Protein sequence
MRLRPFSLAL ASPLETARGS IDRREGFLVA VDPGADGESV PAPGLGEATP LPGWTESRSA 
CEAALRGAED ENDGKALATD ALDRLDPTET PAARHGLALA LADATARDAG QSLAERLAEN
ENLPAPTETV PVNATIGDVD SEDTVAAAEN AVEKGFDCLK VKVGARGLDA DIERLRAVRR
AVGGDVSLRA DANGAWDRET AREAVERLAP LDLAYLEQPL PADDLDGAAA LRTVGSGVDT
DTDRDPPVPI ALDESLATRG LDAVLDADAA DAVVLKPMAL GGPDRALAAA RRAREAGVEP
VVTTTIDAVV ARTAAVHVAA AIPDVSPCGL ATGSLLDTDL APDPCPISDG AVTVPTDPGL
AGDAFDDLL