Gene Hlac_1622 details


Gene Information

Locus tag: Hlac_1622
Symbol:
ID: 7399571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1642325
End bp: 1643374
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 71%
IMG OID: 643708688
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_002566277
Protein GI: 222480040
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID: [TIGR01928] o-succinylbenzoic acid (OSB) synthetase
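
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the record above.
start_bp = 1642325
end_bp = 1643374
gene_length_bp = 1050
protein_length_aa = 349


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```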


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.624827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCG AAACGGAGTT CGAGCGGGTC TCGCTTCCCT TGGAGAACCC GTTCACGATC 
TTCAGGGGCA CCCAGACGGA CGCCGAGAAC GTGATCGTGA AGATCGCGGA CGAGGCCGGA
ATGACCGGCG TCGGCGGCGC GGCTCCCTCG GCTCACTACG GCGAGACCGC GGACACCGTC
GAGGCCGTGC TCCCGGATCT GCTCGACGCC GTCGAGCGCG TCGGCGACCC CCACGCGCTC
CACGAGATCG AGGCCGAACT GGCGGCCGTC GTCAACGGCA ACCCCGCCGC CCGCGCCGCC
GTCTCGATCG CGGTCCACGA CCTCGCCGCG AAGCGGCTCG GCGTCCCCCT CCACCGTCTC
TGGGGGCTCG ACCCCACGGC CGCGCCCGCG ACCTCCTACA CGATCGGACT CGACGAGACG
GAGCGCGTCC GTGAGAAGGC CGAGGCCGCG GTCGATGCCG GCTACCCGAT CCTCAAGATC
AAGCTCGGGA CCGACCGAGA CCGCGAGCTG ATCGACGCGG TCCGCGAGGC CGCGCCCGAC
GCCCGGCTCC GGGTCGACGC GAACGAGGCG TGGACGCCCC GCGAGGCGGT CCGGAAGTGC
GAGTGGCTCG CCGATCGCGA CGTGGAATTC GTCGAGCAAC CGGTGCCCGC CGAGGACCCG
GAGGGGCTCC GGTTCGTCTA CGAGCGGTCG GCGCTCCCCG TCGCCGCCGA CGAGTCCTGC
GTGACGCTCT CCGACATCCC CGCGATCGCC GACCGGTGTG ACATCGCGAA CCTGAAGCTG
ATGAAGACCG GCGGCCTGCT GGAGGCGAAA CGGATGATCG CCGCCGCGCG CGCTCACGGG
CTGGAAGTGA TGTGCGGCTG CATGATCGAG TCGAACGCCT CGATCGCTGC GGCCGCGCAG
CTCGCGCCCC TACTCGACTA CGCCGACCTC GACGGGTCGC TGCTGCTCGC CGAGGACCAG
TACGACGGGA TCGAAATGGG AGGCGGCGAG ATCCGGCTCG GGGACCAGGA GCGGGCGGGG
ACCGGCGCCC GCCCGAGCGC GGAGCAGTAG
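
A GC content figure like the one listed for this gene is the percentage of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks as above. The function name is hypothetical, and the example applies it only to the first line of the gene sequence, so the result need not equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First line of the gene sequence above; paste the full sequence
# to get the whole-gene figure.
first_line = "ATGACCCTCG AAACGGAGTT CGAGCGGGTC TCGCTTCCCT TGGAGAACCC GTTCACGATC"
print(f"{gc_content(first_line):.1f}%")
```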
 
Protein sequence
MTLETEFERV SLPLENPFTI FRGTQTDAEN VIVKIADEAG MTGVGGAAPS AHYGETADTV 
EAVLPDLLDA VERVGDPHAL HEIEAELAAV VNGNPAARAA VSIAVHDLAA KRLGVPLHRL
WGLDPTAAPA TSYTIGLDET ERVREKAEAA VDAGYPILKI KLGTDRDREL IDAVREAAPD
ARLRVDANEA WTPREAVRKC EWLADRDVEF VEQPVPAEDP EGLRFVYERS ALPVAADESC
VTLSDIPAIA DRCDIANLKL MKTGGLLEAK RMIAAARAHG LEVMCGCMIE SNASIAAAAQ
LAPLLDYADL DGSLLLAEDQ YDGIEMGGGE IRLGDQERAG TGARPSAEQ
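
The protein sequence corresponds to the gene sequence read codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it permits, so a plain standard-code lookup is enough for a gene that begins with ATG, as this one does. The sketch below is a minimal illustration with hypothetical names; it checks only the first and last 30 bases of the gene against the corresponding ends of the protein.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
gene_start = "ATGACCCTCG AAACGGAGTT CGAGCGGGTC"
gene_end = "ACCGGCGCCC GCCCGAGCGC GGAGCAGTAG"

print(translate(gene_start) == "MTLETEFERV")
print(translate(gene_end) == "TGARPSAEQ")
```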