Gene Hlac_2242 details

Gene Information

Locus tag: Hlac_2242
Symbol:
ID: 7399952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2229535
End bp: 2230773
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 66%
IMG OID: 643709316
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_002566889
Protein GI: 222480652
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
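
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2229535
end_bp = 2230773
gene_length_bp = 1239
protein_length_aa = 412

# Assumption: coordinates are 1-based and inclusive at both ends.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = span // 3
expected_protein_length = codons - 1

# prints: True True True
print(span == gene_length_bp, span % 3 == 0, expected_protein_length == protein_length_aa)
```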


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.464149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.318777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGATC ACGACAAGCT CCGCGACCCC AACGCGGAGT ACACCATGCG GGACCTCTCG 
GCGGAGACGA TGGACATCAC GAACTCGCGG GGCGGTGTCC GTGACGCCGA GATTACGGAC
GTACAGACGA CGATGGTCGA CGGCAACTAC CCGTGGATCC TCGTCCGCGT CTACACCGAC
GCGGGCGTCG TCGGCACCGG GGAGTCCTAC TGGGGCGGCG GCGACACCGC CATCATCGAG
CGCATGAAAC CGTTCATCGT CGGGGAGAAC CCGCTCGATA TCGACCGGCT GTACGAGCAC
CTCGTCCAGA AGATGTCGGG TGAGGGCTCG ATCTCGGGGA AGGTCATCTC CGCCATCTCC
GGTATCGAGA TCGCGCTCCA CGACGCCGCC GGGAAGCTCC TCGACGTGCC CGCCTACCAG
CTCGTCGGCG GGAAGTACCG CGACGAGGTC CGGGTCTACT GCGACCTCCA CACCGAGAAC
GAGGCCGACC CGCAGGCGTG CGCCGCCGAG GCCGAGCGCG TCGTCGAGAA CTTCGGCTAC
GACGCCATCA AGTTCGACCT CGATGTGCCG TCCGGCCACG AGAAAGACCG CGCCAACCGC
CACCTCCGCA ACCCCGAGAT CGATCACAAG GTCGACATCG TGGAGGCGAC CACCGAGGCC
GTCGGCGACA AGGCCGACGT GGCCTTCGAC TGCCACTGGT CGTTCACCGG CGGCTCCGCG
AAGCGCCTCG CGGAGGCCCT CGAGGAGTAC GACGTGTGGT GGCTCGAAGA CCCCGTGCCG
CCGGAGAACC ACGACGTGCA AGAGGAAGTG ACGAAGTCGA CGACGACGCC CATCGCGGTC
GGCGAGAACG TCTATCGGAA GCACGGCCAG CGGACCCTCC TCGAACCGCA GGCCGTCGAC
ATCGTCGCGC CCGACCTGCC GCGCGTCGGC GGGATGCGCG AGACCCGCAA GATCGCGGAT
CTGGCGGACA TGTACTACAT CCCGGTGGCG ATGCACAACG TCTCCTCGCC GATTGGCACG
ATGGCGTCCG CGCACGTCGG CGCCGCCATT CCGAACTCGC TCGCACTGGA GTACCACTCC
TACGAGCTCG GCTGGTGGGA AGATCTGGTC GAAGAGGACA ATCTCATCGA AGAGGGCCGT
ATGGAGATCC CGGAGGAACC CGGTCTCGGC CTGACGCTGA ACCTTGACGC CGTCGGAGAG
CACATGGTCG AAGGCGAGAC GTTGTTCGAC GAGGCGTGA
 
Protein sequence
MVDHDKLRDP NAEYTMRDLS AETMDITNSR GGVRDAEITD VQTTMVDGNY PWILVRVYTD 
AGVVGTGESY WGGGDTAIIE RMKPFIVGEN PLDIDRLYEH LVQKMSGEGS ISGKVISAIS
GIEIALHDAA GKLLDVPAYQ LVGGKYRDEV RVYCDLHTEN EADPQACAAE AERVVENFGY
DAIKFDLDVP SGHEKDRANR HLRNPEIDHK VDIVEATTEA VGDKADVAFD CHWSFTGGSA
KRLAEALEEY DVWWLEDPVP PENHDVQEEV TKSTTTPIAV GENVYRKHGQ RTLLEPQAVD
IVAPDLPRVG GMRETRKIAD LADMYYIPVA MHNVSSPIGT MASAHVGAAI PNSLALEYHS
YELGWWEDLV EEDNLIEEGR MEIPEEPGLG LTLNLDAVGE HMVEGETLFD EA
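
The 10-letter blocks above can be joined and translated to check that the gene and protein sequences agree. The following is a minimal sketch in plain Python, assuming the standard codon assignments (which NCBI translation table 11 shares for non-start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first row of each sequence is used here for brevity.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First row of each sequence, copied from the record above.
gene_row = "ATGGTCGATC ACGACAAGCT CCGCGACCCC AACGCGGAGT ACACCATGCG GGACCTCTCG"
protein_row = "MVDHDKLRDP NAEYTMRDLS"

dna = clean(gene_row)
print(translate(dna) == clean(protein_row))  # True for this row
print(round(gc_fraction(dna), 2))            # GC fraction of this row only
```

Pasting the full gene sequence into `gene_row` and the full protein sequence into `protein_row` extends the same check to the whole record; the GC fraction of the full gene can then be compared with the reported 66%.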