Gene Hlac_1300 details


Gene Information

Locus tag: Hlac_1300
Symbol:
ID: 7399395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1309950
End bp: 1311188
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 67%
IMG OID: 643708364
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_002565962
Protein GI: 222479725
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
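
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1309950, 1311188)
print(length, protein_length(length))  # 1239 412
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.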


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGAA ACTACGAGTC GCTCCACGAT CCGAACGCGG AGTACACGAT GCGAGAACTC 
TCCGCCGAGA CGATGGGAAC AACGGGCTCG CGCGGCGGCG GGCGCGACGT CGAGATCACG
GACGTGCAGA CGACGATGGT CGACGGGAAC TTCCCGTGGA CGCTGGTTCG CGTGTACACG
GACGCGGGCG TCGTGGGTAC CGGCGAGGCG TACTGGGGCG CCGGCGTGCC GGAGCTCATC
GAGCGCATGA AGCCGTTCGT GATCGGCGAG AACCCGCTCG ACATCGACCG GCTGTACGAA
CACCTCATCC AGAAGATGTC GGGCGAGGGC TCCGTCGAGG GCGTCACTGT CACCGCGATC
TCCGGGATCG AGGTGGCGCT GCACGACGTG GCCGGCAAGA TCCTCGACGT GCCCGCCTAC
CAGCTGCTCG GTGGCAAGTA CCGCGACAAG ATGCGCGTCT ACTGCGACTG CCACACCGAG
GAGGAGGCTG ACCCCGAGGC GTGCGCCGAC GAGGCCGAGC GCGTCGTCGA CGAACTCGGC
TACGACGCCC TGAAGTTCGA CCTCGACGTG CCGAGCGGCT TCGAGAAGGA CCGCGCGAAC
CGCCACCTGC GCCCCGGTGA GATCCGCCAC AAGGCGGAGA TCGTCGAGAC GGTCACCGAG
CGCGTGAAGG ACAAGGCCGA CGTGGCCTTC GACTGCCACT GGACGTTCTC GGGCGGCTCC
GCGAAGCGCC TCGCGGACGC CATCGAGGAG TACGACGTGT GGTGGCTCGA AGACCCCGTG
CCGCCGGAGA ACCTCGAAGT GCAGGAGGAA GTGACGAAGA ACACGGTCAC GCCGATCGCG
GTCGGCGAGA ACCGCTACCG CGTCACGGAG CTTCGCCGCC TCATCGAGAA TCAGGCGGTC
GACATCGTCG CGCCCGACCT GCCGAAGGTC GGCGGGATGC GCGAAACCCG CAAGATCGCG
GACGTGGCGA ACCAGTACTA CGTCCCGGTC GCGATGCACA ACGTCGCCTC CCCGATCGCG
ACGATGGCGG CGACCCACGT CGGCGCCGCG ATCCCGAACT CGCTCGCGAT CGAGTACCAT
TCCTACGAGC TCGGCTGGTG GGAGGACCTC GTCGAGGAGG ACGTCATCGA GGACGGCTAC
ATCGAGGTGC CGGAGAAGCC CGGTATCGGC CTCACACTCG ACATGGACAC CGTCGAGGAG
CATATGGTCG AAGGCGAGAC GCTGTTCGAT CCGGCGTAA
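
The sequence is listed in blocks of 10 bases, so it has to be joined before any computation. The sketch below is illustrative only (the helper names are hypothetical): it strips the block spacing and computes a GC fraction, shown here on the last line of the listing alone. Running `gc_fraction` over the full joined sequence is how the GC content field could be checked.

```python
def clean_sequence(blocked):
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence listing above, used as a small example.
last_line = "CATATGGTCG AAGGCGAGAC GCTGTTCGAT CCGGCGTAA"
seq = clean_sequence(last_line)
print(len(seq), seq[-3:], round(gc_fraction(seq), 3))
```

The final three bases of the cleaned line are the stop codon TAA, consistent with a complete CDS.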
 
Protein sequence
MSRNYESLHD PNAEYTMREL SAETMGTTGS RGGGRDVEIT DVQTTMVDGN FPWTLVRVYT 
DAGVVGTGEA YWGAGVPELI ERMKPFVIGE NPLDIDRLYE HLIQKMSGEG SVEGVTVTAI
SGIEVALHDV AGKILDVPAY QLLGGKYRDK MRVYCDCHTE EEADPEACAD EAERVVDELG
YDALKFDLDV PSGFEKDRAN RHLRPGEIRH KAEIVETVTE RVKDKADVAF DCHWTFSGGS
AKRLADAIEE YDVWWLEDPV PPENLEVQEE VTKNTVTPIA VGENRYRVTE LRRLIENQAV
DIVAPDLPKV GGMRETRKIA DVANQYYVPV AMHNVASPIA TMAATHVGAA IPNSLAIEYH
SYELGWWEDL VEEDVIEDGY IEVPEKPGIG LTLDMDTVEE HMVEGETLFD PA