Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2242 |
Symbol | |
ID | 7399952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2229535 |
End bp | 2230773 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643709316 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002566889 |
Protein GI | 222480652 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.464149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.318777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGATC ACGACAAGCT CCGCGACCCC AACGCGGAGT ACACCATGCG GGACCTCTCG GCGGAGACGA TGGACATCAC GAACTCGCGG GGCGGTGTCC GTGACGCCGA GATTACGGAC GTACAGACGA CGATGGTCGA CGGCAACTAC CCGTGGATCC TCGTCCGCGT CTACACCGAC GCGGGCGTCG TCGGCACCGG GGAGTCCTAC TGGGGCGGCG GCGACACCGC CATCATCGAG CGCATGAAAC CGTTCATCGT CGGGGAGAAC CCGCTCGATA TCGACCGGCT GTACGAGCAC CTCGTCCAGA AGATGTCGGG TGAGGGCTCG ATCTCGGGGA AGGTCATCTC CGCCATCTCC GGTATCGAGA TCGCGCTCCA CGACGCCGCC GGGAAGCTCC TCGACGTGCC CGCCTACCAG CTCGTCGGCG GGAAGTACCG CGACGAGGTC CGGGTCTACT GCGACCTCCA CACCGAGAAC GAGGCCGACC CGCAGGCGTG CGCCGCCGAG GCCGAGCGCG TCGTCGAGAA CTTCGGCTAC GACGCCATCA AGTTCGACCT CGATGTGCCG TCCGGCCACG AGAAAGACCG CGCCAACCGC CACCTCCGCA ACCCCGAGAT CGATCACAAG GTCGACATCG TGGAGGCGAC CACCGAGGCC GTCGGCGACA AGGCCGACGT GGCCTTCGAC TGCCACTGGT CGTTCACCGG CGGCTCCGCG AAGCGCCTCG CGGAGGCCCT CGAGGAGTAC GACGTGTGGT GGCTCGAAGA CCCCGTGCCG CCGGAGAACC ACGACGTGCA AGAGGAAGTG ACGAAGTCGA CGACGACGCC CATCGCGGTC GGCGAGAACG TCTATCGGAA GCACGGCCAG CGGACCCTCC TCGAACCGCA GGCCGTCGAC ATCGTCGCGC CCGACCTGCC GCGCGTCGGC GGGATGCGCG AGACCCGCAA GATCGCGGAT CTGGCGGACA TGTACTACAT CCCGGTGGCG ATGCACAACG TCTCCTCGCC GATTGGCACG ATGGCGTCCG CGCACGTCGG CGCCGCCATT CCGAACTCGC TCGCACTGGA GTACCACTCC TACGAGCTCG GCTGGTGGGA AGATCTGGTC GAAGAGGACA ATCTCATCGA AGAGGGCCGT ATGGAGATCC CGGAGGAACC CGGTCTCGGC CTGACGCTGA ACCTTGACGC CGTCGGAGAG CACATGGTCG AAGGCGAGAC GTTGTTCGAC GAGGCGTGA
|
Protein sequence | MVDHDKLRDP NAEYTMRDLS AETMDITNSR GGVRDAEITD VQTTMVDGNY PWILVRVYTD AGVVGTGESY WGGGDTAIIE RMKPFIVGEN PLDIDRLYEH LVQKMSGEGS ISGKVISAIS GIEIALHDAA GKLLDVPAYQ LVGGKYRDEV RVYCDLHTEN EADPQACAAE AERVVENFGY DAIKFDLDVP SGHEKDRANR HLRNPEIDHK VDIVEATTEA VGDKADVAFD CHWSFTGGSA KRLAEALEEY DVWWLEDPVP PENHDVQEEV TKSTTTPIAV GENVYRKHGQ RTLLEPQAVD IVAPDLPRVG GMRETRKIAD LADMYYIPVA MHNVSSPIGT MASAHVGAAI PNSLALEYHS YELGWWEDLV EEDNLIEEGR MEIPEEPGLG LTLNLDAVGE HMVEGETLFD EA
|
| |