Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2654 |
Symbol | |
ID | 8743267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2724542 |
End bp | 2725780 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646513242 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003404203 |
Protein GI | 284165924 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGTCG ATTACTCACA GCTGCACGAC CCGAACGCCG AGTATACGAT GCGCGACCTC TCGGCGGAGA CGATGCAGGT CACCCGCGAA CGCGGGGGCG GCCGCGACGT CGAGATCACG GACATCCAGA CGACGATGAT CGACGGGAAC TTCCCGTGGA CGCTGGTCCG GATCTACACC GACGCCGGCA TCGTCGGCAC CGGCGAAGCC TACTGGGGCG CCGGCGTCCC GGAACTGATC CAGCGGATGA CGCCCTTCCT GCAGGGCGAG AACCCGCTCG ACATCGATCG TCTGACCGAG CACCTCGTCC AGAAGATGTC CGGCGAGGGG TCGATCGCCG GCGTCACCGT CACCGCCATC GCGGGCATCG AGGTCGCGCT GCACGATCTG GCGGGCAAGA TCCTCGAGGT GCCGGCCTAC CAGCTGCTGG GTGGGAAGTA CCGCGACGAG GTCCGGGTCT ACTGTGACTG TCACACCGAG GACGAGGCCG ACCCCGACGC CTGCGCCGAC GAGGCCGAAC GCGTCGTCGA GGAGCTGGGA TACGACGCGC TGAAGTTCGA CCTCGACGTC CCGTCGGGTC ACGAGAAGGA TCGGGCGAAC CGCCACCTCC GCGAGCCCGA GATCGAACAC AAGGCCGAAA TCGTGGAGAA GGTCACCGAG CGCGTCGGCG ACCGCGCCGA CGTCGCCTTC GACTGCCACT GGACGTTCTC CGGCGGCAGC GCGAAGCGCC TCGCGAAGCG TCTCGAGGAG TACGACGTCT GGTGGCTCGA GGACCCCGTC CCGCCGGAGA ACCACGACGT CCAGCAGGAG GTCACCCAGT CGACCGAGAC GCCGATCACC GTCGGCGAGA ACGTCTACCG GAAACACGGC CAGCGCCGCC TGCTCGAGGA GCAGGCCGTC GATATCATCG CGCCGGACAT GCCCAAAGTC GGCGGGATGC GCGAGACCCG AAAGATCGCC GACCTCGCGG ACATGTACTA CGTTCCGGTC GCGATGCACA ACGTCTCCTC GCCCGTGGCG ACGGTCGCCA GCGCCCACGT CGGTGCGGCG ATTCCGAACT CGCTCGCCGT TGAGTACCAC TCCTACGAAC TCGGCTGGTG GGAGGACCTC GTCGAGGAGG ACGTCATCGA GGACGGCTAC ATCGAGATCC CCGAGGAACC GGGGCTCGGC GTGACGCTCG ATATGGACGC CGTCGAGGAA CACATGATCG AGGGCGAGGA GCTGTTTGAC GCGGCGTAA
|
Protein sequence | MGVDYSQLHD PNAEYTMRDL SAETMQVTRE RGGGRDVEIT DIQTTMIDGN FPWTLVRIYT DAGIVGTGEA YWGAGVPELI QRMTPFLQGE NPLDIDRLTE HLVQKMSGEG SIAGVTVTAI AGIEVALHDL AGKILEVPAY QLLGGKYRDE VRVYCDCHTE DEADPDACAD EAERVVEELG YDALKFDLDV PSGHEKDRAN RHLREPEIEH KAEIVEKVTE RVGDRADVAF DCHWTFSGGS AKRLAKRLEE YDVWWLEDPV PPENHDVQQE VTQSTETPIT VGENVYRKHG QRRLLEEQAV DIIAPDMPKV GGMRETRKIA DLADMYYVPV AMHNVSSPVA TVASAHVGAA IPNSLAVEYH SYELGWWEDL VEEDVIEDGY IEIPEEPGLG VTLDMDAVEE HMIEGEELFD AA
|
| |