Gene Information |
Locus tag | Htur_4480 |
Symbol | |
ID | 8745109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 70905 |
End bp | 72134 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515017 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003405964 |
Protein GI | 284172582 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
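The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 70905
END_BP = 72134


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")                   # 1230 bp
    print(protein_length(n_bp), "aa")   # 409 aa
```

Both results agree with the Gene Length and Protein Length fields of this record.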
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA GACAGCTATC CGATCCAAAC GCAGAGTATA CGATGCGAGA CCTCTCCGCG GAGACGATGG AGATTTCCAA CTCCCGCGGG CCGCGCGACG TCGAGATTAC GGACGTTCAG ACGACCATCG TCGACGGCAA CTACCCGTGG ACGCTGGTCC GCGTCTACAC CGACGCTGGA CTCGTCGGCA ACGGTGAATC CTACTGGGGC GCAGGCGAGC GAGAGATCAT CGAGCGTATG GCGCCGTTCC TCGAGGGCGA GAACCCGCTC GACATCGACC GTCTCTACGA GCACCTCGTT CAGAAGCTCT CCGGCGAGGG ATCGATCTCC GGCAAGGCGA TCTCCGCGAT CTCCGGCATC GAGCTCGCGC TCCACGACGT CGCCGGCAAG ATCCTCGAGG TGCCGGCCTA CCAGCTGCTC GGTGGCAAGT ACCGCGACGA GATGCGGATG TACTGTGACT GCCACACCGA GGAGGAAGCC GACCCCGACG CCTGCGCCGA CGAGGCCGAA CGCGTCGTCG AGGACCTGGG GTACGACTCC CTGAAGTTCG ACCTCGACGT TCCCTCGGGC CACGAGAAGG ACCGCGCGAA CCGTCACCTT CGGAACCCGG AGATCGAACA CAAAGCCGAG ATCGTCGAGA AGGTCACCGA GCGCGTCGGC GACCGCGCCG ACGTCGCCTT CGACTGCCAC TGGTCGTTCA GCGCCGGCAG CGCACACCGC CTCGCCGAGC GCTTAGAGGA GTACGACGTC TGGTGGCTCG AAGACCCGAT CCCGCCGGAG AACCACGACG TCCAGGAGGA GGTCACCAAG CGGACCTCGA CGCCGATCAC GGTCGGCGAG AACGTCTACC GCAACCACGG CAACCGTCGC CTGCTCGAGA ACCAGGCCGT CGACATTGTC GCGCCTGACG TCCCCCGCGT CGGCGGCATG CGCCAGACGA GGAAGATCGC CGACCTCGCG GACATGTACT ACGTGCCGGT CGCGATGCAC AACGTCTCCT CGCCGATCGG GACGATGGCG AGCATCCACG TCGGCGCGGC GATCCCGAAC TCGCTGGCCG TCGAATACCA CTCCTACGAA CTCGGCTGGT GGTCGGATCT CGTCGAGGAG GACATCATCG AGAACGGCTA CGCCGAAGTG CCGGAGAAGC CGGGCCTCGG CCTCACGCTC GACCTCGACG CCGTTGAGGA GCATCTGGCT GACGGCGCCG AGATGTTTGA CGAGGCATAA
|
Protein sequence | MDYRQLSDPN AEYTMRDLSA ETMEISNSRG PRDVEITDVQ TTIVDGNYPW TLVRVYTDAG LVGNGESYWG AGEREIIERM APFLEGENPL DIDRLYEHLV QKLSGEGSIS GKAISAISGI ELALHDVAGK ILEVPAYQLL GGKYRDEMRM YCDCHTEEEA DPDACADEAE RVVEDLGYDS LKFDLDVPSG HEKDRANRHL RNPEIEHKAE IVEKVTERVG DRADVAFDCH WSFSAGSAHR LAERLEEYDV WWLEDPIPPE NHDVQEEVTK RTSTPITVGE NVYRNHGNRR LLENQAVDIV APDVPRVGGM RQTRKIADLA DMYYVPVAMH NVSSPIGTMA SIHVGAAIPN SLAVEYHSYE LGWWSDLVEE DIIENGYAEV PEKPGLGLTL DLDAVEEHLA DGAEMFDEA