Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3894 |
Symbol | |
ID | 8744522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 131013 |
End bp | 132167 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646514478 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003405425 |
Protein GI | 284167147 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.519515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAA TCGTCGACTA CGAACTGTTC GAGGTTCCGC CGCGGTGGCT ATTCCTGCGT CTCGAAACGA GCGACGGAAC GGTCGGCTGG GGCGAACCGG TCGTGGAGGG GCGCGCGAAG ACCGTTCGGA CCGCAGTCGA GGAGCTTCTC GACAACTACC TACTGGGGGA GGATCCGGAC CGAATCGAGG ACCACTGGCA GGCGATGTAT CGCGGCGGCT TCTATCGCGG CGGCCCGGTG CTGATGAGCG CTATCGCGGG GATCGACCAG GCCCTGTGGG ACATCAAGGG TAAGCGACTG GGCGTTCCGG TCCACGAACT GCTCGGCGGT GCGACCCGGG ACCGGATCCG GGTCTACCAG TGGATCGGCG GCGACCGCCC CTCCGACGTG GCCGAACAGG CTCGAGCGCA GGTCGAGGCG GGCTTTACGG CCCTCAAGAT GAACGCGACG GAGGAGATCG AGCGTGTCGA CGATCCCGCG ACGATCGAGG CCGCGGTCAC GCGGCTCCGC GAGGTCCGCG AGGCCGTCGG CGACGAGGTC GATATCGGCG TCGACTTCCA CGGTCGCGTC TCGAAGCCGA TGGCGAAACG ACTCGTCGAA AAGCTCGAAC CCCACGAGCC GATGTTCGTC GAGGAGCCGG TCCTGCCCGA GCACAACGAC GCGCTGCCGG AGATCGCGTC CCACACCACG ATCCCGATCG CCACCGGCGA GCGGATGTTC TCCCGGTGGG ACTTCAAGCA GGTGTTCGAG AACGGGGCCG TCGACGTGAT TCAGCCCGAT CTCAGCCACG CCGGCGGGAT CACGGAGGTC AAGAAGATCG CCGCGATGGC CGAAGCGTAC GACGTGGCGA TGGCCCCCCA CTGCCCGCTC GGGCCGATCG CGCTCGCCTC CTGTCTCCAG GTGGACGCGT GCTCGCACAA CGCGTTCATT CAGGAACAGA GCCTGAACAT CCACTACAAC GAGACCAGCG ACGTGCTCGA GTACCTGGCC GATCCGTCCG TCTTCGATTA CGACGACGGC TACGTCCAGA TCCCGGACGA GCCGGGACTC GGCATCGAGA TCGACGAGGA CCACGTCCGC GCCCAGGCCG ACGTCGGCCA CGACTGGCAC AACCCCGTCT GGCGACACGA CGACGGCAGC GTCGCCGAGT GGTAA
|
Protein sequence | MTEIVDYELF EVPPRWLFLR LETSDGTVGW GEPVVEGRAK TVRTAVEELL DNYLLGEDPD RIEDHWQAMY RGGFYRGGPV LMSAIAGIDQ ALWDIKGKRL GVPVHELLGG ATRDRIRVYQ WIGGDRPSDV AEQARAQVEA GFTALKMNAT EEIERVDDPA TIEAAVTRLR EVREAVGDEV DIGVDFHGRV SKPMAKRLVE KLEPHEPMFV EEPVLPEHND ALPEIASHTT IPIATGERMF SRWDFKQVFE NGAVDVIQPD LSHAGGITEV KKIAAMAEAY DVAMAPHCPL GPIALASCLQ VDACSHNAFI QEQSLNIHYN ETSDVLEYLA DPSVFDYDDG YVQIPDEPGL GIEIDEDHVR AQADVGHDWH NPVWRHDDGS VAEW
|
| |