Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4441 |
Symbol | |
ID | 8745070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 22541 |
End bp | 23716 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646514978 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003405925 |
Protein GI | 284172543 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0607658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGTTA CAGACTACGA GCTCTACGCG GTGCCGCCGC GCTGGCAGTT CCTCAGACTC GAGACGAGCG ACGGGCGCGT CGGCTGGGGC GAGGTCTACA CCAAGTGGCA CTTCGCGGGC GACAGCGAAC CGGCGACCCG GAGCGCGGTC GATCAGCTGA TGCACCAGTA CGTCCTCGGC GAGGACCCGA GTCGCATCGA GTACCTCTGG CAGGCGATGT ACCGCAGCAG CTTCTACCGC GGCGGACCGG TCCACATGAG CGCCATCGCC GGCATCGACG AGGCGCTGTG GGACCTGAAG GGGAAGGCGG CCGGGATGCC GGTCTACGAA CTGCTCGGCG GTCCTGCACG CGACCGCGTC CGACTCTACC AGCACGTCAG GGCTCACGGC GCCGACGACG TGGCGGATCC GGCGGCCGCG GCCGCCGACG AGGCGCGCGA GCACGTCGAA GCGGGCTACA CCGCCGTGAA GCTGGTTCCG ACGGGCGGAC TCGAGATCAT CGATGCGCCG GCAGCCGTCG AAGAAGCGCG CGAAATCGTC GGCGCGGTCC GCGACGCCGT CGGCCCCGAG GTCGACGTCG CGCTGGATTT CCACGGCCGC GCCTCGAAGG CGATGGCCCG CCGACTGGCG ACGGCGCTCG AGGAGTTCCA GCCGATGTTC GTCGAGGAGC CGGTTACCCC CGAGCACGAC CACGCGCTGC CCCGGATCGC CGAGGGGACG ACGATTCCGA TCGCGACGGG CGAGCGCCTC TACTCTCGGA GCGAGTTCCG GCCGATCCTC GAGGCCGACG CGGTCGACGT CGTCCAGCCG GACGTCTCGA GCGCCGGGGG GATCACTGAG ACGAAGAAAA TCGCCGACAT GGCCGAGACG TACGACGCCT CGATCGCGCC CCACTGCCCC ATCGGCCCGC TGGCGCTGGC GGCCTCGCTA CACGTCGACG CGGCCGCGCC GAACGCGCTG GTACAGGAGC AAGTGGTCGT CGACGACGAA GACGCGATGC GGTACGTCGA AAACGACGAG ATCTTCGAAC CGGCCGACGG CTATCTGGAC CTGCCTGACG GACCGGGGCT CGGAATCGAG ATCGACGAGA ATCGCGTCCG CGAACTCGCG GGAACGGACC TCGGCTTCGA CCGCTCGCCG GGCCACCGCG CCGACGGCAG CGTCGGCGAG CGGTGA
|
Protein sequence | MHVTDYELYA VPPRWQFLRL ETSDGRVGWG EVYTKWHFAG DSEPATRSAV DQLMHQYVLG EDPSRIEYLW QAMYRSSFYR GGPVHMSAIA GIDEALWDLK GKAAGMPVYE LLGGPARDRV RLYQHVRAHG ADDVADPAAA AADEAREHVE AGYTAVKLVP TGGLEIIDAP AAVEEAREIV GAVRDAVGPE VDVALDFHGR ASKAMARRLA TALEEFQPMF VEEPVTPEHD HALPRIAEGT TIPIATGERL YSRSEFRPIL EADAVDVVQP DVSSAGGITE TKKIADMAET YDASIAPHCP IGPLALAASL HVDAAAPNAL VQEQVVVDDE DAMRYVENDE IFEPADGYLD LPDGPGLGIE IDENRVRELA GTDLGFDRSP GHRADGSVGE R
|
| |