Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4390 |
Symbol | |
ID | 8745018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 658672 |
End bp | 660165 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646514928 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_003405875 |
Protein GI | 284167597 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.325404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAGC AAAGAGAGCG AGCCGTTTCA CCAGCCGTTC GTGACCACGT CGACAACTAC GTCGGCGGCG AGTGGACCAC GCCCGCAAGC GAGACGTCCC AGCCCGTCGT CGATCCCGCG ACGGGGGACG AACTCGCGAC GGTCGAGTTC TCGACCGTCG AGACCGTCGA CGACGCAGTA CAGACCGCAA ACGAGGCCTT CGAAGAGTGG CGACAGACAC CCGTCGCCGA GCGCGTCCAG TACCTCTTCG ACCTGAAGAC CGAACTCGAG GACCGAATCG ACGAGATCGC GACCGCGCTC TCCCGCGAGC ACGGGAAGAC CGTCGCAGAG GCGCGCGGAG AGATCAAGCG CGGCATCGAG AACGTCGAAG TGGCCTGCGG AATGCCCAAC ATGATGCGCG AGGGCAGCGG CAACGTTGAG GACGTCGCGT CCGGGATGGA CGAACACGCC GTCCGGCAGC CCCTTGGCGT CTTCACTGCC ATCACGCCGT TTAACTTCCC GGCGATGATC CCGCTGTGGT TCCTGCCGTA CGCGGTCGCG TCCGGGAACA CGTTCATTCT CAAGCCCAGC GAGAAGGTCC CGCTGAGCTC CCAGCTCATC TTCGAGGCCG TCGACGCGGT CGACTTTCCC GACGGCGTCG TCAACCTCGT CAACGGTGGG GCCGAGACCG TCAACACGCT GCTCGAGCAC GATGACATCG AGGGCGTCTC GTTCGTCGGC AGTTCGCCCG TCGCCAAGCA CGTCTACGAG ACCGCCGCCG AACACGGCAA GCGCGTGCAG GCCCAGGGCG GGGCGAAGAA CTACGCCGTC GTCAGCGAGA ACGCGGAAGT CGAGGAATCG GTTCCGAACA TCATCGGCTC TGTCTACGGC AACGCCGGTC AGCGCTGTCT CGCCAACGAC GTCGTCGTCG GCGTCGGCGA CGTCTACGAC AACCTCCGCG AGCAGCTGGT CGACGCCGCG GAGAACCTCA CCGTCGGCGC CGGCGTCGAC GAGGAGACCG AAGTCGGCCC CCTGATCACC GACGACTCGC GCGAGCGCGT GCTCGGCCTG ATCGAGAACG CGCTCGAGGA AGGCGCCGAA CTCGTCCTCG ACGGCCGGGA CTTCGAACAT CCCGAACTGG ACGGTAACTT CCTCGGCCCG ACGCTCCTCG AGGGCGTCAC GACCGACATG GAGATCGCCC AAGAGGAGAT CTTCGGGCCG GTGCTCTGTC TCGCCGAGGC CGAGGACTTA GACGAGGCCA TCGAGATGGT CAACTCGACG AAGTACGGCA ACGCCTCCTC GCTCTACACC GAGTCCGGCA GCGAGGCCCG CCAGTACCGG TACGAGGTCG ACGCGGGGAA CATCGGGATC AACGTCGGCG TCTGCGCCCC GATGGGCTTC TTCCACTTCG GCGGCCGGAA GGCGTCGTTC TTTGGCGACC TCCACGCTCA GGGCGAGGAC GCGGTCAACT TCTACACCGA GAAGACCATC CAGATCGAAC GCTGGTACAG CTAA
|
Protein sequence | MSEQRERAVS PAVRDHVDNY VGGEWTTPAS ETSQPVVDPA TGDELATVEF STVETVDDAV QTANEAFEEW RQTPVAERVQ YLFDLKTELE DRIDEIATAL SREHGKTVAE ARGEIKRGIE NVEVACGMPN MMREGSGNVE DVASGMDEHA VRQPLGVFTA ITPFNFPAMI PLWFLPYAVA SGNTFILKPS EKVPLSSQLI FEAVDAVDFP DGVVNLVNGG AETVNTLLEH DDIEGVSFVG SSPVAKHVYE TAAEHGKRVQ AQGGAKNYAV VSENAEVEES VPNIIGSVYG NAGQRCLAND VVVGVGDVYD NLREQLVDAA ENLTVGAGVD EETEVGPLIT DDSRERVLGL IENALEEGAE LVLDGRDFEH PELDGNFLGP TLLEGVTTDM EIAQEEIFGP VLCLAEAEDL DEAIEMVNST KYGNASSLYT ESGSEARQYR YEVDAGNIGI NVGVCAPMGF FHFGGRKASF FGDLHAQGED AVNFYTEKTI QIERWYS
|
| |