Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3822 |
Symbol | |
ID | 8744450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 45737 |
End bp | 47293 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646514408 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003405355 |
Protein GI | 284167077 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.959646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCCG AGAACTACCG AGGAGAACCG GACGCCGTGA GCGAAGACGT ACTCGAGCGC CACCGCGAGG CGGCGGCGCC GATTCAAGAG CGCTACGACC TCTACATCGG CGGCGAGTGG AGTGAGAGCG ACGGTGGAGA CGTCGTGGAG TCGTTAGACG CCATCACCGG TGAGACGCTC GCGGAGTACC AGCAGGGCAC CGCGGCGGAC GTCGACAGGG CCGTCGACGC GGCCGAAGAA GCGTATCGGA GCGAGTGGGG CGAACTCTCG AGTTCGGAGC GCGCGGAGTA CCTCGAGGAG CTCGCGGACG CGATCGAAGA GAAAGCCGAA CTGCTGTCGA CTATCGATAC GCTGGAGACC GGATTTCCGA TACTGACGAC GCCGGCGCTC GCTCAGCAGA CCGCCGGCCA GTTGCGGTAC TTCGCGTCGG TAGCGCGGTC GACGGACAAG GGCGCCGTCC CGTCGACCGG CGAGATGGAG AAACACCACC ACATCTACAC GCAGGAGGAA CCCTACGGCG TCGTCGGCCT CATCACGCCG TGGAACGGCC CGCTAGGGAA CCTCGGCGTG AAGCTGGCGC CCGCGCTCGC GGCCGGGAAC ACCGTCGTCT ACAAGCCCTC GCCGCGCGGG GTGGTCTCGT CGTTCGAGGC GATGGACATC ATCGACGACG TCCTGCCCGA CGGCACGGTG AACATGGTTC CGGGCACCGG GCCCGAGGTC GGGGAAGCGA TCTCTTCCCA CTCGCGGATT CGGAAGGTAT CGCTCACCGG GTCGACGGCC GCCGGCCAGT CGGTGATGAA GAGCGCGGCG TCGAACATCA AGGCCATCTC GCTGGAACTC GGCGGGAAAT CCCCGAGCAT CGTCTTCCCC GACGCGGACC TCGAGACGAC CGCGCAGGGC GTCGCCATGG GTATCTACGG CTTCAACGGG CAGGTCTGTA CGGCCGGGTC GAGACTGTTC CTCCACGAGG ACATCTACGA CGAGTTCCTC GAGACCCTGT CGGCGACGGC CGAACAGATG TTCCAGATGG GCGATCCGCT CGACGACGGG ACGATGCTCG GTCCCCTGAT CGATCACAAG CACCTCGACC GGGTGCAGTC GTACGTCGAC GAGGCCGTCG AGGACGGAGC GACGTTGTAC ATGGGCGGCG AGAGTCAGGA TATCGAGGGG CTCGGCGGCG CGCCGTTCTT CGAGCCGACC ATCCTCACGG ACGTCGACAA CGACGATACG GTGGCCTGCG AAGAGGTGTT CGGCCCCGTC CTGACCGTTC TGAAGTGGAG CGACCGCGAG GAGATGCTCG AGTTGGCTAA CGACACCGAG TACGGGCTCG CCTCCGGAAT CTGGACCCAG GACCTACAGA CCGCCCACAC CGTGAGCGAC GAACTGGAAG CCGGCGTCGT CTGGGTCAAC TGCTACAACG CGTTCCAGAC GGGCGTTCCC CACGGCGGCT ACAAACAGAG CGGCGCCGGC CGCGAGATGA ACCGACAGGC ATACCACGAG TACCGCCAGA CGAAGACGGT CAACATCAAC CTCTCCGATA CTTGGCCGCG CATGTGA
|
Protein sequence | MQSENYRGEP DAVSEDVLER HREAAAPIQE RYDLYIGGEW SESDGGDVVE SLDAITGETL AEYQQGTAAD VDRAVDAAEE AYRSEWGELS SSERAEYLEE LADAIEEKAE LLSTIDTLET GFPILTTPAL AQQTAGQLRY FASVARSTDK GAVPSTGEME KHHHIYTQEE PYGVVGLITP WNGPLGNLGV KLAPALAAGN TVVYKPSPRG VVSSFEAMDI IDDVLPDGTV NMVPGTGPEV GEAISSHSRI RKVSLTGSTA AGQSVMKSAA SNIKAISLEL GGKSPSIVFP DADLETTAQG VAMGIYGFNG QVCTAGSRLF LHEDIYDEFL ETLSATAEQM FQMGDPLDDG TMLGPLIDHK HLDRVQSYVD EAVEDGATLY MGGESQDIEG LGGAPFFEPT ILTDVDNDDT VACEEVFGPV LTVLKWSDRE EMLELANDTE YGLASGIWTQ DLQTAHTVSD ELEAGVVWVN CYNAFQTGVP HGGYKQSGAG REMNRQAYHE YRQTKTVNIN LSDTWPRM
|
| |