Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4733 |
Symbol | |
ID | 8745325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 336539 |
End bp | 338059 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646515233 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003406180 |
Protein GI | 284172798 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.29898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCAG AGACTGTTAG CGGCCAGTTA CGGGAGAATC ACGCGGAAGC CAGAGACCGG GTCGAGGATA TCAGCACGTT CGACTCGTGG ATAGACGGAT CTCGATACGA GACCGACGAG GTCGTCGAGA CCACCGATCC CGTCGTCGGT GAACCGATCA CCGCGATTCC GCGCTGCGGC TCGAGTGAGG TCGATACGGC AGTCGACGCC GCGTGGAGAG CGTTCGACGA GGAGTGGTCG GAGACGACCC CTGCCGAACG ATCTCGGCGC ATGTTCGAAT GGATCGACGT GCTCCGCGAT CACGTCGACG AGCTGGCGTT CCTGGAGTGC GTAGACACCG GGAAGCCGAT GTCGCAGGCT CGAGGAGAGG TCGAAGGGGC GATCGGAACC TTAGAATACT ACGCGTCGAT CTGTCAGGGC CAGAGTCAGG ACGGGAGACA GGTGTCTACG TCCGAGGATC TGCACCTCTA CACGCGGAAG GAGCCTTACG GCGTGGTCGG ACAGATCACG CCGTGGAACT TCCCGGTCTG GGCGGCCGCG TGGAAGCTCG GGCCCGCGCT CGGGACGGGG AACGCGACCG TGCTGAAGCC GTCGGCCGAG GCCCCGCTGA CGACGATCCG CATCGCCGAA CTGTCCGAGG GTATCTTCCC GGACGGCGTG CTTAACGTCG TAACCGGGAC CGGATCAGAG GCCGGCGGTG CGCTGACTGA ACACGATCGG GTCCGCAAAA TCTCGTTCAC CGGCAGCGTC GGCGTCGGAC AGCGAGTGAT GCAGGCGGCC GCCGAAAACG TTGCCCCAGT TACGTTGGAG CTCGGCGGGA AGTCGCCGTT TATCGTGTTT CCCGACGCCG ACTTGGAGAA GGCGGTTTCC GCCGTCGCCG ACGGTATCTT TTACAGCACG GGGGAGATCT GTGACGCGTT CTCTCGAGCG ATCGTTCACG AGAGCGTTCA CGAGGAGTTC GTCGACCGGT TCGTCGAGAA GGCCGAGTCC TACACGCTCG GGGATCCGCT CGACGAGGAG ACGACGATGG GACCGCTCAC TACCGAGTCC CAGTACGAGA CGGTCACGGA ATACATCGAC GTTGGTGAGA GTGAAGGCGC GACCCTGCTT ACGGGCGGTG GACCGCCGGA CGATTCCGAT CTCCGGGACG GCTGGTTCGT CAAACCGACG GTGTTCGACG ACGTGGAGAA CGATATGCGC ATCGCTCGGG AGGAAATCTT CGGGCCCGTA CAGACGATCA ACACGTTCTC CAGCTACGAC GAGGCGATCG AACTCGCGAA CGACACCGAG TTCGGGCTCG CAGCGGGCAT CGCGACCGAG CGGACGTCCG TCGTCCACAA CGCGGCGGCG GACATCGAAG CGGGGCTCGT GTACGTCAAC GAGTACGGTC CGATCCTGCC GCAGGCTCCG TACGGCGGCT TCAAGGAGTC GGGTATCGGA AAAGATCTGG GCACGGAGGT GCTCGACCAC TACCAGCAGA CGAAATCGGT CTACGTCAAT CTCGATGAGC CGGAACTCTG A
|
Protein sequence | MASETVSGQL RENHAEARDR VEDISTFDSW IDGSRYETDE VVETTDPVVG EPITAIPRCG SSEVDTAVDA AWRAFDEEWS ETTPAERSRR MFEWIDVLRD HVDELAFLEC VDTGKPMSQA RGEVEGAIGT LEYYASICQG QSQDGRQVST SEDLHLYTRK EPYGVVGQIT PWNFPVWAAA WKLGPALGTG NATVLKPSAE APLTTIRIAE LSEGIFPDGV LNVVTGTGSE AGGALTEHDR VRKISFTGSV GVGQRVMQAA AENVAPVTLE LGGKSPFIVF PDADLEKAVS AVADGIFYST GEICDAFSRA IVHESVHEEF VDRFVEKAES YTLGDPLDEE TTMGPLTTES QYETVTEYID VGESEGATLL TGGGPPDDSD LRDGWFVKPT VFDDVENDMR IAREEIFGPV QTINTFSSYD EAIELANDTE FGLAAGIATE RTSVVHNAAA DIEAGLVYVN EYGPILPQAP YGGFKESGIG KDLGTEVLDH YQQTKSVYVN LDEPEL
|
| |