Gene Htur_0498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_0498 
Symbol 
ID8741079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp523792 
End bp525327 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content66% 
IMG OID646511076 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003402069 
Protein GI284163790 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGC AGACGACAGC TCAGCCTGAC CAGCACTACA TCAACGGCGA GTGGACCGAC 
GGGGAGGGCG AGGAGACCTT CGAAAGCGAG AACCCGGCGA CCGGCGAGAC CCTGCGGACC
TTCCGCCGCG GGACCGCAGC CGACGTCGAC GCGGCCCTCG AGGCTGCGGA AGAAGCACAG
GACGAGTGGC GCGAACTCTC CCACATCGAC CGCGCTGAGT ACCTCTGGGA CATCTACCAC
GAGCTCCGCC AGCGGACGGA CGAGCTCGGC GAGATCGTCA CCAAGGAGTG CGGCAAGGAG
ATCTCGGAAG GAAAGGCCGA CGTCGTCGAG GCCGCTCACA TGGTCGAGTG GGCCGCGGGC
AACGCCCGTC ACCCCCACGG CGACGTCGTC CCGAGCGAGA TCGGGAGCAA GGACGCCTAC
ATGCGCCGCA AGCCCCGCGG GGTCATCGGC TGTATCACGC CGTGGAACTT CCCGGTCGCG
ATCCCGTTCT GGCATATGGC CGTCTCGCTG GTCGAGGGTA ACACGGTCGT CTGGAAGCCC
GCCGAACAGA CGCCGTGGTG CGCTCAGATC GTCGCCGAGA TGTTCGAGGA CAGCGGCATC
CCGGACGGCG TCTTCAACAT GATCCAAGGC TTCGGCGACG CCGGTGAGGC TATCGTCGAA
GACGAGCGCG TCGACACCGT GCTGTTCACC GGCTCGGCGG AGGTCGGCCA AGAGGTCGCC
CGCAGCGTCG CCGAACAGCC CGGGAAGCTC GCGGCCTGCG AGATGGGCGG CAAGAACGCG
GTCGTCATCA CCGACGAGGC CGACCTCGAC ACGGCCGTCC ACTCGGCGGT CATGTCCTCG
TTCAAGACGA CCGGCCAGCG CTGCGTCTCG GCCGAGCGCC TGATCGTCCA CGAGGACGTC
TACGACGAGT TCAAGGAACG GTTCGTCGAC GTCGCCGAGA ACGTCGCCGT CGGCGACCCG
CTGGCCGAAG ACACCTTCAT GGGCCCGCTC GTCGAGGGCG AGCACAAGGA GAAGGTCCTC
GAGTACAACG AACTCGCTCG CGAGGAGGAC GTCGACGTGC TCGTCGACCG CGACGAACTG
GGCGACGACG AGATTCCGGA CGGCCACGAG GACGGTCACT GGGTCGGTCC GTTCGTCTAT
GAGGCCGATC ACGAGGCCGA CCTCCGCTGT ACGCAGGAGG AGGTCTTCGG TCCCCACGTC
GCACTCATGG AATACTCCGG CGACATCGAG GACGCCGTCG AGGTCCACAA CGACACGGAG
TACGGCCTCG CAGGAGCGAT CATCTCCGAG GACTACCGCG ACATCAACTA CTACCGCGAC
AACGCCGAGG TCGGACTCGC GTACGGGAAT CTGCCGTGTA TCGGCGCCGA GGTTCACCTG
CCCTTCGGTG GCGTCAAGAA GTCCGGTAAC GGCTACCCGA GCGGTCGCGA AGTGATCGAG
GCCGTCACCG AGCGCACCGC CTGGACGCTG AACAACTCGA AGGACATCGA GATGGCCCAA
GGGCTGTCCG CCGACATCAC GACCGACGAT GACTGA
 
Protein sequence
MSQQTTAQPD QHYINGEWTD GEGEETFESE NPATGETLRT FRRGTAADVD AALEAAEEAQ 
DEWRELSHID RAEYLWDIYH ELRQRTDELG EIVTKECGKE ISEGKADVVE AAHMVEWAAG
NARHPHGDVV PSEIGSKDAY MRRKPRGVIG CITPWNFPVA IPFWHMAVSL VEGNTVVWKP
AEQTPWCAQI VAEMFEDSGI PDGVFNMIQG FGDAGEAIVE DERVDTVLFT GSAEVGQEVA
RSVAEQPGKL AACEMGGKNA VVITDEADLD TAVHSAVMSS FKTTGQRCVS AERLIVHEDV
YDEFKERFVD VAENVAVGDP LAEDTFMGPL VEGEHKEKVL EYNELAREED VDVLVDRDEL
GDDEIPDGHE DGHWVGPFVY EADHEADLRC TQEEVFGPHV ALMEYSGDIE DAVEVHNDTE
YGLAGAIISE DYRDINYYRD NAEVGLAYGN LPCIGAEVHL PFGGVKKSGN GYPSGREVIE
AVTERTAWTL NNSKDIEMAQ GLSADITTDD D