Gene Htur_4733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4733 
Symbol 
ID8745325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp336539 
End bp338059 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content63% 
IMG OID646515233 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003406180 
Protein GI284172798 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.29898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCAG AGACTGTTAG CGGCCAGTTA CGGGAGAATC ACGCGGAAGC CAGAGACCGG 
GTCGAGGATA TCAGCACGTT CGACTCGTGG ATAGACGGAT CTCGATACGA GACCGACGAG
GTCGTCGAGA CCACCGATCC CGTCGTCGGT GAACCGATCA CCGCGATTCC GCGCTGCGGC
TCGAGTGAGG TCGATACGGC AGTCGACGCC GCGTGGAGAG CGTTCGACGA GGAGTGGTCG
GAGACGACCC CTGCCGAACG ATCTCGGCGC ATGTTCGAAT GGATCGACGT GCTCCGCGAT
CACGTCGACG AGCTGGCGTT CCTGGAGTGC GTAGACACCG GGAAGCCGAT GTCGCAGGCT
CGAGGAGAGG TCGAAGGGGC GATCGGAACC TTAGAATACT ACGCGTCGAT CTGTCAGGGC
CAGAGTCAGG ACGGGAGACA GGTGTCTACG TCCGAGGATC TGCACCTCTA CACGCGGAAG
GAGCCTTACG GCGTGGTCGG ACAGATCACG CCGTGGAACT TCCCGGTCTG GGCGGCCGCG
TGGAAGCTCG GGCCCGCGCT CGGGACGGGG AACGCGACCG TGCTGAAGCC GTCGGCCGAG
GCCCCGCTGA CGACGATCCG CATCGCCGAA CTGTCCGAGG GTATCTTCCC GGACGGCGTG
CTTAACGTCG TAACCGGGAC CGGATCAGAG GCCGGCGGTG CGCTGACTGA ACACGATCGG
GTCCGCAAAA TCTCGTTCAC CGGCAGCGTC GGCGTCGGAC AGCGAGTGAT GCAGGCGGCC
GCCGAAAACG TTGCCCCAGT TACGTTGGAG CTCGGCGGGA AGTCGCCGTT TATCGTGTTT
CCCGACGCCG ACTTGGAGAA GGCGGTTTCC GCCGTCGCCG ACGGTATCTT TTACAGCACG
GGGGAGATCT GTGACGCGTT CTCTCGAGCG ATCGTTCACG AGAGCGTTCA CGAGGAGTTC
GTCGACCGGT TCGTCGAGAA GGCCGAGTCC TACACGCTCG GGGATCCGCT CGACGAGGAG
ACGACGATGG GACCGCTCAC TACCGAGTCC CAGTACGAGA CGGTCACGGA ATACATCGAC
GTTGGTGAGA GTGAAGGCGC GACCCTGCTT ACGGGCGGTG GACCGCCGGA CGATTCCGAT
CTCCGGGACG GCTGGTTCGT CAAACCGACG GTGTTCGACG ACGTGGAGAA CGATATGCGC
ATCGCTCGGG AGGAAATCTT CGGGCCCGTA CAGACGATCA ACACGTTCTC CAGCTACGAC
GAGGCGATCG AACTCGCGAA CGACACCGAG TTCGGGCTCG CAGCGGGCAT CGCGACCGAG
CGGACGTCCG TCGTCCACAA CGCGGCGGCG GACATCGAAG CGGGGCTCGT GTACGTCAAC
GAGTACGGTC CGATCCTGCC GCAGGCTCCG TACGGCGGCT TCAAGGAGTC GGGTATCGGA
AAAGATCTGG GCACGGAGGT GCTCGACCAC TACCAGCAGA CGAAATCGGT CTACGTCAAT
CTCGATGAGC CGGAACTCTG A
 
Protein sequence
MASETVSGQL RENHAEARDR VEDISTFDSW IDGSRYETDE VVETTDPVVG EPITAIPRCG 
SSEVDTAVDA AWRAFDEEWS ETTPAERSRR MFEWIDVLRD HVDELAFLEC VDTGKPMSQA
RGEVEGAIGT LEYYASICQG QSQDGRQVST SEDLHLYTRK EPYGVVGQIT PWNFPVWAAA
WKLGPALGTG NATVLKPSAE APLTTIRIAE LSEGIFPDGV LNVVTGTGSE AGGALTEHDR
VRKISFTGSV GVGQRVMQAA AENVAPVTLE LGGKSPFIVF PDADLEKAVS AVADGIFYST
GEICDAFSRA IVHESVHEEF VDRFVEKAES YTLGDPLDEE TTMGPLTTES QYETVTEYID
VGESEGATLL TGGGPPDDSD LRDGWFVKPT VFDDVENDMR IAREEIFGPV QTINTFSSYD
EAIELANDTE FGLAAGIATE RTSVVHNAAA DIEAGLVYVN EYGPILPQAP YGGFKESGIG
KDLGTEVLDH YQQTKSVYVN LDEPEL