Gene Htur_4012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4012 
Symbol 
ID8744640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp266611 
End bp268137 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content56% 
IMG OID646514586 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003405533 
Protein GI284167255 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACTG AATCCCAGTC AGCCACTGAG CGGAAGAGTG ATATCGAACA ACGTCATCAA 
GAGACCGCAG CGGACGTCGT TCCACCGAAT CTGAAACTTT ATATCGGTGG CGAGTGGACG
ACAAGTTCTT CGGGAGAAAC ATTCGAAACC CGAGATCCAA CAACCGGCGA CTCACTTGCA
ACAGTCCAAG CAGGAAACGA CAAGGACATC GATCGCGCAG TCGAGGCGGC ATGGACTGCT
TACGACGACA CTTGGTCGAA CTACTCGGCG GCGGATCGCC AGCGTGTTCT CGAGGAGATC
GCCGACAGAG TCGAGCAAAG CAAGGAGGAA TTCGCGCTCC TCGAAACGCT CGACAATGGG
AAACCGATCA GTGAGTCCAG AGTCGATATG GAGCTGGTCG CTGATCACTT CCGCTATTTC
GCTGGCGCAA CCCGTGTCAA CGGCGGGGAC ACCATCCCAA GTGGTGGTGA GAGCCAGCAC
GTCCAAACGA TCTCCGAACC GTACGGTGTC GTTGGCCAGA TCACACCGTG GAACTTCCCA
CTGTTGATGG CAGCGTGGAA ACTCGGCCCA GCGCTCGCTG CAGGCAATTG TTCGGTGCTC
AAGCCGGCCG AACAGACACC GTTGACAATC CTCAAGCTGA TGGACGAAGT CGACGACGTA
CTTCCCGATG GGGTCGTGAA CGTCGTTACC GGCTTCGGAC CTGAAGCTGG CGAACCACTG
GCGAAACATC CAGATATTCG AAAACTTGCC TTCACCGGAT CAACCGAGAT CGGTAAGCAA
GTAATGGCAC AGGCTGCCGA GAACGTCCAC GACATCACGC TCGAGCTTGG TGGAAAAAGC
CCGCTGATCA TCTACCCTGA CGCGGATCTT GAAAAGGCAG TCAACACGAC GATAACAGCC
ATCTTCTACA ACACGGGTGA GTGCTGTTCC GCGGGATCAC GGCTGTTCAT CCACAGCAAT
ATCAAAGAAG AGTTCCTCGA CGCTCTGGCG TCCACTGCTG AGGACCTCGT GATTGACGAT
CCCCTTCTCG AAGAAACGAC TCTTGGGCCG AAGGTGACCG AAGAACAGGC CCAGAATACG
CTCGAGTACA TCCAGGAAGC TCGTGACGCT GGTGCCGACT TCATTACCGG CGGTGACGTA
CCCGACGACG ACGCTCTCGA AGAGGGAAGC TTCGTCTCGC CGACGCTGAT CGACAATATT
GATCACAACA ACCGTGCCGT CCAAGAGGAA ATCTTCGGCC CGGTTCAAGA AGTCTTCGAG
TGGACCGACT ACGAGAAGAT GATCAAACTG GCGAACGACG TCGACTATGG GCTCGCAGCG
GGTATCCTCA CAAACGACCT GACGAAAGCT TATCAGACAG CGAAAGATAT CGAAGCTGGG
ACGATCTGGG TGAACCAGTA CAACTCCTTC CCAGCTGGAC AGCCCTTCGG CGGCTACAAA
GAGTCCGGTA TCGGCCGTGA AATCGGATAC GAGGCACTCG CCGACCACTA CACACAAACG
AAAACCATCA ACATCGGTCT GCAGTAG
 
Protein sequence
MSTESQSATE RKSDIEQRHQ ETAADVVPPN LKLYIGGEWT TSSSGETFET RDPTTGDSLA 
TVQAGNDKDI DRAVEAAWTA YDDTWSNYSA ADRQRVLEEI ADRVEQSKEE FALLETLDNG
KPISESRVDM ELVADHFRYF AGATRVNGGD TIPSGGESQH VQTISEPYGV VGQITPWNFP
LLMAAWKLGP ALAAGNCSVL KPAEQTPLTI LKLMDEVDDV LPDGVVNVVT GFGPEAGEPL
AKHPDIRKLA FTGSTEIGKQ VMAQAAENVH DITLELGGKS PLIIYPDADL EKAVNTTITA
IFYNTGECCS AGSRLFIHSN IKEEFLDALA STAEDLVIDD PLLEETTLGP KVTEEQAQNT
LEYIQEARDA GADFITGGDV PDDDALEEGS FVSPTLIDNI DHNNRAVQEE IFGPVQEVFE
WTDYEKMIKL ANDVDYGLAA GILTNDLTKA YQTAKDIEAG TIWVNQYNSF PAGQPFGGYK
ESGIGREIGY EALADHYTQT KTINIGLQ