Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4012 |
Symbol | |
ID | 8744640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 266611 |
End bp | 268137 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646514586 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003405533 |
Protein GI | 284167255 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACTG AATCCCAGTC AGCCACTGAG CGGAAGAGTG ATATCGAACA ACGTCATCAA GAGACCGCAG CGGACGTCGT TCCACCGAAT CTGAAACTTT ATATCGGTGG CGAGTGGACG ACAAGTTCTT CGGGAGAAAC ATTCGAAACC CGAGATCCAA CAACCGGCGA CTCACTTGCA ACAGTCCAAG CAGGAAACGA CAAGGACATC GATCGCGCAG TCGAGGCGGC ATGGACTGCT TACGACGACA CTTGGTCGAA CTACTCGGCG GCGGATCGCC AGCGTGTTCT CGAGGAGATC GCCGACAGAG TCGAGCAAAG CAAGGAGGAA TTCGCGCTCC TCGAAACGCT CGACAATGGG AAACCGATCA GTGAGTCCAG AGTCGATATG GAGCTGGTCG CTGATCACTT CCGCTATTTC GCTGGCGCAA CCCGTGTCAA CGGCGGGGAC ACCATCCCAA GTGGTGGTGA GAGCCAGCAC GTCCAAACGA TCTCCGAACC GTACGGTGTC GTTGGCCAGA TCACACCGTG GAACTTCCCA CTGTTGATGG CAGCGTGGAA ACTCGGCCCA GCGCTCGCTG CAGGCAATTG TTCGGTGCTC AAGCCGGCCG AACAGACACC GTTGACAATC CTCAAGCTGA TGGACGAAGT CGACGACGTA CTTCCCGATG GGGTCGTGAA CGTCGTTACC GGCTTCGGAC CTGAAGCTGG CGAACCACTG GCGAAACATC CAGATATTCG AAAACTTGCC TTCACCGGAT CAACCGAGAT CGGTAAGCAA GTAATGGCAC AGGCTGCCGA GAACGTCCAC GACATCACGC TCGAGCTTGG TGGAAAAAGC CCGCTGATCA TCTACCCTGA CGCGGATCTT GAAAAGGCAG TCAACACGAC GATAACAGCC ATCTTCTACA ACACGGGTGA GTGCTGTTCC GCGGGATCAC GGCTGTTCAT CCACAGCAAT ATCAAAGAAG AGTTCCTCGA CGCTCTGGCG TCCACTGCTG AGGACCTCGT GATTGACGAT CCCCTTCTCG AAGAAACGAC TCTTGGGCCG AAGGTGACCG AAGAACAGGC CCAGAATACG CTCGAGTACA TCCAGGAAGC TCGTGACGCT GGTGCCGACT TCATTACCGG CGGTGACGTA CCCGACGACG ACGCTCTCGA AGAGGGAAGC TTCGTCTCGC CGACGCTGAT CGACAATATT GATCACAACA ACCGTGCCGT CCAAGAGGAA ATCTTCGGCC CGGTTCAAGA AGTCTTCGAG TGGACCGACT ACGAGAAGAT GATCAAACTG GCGAACGACG TCGACTATGG GCTCGCAGCG GGTATCCTCA CAAACGACCT GACGAAAGCT TATCAGACAG CGAAAGATAT CGAAGCTGGG ACGATCTGGG TGAACCAGTA CAACTCCTTC CCAGCTGGAC AGCCCTTCGG CGGCTACAAA GAGTCCGGTA TCGGCCGTGA AATCGGATAC GAGGCACTCG CCGACCACTA CACACAAACG AAAACCATCA ACATCGGTCT GCAGTAG
|
Protein sequence | MSTESQSATE RKSDIEQRHQ ETAADVVPPN LKLYIGGEWT TSSSGETFET RDPTTGDSLA TVQAGNDKDI DRAVEAAWTA YDDTWSNYSA ADRQRVLEEI ADRVEQSKEE FALLETLDNG KPISESRVDM ELVADHFRYF AGATRVNGGD TIPSGGESQH VQTISEPYGV VGQITPWNFP LLMAAWKLGP ALAAGNCSVL KPAEQTPLTI LKLMDEVDDV LPDGVVNVVT GFGPEAGEPL AKHPDIRKLA FTGSTEIGKQ VMAQAAENVH DITLELGGKS PLIIYPDADL EKAVNTTITA IFYNTGECCS AGSRLFIHSN IKEEFLDALA STAEDLVIDD PLLEETTLGP KVTEEQAQNT LEYIQEARDA GADFITGGDV PDDDALEEGS FVSPTLIDNI DHNNRAVQEE IFGPVQEVFE WTDYEKMIKL ANDVDYGLAA GILTNDLTKA YQTAKDIEAG TIWVNQYNSF PAGQPFGGYK ESGIGREIGY EALADHYTQT KTINIGLQ
|
| |