Gene Information |
Locus tag | Htur_4057 |
Symbol | |
ID | 8744685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 310218 |
End bp | 311591 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 646514622 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003405569 |
Protein GI | 284167291 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
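The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 310218
END_BP = 311591


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1374
    print(protein_length(n_bp))  # 457
```

Under these assumptions the computed values agree with the Gene Length (1374 bp) and Protein Length (457 aa) fields of the record.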
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCCA TCAATCCAGC AACTGAGGAA GTCCTCGCAA CGTACAAAGA AGATGACACC GGAACGGTCT ACGCTCAGCT AGATACGGCT CGGGAAGCTT TCGAATCGTG GTCTGAGAGA TCGGTTTCCA GCCGAGAACG TTTATTAGCA AACGTCGCTG ATGTACTGCG CGAAAACATT GATAAATATG CGAAACAGAT AACCATAGAA GTAGGGAAAC CGCTCGATCA GGCGGTTTCC GAGATCGAGA AATGTGTATG GGTATGTAAT TACTATTCCC AGCATTCATC CGAATATCTA CAGAATGAAC ATATCGGCAC TGAACCGGGG GCGAAGACGT ATGTCTCGTA TGAACCACTG GGGGCTATCT TGGCAATAAT GCCGTGGAAC TATCCATTTT GGCAAGTATT CCGATTTGCA GCCCCCCATA TTACGTCCGG TAACGTTGCT ATCTTAAAGC ATTCTCCGAA CGTATTTGGC TGTGCACAGG CCATCGAAGA CATCTTTGAA GAAGCAGGTT ATCCGGACGG TGTCTTCACA TCAGTCCAAG TGTCTGTAAA TCAGGTTTCG GATATCATCG AAGACGATCG AGTCAGGGCA GTTACACTGA CCGGCAGCAC ACGGGCGGGC AAGGAGGTGG CAACGATAAG TGGCCGTGAG CTCAAACCAA CCGTTTTCGA ACTCGGGGGG AGTGATCCCT TTGTTGTCCT AGATGATGCT CCAGTCGAAC GCGCAGCCGA AGTCGGTGCG ACAGCACGAA CTCAGAACGC CGGTCAGTCT TGTATCGCAG CGAAACGATT TATCGTTCAC GAGACGGTCT TCGATGAATT TCTCGAGCGA TTAACGACGG AATTTAAAGC ACTTACTATC GGCGATCCGA CCGACGTGGA AACGGACGTC GGTCCCCTGG CGAGAGAAGA CCTACTCGAA ACACTCCACG ACCAGGTGAT GACGTCTGTC GACGCTGGAG CGACCTTGCA TCTGGGCGGT GAACCGCTCG ATCAATCGGG ATACTTCTAC CCGCCCACAA TCTTGGTCGA CGTTCCCGAT GAAACGCCCG CAGCGACCGA AGAACTATTC GGACCCGTCG CTACTGTGTT CAAGGTCGAA AGCGAACGGG AAGCACTTCG AATTGCGAAC GATACTCAGT ACGGACTCGG TGCAAGCGTT TGGACAACGA ACACGGATCG CGGCGAATCC GTTGCTCAAG AGCTCGAAGC CGGGAACGTG TTCGTAAATC AGCTCGTCAA GTCTGATCCT CGAGTCCCAT TTAGCGGGGT GAAGGATTCG GGTTATGGAA CAGAGCTTTC GCGACATGGG ATCCACGAGT TCGTGAATAA GAAGACGATA TGGATCGAAG GACAAGGTGA ATAA
|
Protein sequence | MKSINPATEE VLATYKEDDT GTVYAQLDTA REAFESWSER SVSSRERLLA NVADVLRENI DKYAKQITIE VGKPLDQAVS EIEKCVWVCN YYSQHSSEYL QNEHIGTEPG AKTYVSYEPL GAILAIMPWN YPFWQVFRFA APHITSGNVA ILKHSPNVFG CAQAIEDIFE EAGYPDGVFT SVQVSVNQVS DIIEDDRVRA VTLTGSTRAG KEVATISGRE LKPTVFELGG SDPFVVLDDA PVERAAEVGA TARTQNAGQS CIAAKRFIVH ETVFDEFLER LTTEFKALTI GDPTDVETDV GPLAREDLLE TLHDQVMTSV DAGATLHLGG EPLDQSGYFY PPTILVDVPD ETPAATEELF GPVATVFKVE SEREALRIAN DTQYGLGASV WTTNTDRGES VAQELEAGNV FVNQLVKSDP RVPFSGVKDS GYGTELSRHG IHEFVNKKTI WIEGQGE
|
| |