Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4209 |
Symbol | |
ID | 8744837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 480473 |
End bp | 481924 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514756 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003405703 |
Protein GI | 284167425 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCA GCTACGAGAA CTTCGTAAAC GGACAGTGGG TCGAATCTGA GACGAATGCA ACCTTCGAGG TTAGAAATCC GGCACGGACC GACGAATTGG TCGGCGAATA CCAGGACTCG AGTCTGGCAG ACGTCGAGGC CGCCGTCGAG GCGGCCGCCG ACGCACAGGA CGACTGGGCC GGCACGCCGG GCCCGGAACG GGGCGACCTC CTCAAGCGAG CGGGAACGCT TCTCGAGCAG CGCAAAGACG AGACGACCGA GGCGCTCGTC CGTGAAGAGG GGAAAACGCG CGCCGAGGCA GCCGGCGAGG TTCAGCGTGC AATCGACATT TTCGCGTACT ACGGACAGAA GACACGCGAT CTCGGGGGGA CCGTCAAATC TGCAAGCGGC CGCAACACGG AACTCACGAC CAGACAGGAG CCGCTCGGAA CGGTCGCATT AGTGACCCCG TGGAACTACC CGATCGCGAT TCCGGCATGG AAGCTCGCGC CGGCGCTGGC CGCCGGAAAC ACGGCGGTCG TCAAGCCCGC TTCGGAGGCC CCTGGCATGG CACGGGTTCT GTTCGAGTGT CTCGAGGAGG CAGGGCTCCC CGACGGCGTG GCGAATTTGG TCACGGGATC GGGGAGCGAC GTCGGCAGGC CGCTCGTCGA ACACCAGGCG ATCGACGGCG TCTCGTTCAC GGGGAGTACG GCCGTCGGGA CGCAGGTCGC ACAGACGGTG ACCGACGACC TCAAACGCGT CCAGTGTGAG ATGGGCGGGA AGAACCCGAC GGTCGTGATG CCGAGCGCCG ACGTCAACGA GGCGGTGGAG ATCGTCGGCC AGGGTGCGTT CGGAACGACA GGCCAGTCCT GTACGGCCGC CTCTCGAGCG ATCGTCCACG ACCAGATTTA CGACGAGTTC GTCGACACAA TGGTCGAGTA CGCCGAATCT GTGGACATCG GTCCCGGACT CGACGACCCT GATATGGGGC CCCATGTTTC CGAGTCGGAA CTCCAGTCGA CGCTCGAGTA CGTCGAGATC GGCCGAGACG AGGGCGCGAC ACTCGAAACC GGCGGGGAAC GCCTCACGGA CGGGGAGTAC GCGGATGGCT ACTACGTGGA ACCCGCCGTC TTCTCGGACG TCGACAACGA GATGCGGATC GCACAGGAGG AGATCTTCGG ACCGGTACTG GCGGTCATTC GCGCCGACGA CTTCGAGGAC GCGCTGTCGC TTGCGAACGA CGTCGACTAC GGGCTTTCGG CGAGCATCGT GACGCAGGAT ACGACCGAAG CCAACGAGTT CCTCGATCGC ATCGAAGCGG GCGTCGCCAA AGTCAACGAA AAGACGACCG GGCTGGAATT ACACGTCCCC TTCGGCGGCT ACAAGAATTC CTCCACGAAC ACGTACCGAG AACAAGGCGA CGCCGGATTG GACTTCTTCA CGACGACGAA GACGATCTAC CGTAACTACT AA
|
Protein sequence | MASSYENFVN GQWVESETNA TFEVRNPART DELVGEYQDS SLADVEAAVE AAADAQDDWA GTPGPERGDL LKRAGTLLEQ RKDETTEALV REEGKTRAEA AGEVQRAIDI FAYYGQKTRD LGGTVKSASG RNTELTTRQE PLGTVALVTP WNYPIAIPAW KLAPALAAGN TAVVKPASEA PGMARVLFEC LEEAGLPDGV ANLVTGSGSD VGRPLVEHQA IDGVSFTGST AVGTQVAQTV TDDLKRVQCE MGGKNPTVVM PSADVNEAVE IVGQGAFGTT GQSCTAASRA IVHDQIYDEF VDTMVEYAES VDIGPGLDDP DMGPHVSESE LQSTLEYVEI GRDEGATLET GGERLTDGEY ADGYYVEPAV FSDVDNEMRI AQEEIFGPVL AVIRADDFED ALSLANDVDY GLSASIVTQD TTEANEFLDR IEAGVAKVNE KTTGLELHVP FGGYKNSSTN TYREQGDAGL DFFTTTKTIY RNY
|
| |