Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3398 |
Symbol | |
ID | 8744018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3509089 |
End bp | 3509877 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513980 |
Product | HpcH/HpaI aldolase |
Protein accession | YP_003404934 |
Protein GI | 284166655 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCT CGCCACGAAC GAACCTCTTA CAGCAGACGC TCGCAGACGG CGACGTCGCG CTCGGTGTCC TCGAGAACGC GTACGATCCG ACGCTGGTCG AGTTCTACGG CGAACTTGGC CTCGATTTCG TCTGGATCGA CCTCGAACAC GCCGGACCGA GCCCGTTCGA CGGCGACCGA CTCGAGGACC TGGCCCGGGC CGCGAACGTG ACGGGAACGG AACTGCTCGT TCGCTTGCCC GAGCCCGACC CCGGGATGGT CCGGAAGACG CTCGACGCCG GCGTCCGGTC GCTGTTCGTC TCCCGGATCG AGTCCGCCGA CGAGGTGCGG CGGGCGATCG AGGCCTCGCG GTTCGAGTAC GACGGCGAGC CGGGCAAACG CGGCTTCGCC AGCCCCCGCG CGAGTCGGTG GGGGACGACC GACGACTACG CCGGTACCGA GGACGACGAG ATTATCGTCG GCGTGACGAT CGAGAACCCG ACCGCGGTCG ACAACATCGA GGAGATCCTC GAGGTGCCAG GGTTGGGGTT CGTCTTCGCC GGCCCGCTCG ATCTGGCCGT CTCGCTGGGC CACCCCGGCG AGCCCACCCA CGACGAGGTC GAAGAGCGAA TCGAGGAGAT TCGAGAGGCG GCCCTCGAGG CCGAGGTCCC GCTGGGCGGG CTCGGGTTCG GGATGGACGA CGTCAACGAG AAGGCCGAGT CGGGCTACCA GATCCTCAAC CTGGGGAGTA CGACCGGAGC GCTGGGCGGC GCCGTGCGCT CGTGGCTGAA CGAATACGGA GGTACCTGA
|
Protein sequence | MATSPRTNLL QQTLADGDVA LGVLENAYDP TLVEFYGELG LDFVWIDLEH AGPSPFDGDR LEDLARAANV TGTELLVRLP EPDPGMVRKT LDAGVRSLFV SRIESADEVR RAIEASRFEY DGEPGKRGFA SPRASRWGTT DDYAGTEDDE IIVGVTIENP TAVDNIEEIL EVPGLGFVFA GPLDLAVSLG HPGEPTHDEV EERIEEIREA ALEAEVPLGG LGFGMDDVNE KAESGYQILN LGSTTGALGG AVRSWLNEYG GT
|
| |