Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4424 |
Symbol | |
ID | 8745053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 1350 |
End bp | 2357 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514961 |
Product | aldo/keto reductase |
Protein accession | YP_003405908 |
Protein GI | 284172526 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00178858 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTCG ATCACGTGTC TCTCGGTAGC ACCGGACTCT CAGTGAGCGA GCTCGCGCTG GGGACGTGGC GCTTCGGCCG ACGGCTGGTC GACGGCGAGG AACGCTACGA CGCGGAGGGA GTCGTCGAAA CGGACGAGGA CCGCGCGTAC GAACTCCTTG ACGCCTACGC CGACGCGGGC GGGAACTTCA TCGACACCGC CGACAAGTAC GGCGATGGGC GGGCCGAGCG GTGGATCGGC AACTGGCTCG AGGACCGCGA CCGACTGGAC TACGTCATCG CCACGAAGAT CCACCGTCCC CGACGGGAGG GCGACCCCAA CGCACGGGGA CTCAACCGTC GGCACCTGCG CCGGCAGATC GACACCTGCC TCGAGCGACT GGGGACGGAC TACATCGATC TGCTCTACTG CCATCGGTGG GACGACGACA CCCCCGCGGA GGAGTTCATG CGCACCCTGA ACGGACTCGT CGAGTCCGGG AGGGTGAACT ATCTCGGTAT CTCCTCGGGC CGTCCCGACG CGTGGAAGAT CGTCAAGGCG AACGAGATCG CCCGGCGCGA GGGGTACGAA CCGTTCACGG TGACCCAACC GCGGTACAAT CTCGTGGATC GGGAAATCGA CGCGAACTAC CTCCCGATGT GTCGCGATTA CGGGCTCGGC GCCGTGACGT GGAGCCCGCT GGCGTGGGGC TTTCTGACCG GCAAGTACCG GCGCGACGAT CGGAACGACG AGTCGTCGAC CGCCGCGCAG GACGGGCGCT TCGCGGATCG GTATCTGACC GAGGAGAACT TCGACGCGCT CGAGGTCCTC CTCGAGATCG CGGACGACAT CGACGCGACG CCGGCGCAGG TCGCCCTCGC GTGGCAGCTT CACCACCCCG ACATCACGGC GCCCATCGTC GGCGCCAGCA CGGTCGAGCA GTTGACGGAG AATCTCGGCG CCGCGGACGT GTCGCTCTCC GCCGAGCAGT TCGAGCGACT CTCCGCGACG TTCGCCGGTT CGGAGTGA
|
Protein sequence | MALDHVSLGS TGLSVSELAL GTWRFGRRLV DGEERYDAEG VVETDEDRAY ELLDAYADAG GNFIDTADKY GDGRAERWIG NWLEDRDRLD YVIATKIHRP RREGDPNARG LNRRHLRRQI DTCLERLGTD YIDLLYCHRW DDDTPAEEFM RTLNGLVESG RVNYLGISSG RPDAWKIVKA NEIARREGYE PFTVTQPRYN LVDREIDANY LPMCRDYGLG AVTWSPLAWG FLTGKYRRDD RNDESSTAAQ DGRFADRYLT EENFDALEVL LEIADDIDAT PAQVALAWQL HHPDITAPIV GASTVEQLTE NLGAADVSLS AEQFERLSAT FAGSE
|
| |