Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1588 |
Symbol | |
ID | 8742180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1645383 |
End bp | 1646714 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646512165 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003403147 |
Protein GI | 284164868 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGACC TCGTCGTCGA CCTCGCGCGG CTGTTGGGAG CGTTCGTACT GGTCGCCCTG AACGGCTTCT TCGTCGCCGC GGAGTTCGCC TACGTCCGCG TTCGCTCGTC GGCCGTCGAA CGGGCGGTCG CGGAGGGGAA GACCGGGGCG ACGCTCCTGC AGGAGGCCGT CGACAACTTA GACGACTACC TCGCGACCAC CCAGCTGGGA ATTACCCTCG CCTCGCTGGG GCTGGGGTGG CTGGGCGAAC CGGCCGTGGC GGCCCTCCTC GAGCCGGTGC TCGCGCCGGT TCTCCCGGCG AGCCTGCTCC ACCTCGTCGC CTTCATCATC GGATTCAGCT TCATCACGTT CTTGCATGTC GTCTTCGGCG AACTCGCGCC GAAGACGATC TCGATCGCCC AGGCCGAACG CGTCGCACTG CTGGTCTCGC CGCTGATGAA GTTCTTCTAC TTCATCTTCA TCCCCGGGAT CATCGTCTTC AACGGGACGG CCAACGCCTT CACGCGACTG ATCGGGATTC CGCCGGCCTC GGAGACCGAC GAGACGCTCT CCGAGGAGGA GATCCTGACC GTCCTGAGCA GGTCCGGGAA CGAGGGCCAG ATCGACGCGG AGGAGGTCGA GATGATCGAA CAGGTCTTCG AACTCGACGA CACGACCGTC CAGGAGGTGA TGGTGCCCCG GCCCGACGCC GTGACGATCA CCGACGACCT TCCGCTCTCT GACCTTCGCA CGCTGATCCT CGAGGAGGGA CACACTCGCT ATCCGGTGCT CGACCCCGAC GGGGACGATC AGGTGATCGG CTTCGTCGAC GCCAAGGACG TGCTGCGAGC GGGCGAGTCC GCGGGCGACC TCGCGGACGT GACCGTCGGC GAGGTCACCC GCGAGATGCC CGTCGTGCCG GAGACGACGC CCGTTACCGA TCTCCTGGAG CAGTTTCAGG GAGACCGGGC ACAGATGGCC GCGGTGATCG ACGAGTGGGG CGTTTTCGAG GGAATCGTCA CGGTCGAGGA CCTCGTCGAG CAGATCGTCG GCGACCTCCG CGACGGGTTC GACGCCGACG AGCCCTCGAT CGACCGGCGG GGCGACGGTT CCTACGTCGT CGACGGTGCC GTCACCGTCT CGGACGTCAA CGAGCGACTC GACGCCGACT TCGAGTCCGA CGAGTTCGGC ACGATCGGCG GGCTCGTGCT GGATCGGCTG GGTCGGGCCC CCGACGTCGG CGACCACGTC GAGGTCGACG GCTACGCGCT CGAGGTCGCC GCCGTCGAGG GCGCACGAAT CTCCTCGCTG GTCGTTCGCG AGAACCCGAA GGCGGAGGAG ACGGCCGAAT GA
|
Protein sequence | MVDLVVDLAR LLGAFVLVAL NGFFVAAEFA YVRVRSSAVE RAVAEGKTGA TLLQEAVDNL DDYLATTQLG ITLASLGLGW LGEPAVAALL EPVLAPVLPA SLLHLVAFII GFSFITFLHV VFGELAPKTI SIAQAERVAL LVSPLMKFFY FIFIPGIIVF NGTANAFTRL IGIPPASETD ETLSEEEILT VLSRSGNEGQ IDAEEVEMIE QVFELDDTTV QEVMVPRPDA VTITDDLPLS DLRTLILEEG HTRYPVLDPD GDDQVIGFVD AKDVLRAGES AGDLADVTVG EVTREMPVVP ETTPVTDLLE QFQGDRAQMA AVIDEWGVFE GIVTVEDLVE QIVGDLRDGF DADEPSIDRR GDGSYVVDGA VTVSDVNERL DADFESDEFG TIGGLVLDRL GRAPDVGDHV EVDGYALEVA AVEGARISSL VVRENPKAEE TAE
|
| |