Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2039 |
Symbol | |
ID | 8742638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2109652 |
End bp | 2111238 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646512621 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003403596 |
Protein GI | 284165317 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACA GAGACGACGA CGGACGATCG GCGGACAGCC GGAGCGAGCG AGCCGACGGC GGCGAGGTCG CCGGAACCGA GACGGAGACT CGCTCCGAGT CCGGCCCCGA CGTCGGTACC GGTAGTCGAA CGAACGCCGA GACTGAGCCC GACGCTGCCG AGGCTGAAGC CGGCGCCGCT GCCGAGGTTA CGTCTAACAC AGACACCGAG ACGGACGACA GTACTGACGA CGATACCGAC CCCGATCCCG ACGGTGCCGG CGAGCCCGCA GACGGCGATT CCGGGACCGA AGACGCCCCG GAATCGGTCG TCGTCGACTC GAGCGTCCGG GAGACGAACC GCTGGGCAGG CGTCGCCGCC GTCGCCTCGG TGTTCGGCGG CGCCGGCGTT ATCGTCACGT CGCCGGCGCT GCTGCTGGCG GCGGTCGTCG GCATCGCGTA CGCGGCCTAC GCCCGGGCCG GTCGGCCGCC GACCCCGACG CTCTCGATCA GCCGCGAACT CGAGGACGAC GACCTCGAGC CGGGTGAGCC CGTTCGCGTG ACGGTACGCG TTCGCAACGA CGGGGACGAA CTCCTGCCGG ATCTCCGGCT CGTCGACGGC GTGCCGTCGG AACTCGTCGT CACCGACGGC TCCCCGAGAC GCGGCACCGC GCTCCGGCCG GGCGAGACGG AGACGTTCTC CTACGCGGTG ACCGCTCGCC AGGGCACCCA CGCGTTCGAA CCGATAGCCG TCGTCGCGCG AGGGTTCACC GGCGACGCCG AGCGCGTCCA GCGGATCCGC GTCGACACCG AACTCCGCTG TCCGCCCGCC GAGGCGGAGA CCGACCTCCC GCTGCGGTCG CTGACGCTGC CTTTGACCGG CCGCGTCGAG ACCGACGTCG GCGGCGAGGG CCTCGAGTTC CACTCGACGA GGGAGTACCG ACGCGGCGAT CCGCTCTCGC GGATCGACTG GAACCGCCGG GCTCGCGGTC AGGAGCTGAC GACCGTCACG TTCCGCGAGG AGCGGTCGGC GACGGTGATG CTCGCCGTCG ACACGCGCAC GGACGCGTAC CGTCGACCCG ACGAGACGGG TCGTCATGCG GTCGATCGCA GCATCGAGGC GGCCGTCGCC GTGCTCGACG CGCTGGATGC CGGCGGCAAC AACGTCGGCC TCGCCAGCTT CGGACCGCAC ACAGAGTGGC TGGCACCGGG CTCGGGGCCC GACCATCGAG CGTCGGCCCG CCGACTGCTC GCGACCGATT CCGCGTTCGA ACTGTCCCCG CCGGAGTCGT CGGCGTCGAT GCTGTACGTC CAGCGCAAGC GATTCCGCTC GCGGATCCCC GCTGATTCGC AGGTGATCCT GTTCACGCCG CTGTGCGACG ACGACATCGT GCGTACCGCG CAGTTGCTCG AGGCCGACGG CCATCTCGTC ACCGTTATTA GTCCCGATCC GACTGGACGC GACGCGCCCG GCGAACGGCT CGGGGTGGTC GAACGAGCGG CTCGCATCTC GACGCTCAGG GGGGCGGGCA TCAGGGTCGT CGATTGGCCG GCGGACGACT CGCTGGCGGC GACGCTCGAG CGCAGTCGAC GGCGGTGGTC GGCGTGA
|
Protein sequence | MTDRDDDGRS ADSRSERADG GEVAGTETET RSESGPDVGT GSRTNAETEP DAAEAEAGAA AEVTSNTDTE TDDSTDDDTD PDPDGAGEPA DGDSGTEDAP ESVVVDSSVR ETNRWAGVAA VASVFGGAGV IVTSPALLLA AVVGIAYAAY ARAGRPPTPT LSISRELEDD DLEPGEPVRV TVRVRNDGDE LLPDLRLVDG VPSELVVTDG SPRRGTALRP GETETFSYAV TARQGTHAFE PIAVVARGFT GDAERVQRIR VDTELRCPPA EAETDLPLRS LTLPLTGRVE TDVGGEGLEF HSTREYRRGD PLSRIDWNRR ARGQELTTVT FREERSATVM LAVDTRTDAY RRPDETGRHA VDRSIEAAVA VLDALDAGGN NVGLASFGPH TEWLAPGSGP DHRASARRLL ATDSAFELSP PESSASMLYV QRKRFRSRIP ADSQVILFTP LCDDDIVRTA QLLEADGHLV TVISPDPTGR DAPGERLGVV ERAARISTLR GAGIRVVDWP ADDSLAATLE RSRRRWSA
|
| |