Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1209 |
Symbol | |
ID | 8741798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1264033 |
End bp | 1265502 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646511788 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003402773 |
Protein GI | 284164494 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCCA CGCGCCGACT GTGGGCCGTC GCGAGCCTCG CGGCCTTTCT CGCGGGCGTC GCAGTCGTTA CTGCCCGCCC GCTCCTCCTC GGCGGTGCCG GGCTGGTCGG CTCGTGGATC GTCGCGCGCC AGTATCGGTT CTACCGCGCG CTCGAGGAGA CGGTCGACGC GCTGGCCGTC GAGCAGTCGG CCGTCCGCGC CGGCGTTCGA ACGGGCGATA CCGTTCCGGT GACTCTCTCG GCGAGGCTGG CCTCACCGTC GCCGCTCGCC GTCGCGATCG AGGCCGGGCT CCCGACGACC GCCGTGGCCG ACGAATCCTT CTCGCTGTCC CTCGATCCGT CGACGTCCGC GACCACCCGA ACGGTCGACG TCTCGTGGCC GGTCGCGGGC CGCCACCGGT TTGACGAACC GACTGTGACC GCGACGGACG GATTCCTCCG CGAGACGGTG TCGCTCGGAA CGACCTCGAC GGTCACCGTC GAGCCCCGCG GTCCGCGAAC CATCCACGTC GGCGAGGGCG GCGATCGGAT CACGATGGCC TACGGCGAAC ACGAGGCCGG TCGTCTCGGG TCGGGGATCG AACCCGCGGA ACTCCGCGAG TACATGCCCG GCGACACGGC CGACCGGATC GACTGGAAGG CCACGGCCAG GCTGGCGACG CCTCACGTCC GCGAGTACGA GGCCGAGACC GACCGACGGA CGCTACTGGT CGTCGACCAC CGCGGCTCGC TGGCGACGGG GCGGCCGGAC GAAACCGAAC TCGACTACCT CCGCGACGTC GCGCTCGCGA CGGCCGCGAG CGCGCGCCGA CTCGGCGACC CCGTTGGACT GCGTACCGTC GGCGACGAGG GGATCACGTT TCGCCTCGAC CCGACGGCGA CGCCGGTGGC GTACGATCGG ATCCGGCGTC GATTGCTCGA CCTGGAGCCG ACGGTCGATC CGACGACGCT CGACGGAAGC GGCCGAGAGG GGCGGCGGAG ACGGACCCCA ACACCCCGGG GCGGCGGCTT CACCGCGGCC GACGCCCGGG CGAAACGCAT CGGCCTCGGG GACGACGACG ACCCGTTCGC TTCGACCCTT CGCCCGTTCT ACGCCGCGCG GGAGGGCTAC CGCGAACGCA TCGAATCGGA TCCGCTCTAC GGCGCCGTCA AGCGTGCCCA CAGCGGCAAC ACCGAGGGGC TCTGGACGAT CCTCTTCACC GACGACTCGC GGCCGGCGGA GCTCCGCGAG ACGGTCAAAC TCGCCCGCGG CAACGGCAAC TCGGTGCTGG TGCTGCTCGC GCCGACGGTG CTCTACGAAT CCGACGGTCT CGCGGACGTC GAGGACGCCT ACGATCGCTA CGTCGAGTTC GAGAACCTGC GTCGCGACCT CGCCCGGATG CCCCGCGTGA CCGCCCTCGA GGTCGGCCCG CGGGATCGCC TCTCGACGAT CCTCTCGGAC GGCCGCGCCG CTCGAGGTGA GCGCGCGTGA
|
Protein sequence | MKPTRRLWAV ASLAAFLAGV AVVTARPLLL GGAGLVGSWI VARQYRFYRA LEETVDALAV EQSAVRAGVR TGDTVPVTLS ARLASPSPLA VAIEAGLPTT AVADESFSLS LDPSTSATTR TVDVSWPVAG RHRFDEPTVT ATDGFLRETV SLGTTSTVTV EPRGPRTIHV GEGGDRITMA YGEHEAGRLG SGIEPAELRE YMPGDTADRI DWKATARLAT PHVREYEAET DRRTLLVVDH RGSLATGRPD ETELDYLRDV ALATAASARR LGDPVGLRTV GDEGITFRLD PTATPVAYDR IRRRLLDLEP TVDPTTLDGS GREGRRRRTP TPRGGGFTAA DARAKRIGLG DDDDPFASTL RPFYAAREGY RERIESDPLY GAVKRAHSGN TEGLWTILFT DDSRPAELRE TVKLARGNGN SVLVLLAPTV LYESDGLADV EDAYDRYVEF ENLRRDLARM PRVTALEVGP RDRLSTILSD GRAARGERA
|
| |