Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4646 |
Symbol | |
ID | 8745406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 225680 |
End bp | 227065 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646515157 |
Product | sulfatase |
Protein accession | YP_003406104 |
Protein GI | 284172722 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATT ATAAGGTGCC GGGCGGGCGA TGGGAGATAA TGACCGACGA CGACCATGAC CGGCCGAACG TCATCGCGGT GGTGACCGAT CAGCAGCGCT GGGACACGGT TGGCGTCTAC GGGTGTCCGC TGGACCTCAC TCCGACGCTC GATACGCTCG CAGCGCAGGG AAGCGTCCTC ACACAGGCAA TTACACCGCA GCCGCTCTGT GGCCCCTTCC GGGCGGCGTT TCAAAGCGGA AAGTACGCTA GCGAGGTCGA CGTATGGCGG GACGCGGTGA GAATGCCAAG CGATGAGCTG CATCTCTCCA GACAGTTCAA AGATGCCGGG TACGACGTCG GATACGTCGG GAACTGGCAT ATTGCCGGAA CCTTCGATAA TCCCGTTCCT GAACAGTCCC GCGGCGGATA CGAGGACTTC TGGATCGCTG CGGACGTTCC GGAATTCACT ACACAACCGA CGGAGGGTCA CTTGTTCGAT GCCGACGGAA ATCCCGTCAA GTTCGAACGG TATCGTGTGG ATGCGTTTAC TGCGTTCGCC TGCGAAGCTA TCGAGTCGCT GTCTGAGCCG TTTTTCCTTG TCGTCGCGTA CGTCGAACCG CATAACCAGA ACGATATGTG GTCGTACGTC GCGCCGGACG GGTACGCAGA GCCGTACCAG AAACGCCCGT ACGTACCAGA GGATTTGCAG GACCGGCCAG GCGACTGGTA CGAAGCGTTA CCAGACTACT ATGGAATGGT CGAGCGAATC GACGAATGCG TCGATAATCT TCTCGAAGTG TTGTCTGATC GGGGTATCCG AGACCGGACA ATTATCGCTT ACACGTCCGA TCACGGGTGC CACTTCCGGA CGCGGCCGGG CGAGTACAAG CGAGACCCCC ATGAGTCCGC CATTCGGGTG CCCGCAATAC TCGTCGGGCC GGGATTCGAC AAGGGAGTCG ACGTCACTCA GCCAACGAGC ATGATCAATC TCCCACCGAC CTTGCTTGAT GCCGCCGGCA TCGATGTCCC TAACGAAATG CACGGTGAGA GCCTCCTCCC GATCATCCGC AGAGATGTAC CTGATGTCAA CGGTGAGGCA TTCATCCAGA TTAGCGAATC ACAGGTTGGC CGGGCGCTCC GAACCGACCG CTGGAAGTAC GCCGTCGCCG CTTCGTCGCT AACCGGATGG CGCGGCGGCA GCGCCGAAAA ATCGAGTGAC GTGTACGTCG AACGTTATCT CTACGATCTT GAACGCGATC CACACGAGCA GGTTAACCTA GTTGGTCATC CAGACTTTCG ATCTATCGCT GATGATCTTC GCGATCGGAT CCTTGCATAC ATTCAGGAGA TTGAAGACGA ATCACCCCGG ATCAAGCCCT ACGAGGGCGG TTACACCGGG TTTTGA
|
Protein sequence | MMDYKVPGGR WEIMTDDDHD RPNVIAVVTD QQRWDTVGVY GCPLDLTPTL DTLAAQGSVL TQAITPQPLC GPFRAAFQSG KYASEVDVWR DAVRMPSDEL HLSRQFKDAG YDVGYVGNWH IAGTFDNPVP EQSRGGYEDF WIAADVPEFT TQPTEGHLFD ADGNPVKFER YRVDAFTAFA CEAIESLSEP FFLVVAYVEP HNQNDMWSYV APDGYAEPYQ KRPYVPEDLQ DRPGDWYEAL PDYYGMVERI DECVDNLLEV LSDRGIRDRT IIAYTSDHGC HFRTRPGEYK RDPHESAIRV PAILVGPGFD KGVDVTQPTS MINLPPTLLD AAGIDVPNEM HGESLLPIIR RDVPDVNGEA FIQISESQVG RALRTDRWKY AVAASSLTGW RGGSAEKSSD VYVERYLYDL ERDPHEQVNL VGHPDFRSIA DDLRDRILAY IQEIEDESPR IKPYEGGYTG F
|
| |