Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3376 |
Symbol | |
ID | 8743996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3485194 |
End bp | 3486723 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513958 |
Product | sulfatase |
Protein accession | YP_003404912 |
Protein GI | 284166633 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGT CAGTCGATTC ACCCGATTCA TCAGACGTCG ACGCAGCGTC GGCTAACGGA CGTGATCCGG AGTCACATTC CACTGTGCGG AACGTCGTGC TCGTCGTCCT CGATACCGCA CGGGCGACGA GTACGGGGCC GAAGACGACG CCGAATCTGA ACCGCCTCGC GGCCGACGGA ACCCACTTCG ACAACGCCTT CGCGACCGCA CCCTGGACGC TCCCGTCTCA CGCCTCGATG TTCACCGGGA CCTATCCCTC CGAGCACGGC ACCCACGGCG GACACACCTA TCTCGACGAC GAGTTGCGGA CCCTCCCAGA GTCCTTCGCC GATTCCGGAT ACGAGACGAT CGGCGTCTCG AACAACACCT GGATCACCGA GGAGTTCGGC TTCGACCGCG GCTTCGACGA CCTCCGGAAG GGCTGGCAGT ACATCCAGTC CGACGCGGAC ATGGGCGCCG TCGTCCGCGG CGAGGACCTC CGGGAAAAGC TCCAGGCGAC CCGGAACCGG CTCTTCGACG GCAACCCGCT GGTCAACGCC GCGAACATCC TCTACAGCGA GGCCCTCCAG CCCGCGGGCG ACGACGGCGC CGACCGATCG ACGACCTGGA TCACCAACTG GCTCGACGAT CGCGACGACA GCCGTCCGTT CTTCCTGTTC TGTAACTTCA TCGAACCCCA CGTCGAGTAC GATCCGCCCC GCGAGTACGC CGAGCGGTTC CTCCCAGACG GCGCGAGCGT CGACGAGGCA CTCGCCATCC GGCAGGACCC CCGCGCCTAC GACTGCGAGG ACTACGGCCT CTCCGAACGG GACTTCGCCC TGCTCCGCGG GCTCTACCGG GCCGAACTCG CCTACGTCGA CGAGCAGCTC GGACGGCTCC GGGCGGCCCT CGAGGACGCC GGCGAGTGGG AGGACACCCT CTTCGTCGTC TGCGGCGACC ACGGCGAGCA CATCGGCGAC CACGGCTTCT TCGGCCACCA GTACAACCTC TACGACACCC TGATCAACGT CCCGCTGGTC TGCCACGGCG GCCCCTTCAC CGACGGCGGC CAGCGCGAGG ACCTCGTCCA GTTGCTCGAC CTCCCCGCCA CGCTGCTCGA GACCGCGGGG ATCGACGATC CCGAACTGCG CGCGCAGTGG TCCAGCCGCT CGTTCCACCC CGCGTCGGAC GACGACCCCC GAGACGCCGT CTTCGCGGAG TACGTCGCCC CCCAGCCCTC GATCGACCGC CTCGAGGCCC GCTTCGACGA ACTTCCCGAC CGCGTCTACG AGTACGACCG TCGCCTCCGG GCCGTCCGGA CGCGCGAGTA CAAGTACGTC CGCGGCGACG ACGGGTACGA CCGGCTCCAC GACGTCGAGA CCGACCCGCT CGAGCGCGAC GACATCGCCG CACGGGAGCC CGAGCAGGTG CGAGCGATGC AGCGGCGCCT CGAGGAGCGG TTCGACCCGC TCGCCGAGGC CGGCGAGAGC GGCGAGGTCG AGATGCGCGA GGGGACCAAG GAGCGACTCG CGGATCTGGG GTATCTCTAA
|
Protein sequence | MAESVDSPDS SDVDAASANG RDPESHSTVR NVVLVVLDTA RATSTGPKTT PNLNRLAADG THFDNAFATA PWTLPSHASM FTGTYPSEHG THGGHTYLDD ELRTLPESFA DSGYETIGVS NNTWITEEFG FDRGFDDLRK GWQYIQSDAD MGAVVRGEDL REKLQATRNR LFDGNPLVNA ANILYSEALQ PAGDDGADRS TTWITNWLDD RDDSRPFFLF CNFIEPHVEY DPPREYAERF LPDGASVDEA LAIRQDPRAY DCEDYGLSER DFALLRGLYR AELAYVDEQL GRLRAALEDA GEWEDTLFVV CGDHGEHIGD HGFFGHQYNL YDTLINVPLV CHGGPFTDGG QREDLVQLLD LPATLLETAG IDDPELRAQW SSRSFHPASD DDPRDAVFAE YVAPQPSIDR LEARFDELPD RVYEYDRRLR AVRTREYKYV RGDDGYDRLH DVETDPLERD DIAAREPEQV RAMQRRLEER FDPLAEAGES GEVEMREGTK ERLADLGYL
|
| |