Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0475 |
Symbol | |
ID | 8382742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 472034 |
End bp | 473338 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644971537 |
Product | sulfatase |
Protein accession | YP_003129395 |
Protein GI | 257051562 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.501128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA CCCGCCCAAA CCTCGTCCTC GTCTCGATCG ACAGTCTCAG GGCCGATCAT TGCGGATTCC TGGGTGACGA CCGCGGACTC ACGCCGACGA TGGACGAACT CGCCGACGAT GGCGTCACGT TCGAGACAGC CATCGCACCC GGACCCCAAA CCTTCTCGTC GATGCCGGCC GTGTTTACCA GCCATCATCG GCCGACAGGC GATCTGGAAA CGTATCCTGG CGAGACAAAC TGGAAGCGTC GCCTTGCCGC GATCGACGGG CACCTCCGTC GGCATCCATC GCTTCCCGAG CGGTTGACGG AACTGGGCTA CTCGACGGCC GGTTTCAGCC CCAACCTCTG GGCGTCGGCA GCCTCGGGGT TCGATCGGGG GTTCGATTAC TTTGCCGATC TCGCCGGTGA GCCAGCCGAC AGTAGACTCC ATACGTTGCT GAGCCGGACG CCAGGAGTCG ACGAGACGAG CAAGCCCGTC GAGCTAACGC TTGACATGCT TTCCGGCAAC TCGTTTTTCA CCCGGTGGCA GCAGTTCTAC GACGAACTCG ACGCGGTTCG CCGTCGCCTC TCGGAGCCGT ACTTTCTGTG GGTCTTCCTG CTGGACACCC ACTTTCCGTT CCTGCCCACG CGCACATACC GTCAGGAACA GTCACTGCTT CGAATGTATC TCAACACGGC CCGGACCGAG AAAGTCATGC GGGGACACGC CGAAACAGTG TCGACTCGCG CCCGTGAGTC GATGCAGCGA AGCTATCGGG ACACAGTTCG ATCGGTCGAC GGCTTCCTCC AACGAATCCG CTCGGATCTA GCGTCGGACG ATCCGGTGAT GATAGTCCAC TCGGATCACG GCGAATCGTT CAACGAGCAC GACAACTACG GTCACCATCA TCGGGAACTG TACGAGGAGA ACATACACGT TCCCTACGTC GTCTCGAACG CCGGCACGAC GGGGACGATC ACCGAACCGA CCTCGCTCGC GACCATTCCG GAGGTCGCAC TCACCATCGC CCGCGAAGGG GCTTTCACTC CCGAAACCGT CGCCGATACC GGCGTGGTTT CCCGGTGTGA GTACGGCACA CACCAGGCAG TTCGATATCC ACGGTTCAAG TATGTCGAAC ACGAGGACAA GCAGTCGTTG TTCGACCTCG AGAACGATCA CCGAGAGACA GTCGACGTCT CCGATGAGTA TCCGGGTCGG ATCGCCGATA GCAGGGCGCG GCTCGCCCGG TTGGAGCGTC ACGATCGAGA GACCCACGAG CTCTCGCGTG CCGCCCGGAA TCTCGCCGTG GAGTGCAACC TGTAG
|
Protein sequence | MSETRPNLVL VSIDSLRADH CGFLGDDRGL TPTMDELADD GVTFETAIAP GPQTFSSMPA VFTSHHRPTG DLETYPGETN WKRRLAAIDG HLRRHPSLPE RLTELGYSTA GFSPNLWASA ASGFDRGFDY FADLAGEPAD SRLHTLLSRT PGVDETSKPV ELTLDMLSGN SFFTRWQQFY DELDAVRRRL SEPYFLWVFL LDTHFPFLPT RTYRQEQSLL RMYLNTARTE KVMRGHAETV STRARESMQR SYRDTVRSVD GFLQRIRSDL ASDDPVMIVH SDHGESFNEH DNYGHHHREL YEENIHVPYV VSNAGTTGTI TEPTSLATIP EVALTIAREG AFTPETVADT GVVSRCEYGT HQAVRYPRFK YVEHEDKQSL FDLENDHRET VDVSDEYPGR IADSRARLAR LERHDRETHE LSRAARNLAV ECNL
|
| |