Gene Information |
Locus tag | Hmuk_2755 |
Symbol | |
ID | 8412306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2645012 |
End bp | 2646433 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645021100 |
Product | sulfatase |
Protein accession | YP_003178567 |
Protein GI | 257388794 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.8787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.694021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCCC CTAACGTCCT CCTGGTGATA CTGGACAGCG TCAGAGCCAG AAACACCGGT CTGCACGGGT ACGGGGACCG GACGACGCCG TTTCTCGACT CGTTCGCCGG CGAGGCCGAG TGGTACCGGC AGGCACGGGC CCCGAGCATC CACAGCGTCG CCAGCCACGC AAGCCTCTTT AGCGGGCTCC ACGTGGACCA GCACGGCGTC ACGAAACACG AGTCGCAGCT GGATCCCGAC GCCACTGTCT GGTCAGACCT GTCCGAACGG GGCTACGAGA CCGGTCTGTT CACGCCCAAC GTCGTCGTCA CGGAGTCGTC GAACCTCGCG GAGCCGTTCG ACACTGTCGA CGGCCCGCGG CGGGATCCGA AACACCGTTA CTTCGAGGAC GCCCTCTCGC CGACGGACGT GGAGGGCCAC CAGACGAACG TCGAGTACCT GCGGCGGTGT GTCGCCAGCG GCAAACCGCT CCGATCGGCG TTCAACGGCC TGTACTTCAT CTCCAGCAAC AAGGACGCGT ACGATCCAGA GCAGGAGGCG GGCACCGAAT ACGTCGAGAG CTTCCTGCAA TGGTCGGACG AGCGAGACGG CCCGTGGGCT GCCTGTCTGA ACTTCATGGA CGCCCACTTC CCGTACGAAC CCGTTCGGGA ATTCCGGTCG GCGGGGACGG AGGAACTGCT GGACGTACAC GATTCGCTGT CGTCGCCGAT CTCGAAAGAC GTCGCCGCGT CCGGGGAGTG GTGGAAGCTC CGAGCGATCG AAGCCCTCTA CGACGACTGT ATCCGGCAGG CCGACGACGC CGTGGCGACG CTGGTCGACG CGCTGCGAGA GCGCGGTGTT CTCGACGACA CGCTCCTGGT CGTCACCAGC GACCACGGCG ACGGCTTCGG CGAGTGGAGC CGCGTCGACC CGCGGGTCCG TGCCGCCTAC CACAGCTGGA GCGTCCACGA GGTGCTGACC CACGTGCCGC TCCTCGTGCG CCGTCCGGGC GGAGAGGACG GCGGAGCGAA CGACTCGCTC GCCAGTCTCA CCCGCTTCCC GGCCGTCGTC GAAGCGACGC TCGACGGCGA GACGGAGAGT TTCGCCGTCG ACGACCGCGC GTTCGCCTCG ACCCACCGGC TCGAACGGCC GACGGTCATG CTACCGGACG CCTGCGAGGA TCCCGAGCGG TACGCCGGAC CGTGGCGGGC GGTCTACGAG CAGGACGACG ACGGCGTGTA CAAGTACGGG ACCCACGATT CGGCGTCGGC GACGATACGC GTCAGAGACG CACAGGTGTC CTACCGCGTC GCCGACGACG ACGGCGGTGT CGTCGCGCAG TCGTACGGCG AGCTGGAAGG GGCCGACGTG AGCGACGGCG AGTCGGACTC GCTGGAGGAC TCCGTCGAAG ACCAGCTAGA GAGTCTCGGC TACATCCGGT AG
Protein sequence | MTAPNVLLVI LDSVRARNTG LHGYGDRTTP FLDSFAGEAE WYRQARAPSI HSVASHASLF SGLHVDQHGV TKHESQLDPD ATVWSDLSER GYETGLFTPN VVVTESSNLA EPFDTVDGPR RDPKHRYFED ALSPTDVEGH QTNVEYLRRC VASGKPLRSA FNGLYFISSN KDAYDPEQEA GTEYVESFLQ WSDERDGPWA ACLNFMDAHF PYEPVREFRS AGTEELLDVH DSLSSPISKD VAASGEWWKL RAIEALYDDC IRQADDAVAT LVDALRERGV LDDTLLVVTS DHGDGFGEWS RVDPRVRAAY HSWSVHEVLT HVPLLVRRPG GEDGGANDSL ASLTRFPAVV EATLDGETES FAVDDRAFAS THRLERPTVM LPDACEDPER YAGPWRAVYE QDDDGVYKYG THDSASATIR VRDAQVSYRV ADDDGGVVAQ SYGELEGADV SDGESDSLED SVEDQLESLG YIR
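
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are inclusive 1-based positions and that the CDS ends with a single stop codon that is not counted in the protein length (the gene sequence above ends in TAG). The variable names and the `record_is_consistent` helper are hypothetical.

```python
# Values copied from the Gene Information table for Hmuk_2755.
start_bp = 2645012
end_bp = 2646433
gene_length_bp = 1422
protein_length_aa = 473

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon is not translated into an amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1


def record_is_consistent():
    """Return True if the listed lengths agree with the coordinates."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, expected_protein_aa, record_is_consistent())
# prints: 1422 473 True
```

Under these assumptions the listed gene length (1422 bp) and protein length (473 aa) agree with the start and end coordinates.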