Gene Information |
Locus tag | Hmuk_1102 |
Symbol | |
ID | 8410621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1053580 |
End bp | 1055100 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645019438 |
Product | sulfatase |
Protein accession | YP_003176936 |
Protein GI | 257387163 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0852605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.561012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGGTG CGGACGGGAC GAACGTGCTC TTCGTCGTGC TCGATACGGT CCGCAAGGAC CACCTCTCGG CGTACGGGTA CGACGAGCCG ACGACTCCGA CCCTGGAGGC GTTCGCCGAG GAAGCGGCGG TCTTCGAACA CGCCGTTGCG CCGGCCCCCT GGACCCTGCC GGTCCACGCC TCCCTGTTTA CGGGGCTGTA CCCCTCCGAA CACGAGGCCA CACAGGAGGA CCCCTATCTC GACGGGGCGA CCACGCTGGC CGAGTCGCTG TCGGCGGCCG GCTACGACAC GGCCTGTTAC TCCTCGAACG CCTGGATCAC GCCCTACACG AACCTCACCG CGGGGTTCGA CGACCACGAC AACTTCTTCC AGATCATGCC CAGCGAACTC CTCTCGGGGC CGCTGGCCCG GATCTGGCAG ACGATGAACG ACAGCGACAC CCTGCGGGGC GTCGCCGATC GGATGGTCCA GCTGGGCAAC AAGTTCCACG AGTACTTCGC CTCCGAGGGC GGGGGCGACA CGAAGACCCC CGCCGTGATC GACAAGACGA TGGACTTCAT CGACGACTCG GAGAACTTCT TCACGTTCAT CAACCTGATG GACGCCCACC TCCCCTACCA CCCGCCCGAG GAGTACGTCG AGCAGTTCGC GCCCGGCGTC GACTCGGCCG AGGTGTGCCA GAACTCCAAG GAGTTCAACT GCGGCGCTCG CGACATCGAC GACGCGGAGT GGGACGACAT CGAGGGGCTG TACGACGCCG AGATCCGCCA CATCGACGAC CAGCTCGACC GGCTCTTTAC CCACCTCAAG GAGACCGGCC AGTGGGACGA GACGATGGTC GTCGTAGCGG CCGACCACGG CGAACTGCAC GGCGAACACG GCCTCTACGG CCACGAGTTC TGCATCTACG ATCCGCTGGT GAACGTCCCC TGCATGGTCA AGCACCCCGA GATCGAGCCC GAACGCGACG ACGAGACGGT CGTCGAACTC GTCGACCTGT ATCACTCCGT GCTGGACGCG ACCGGCGTCG CGGGCGACGG CGTCTCGCTC GATCCCGCTC GCTCCCTGCT CTCGACGGAG TATCGCGAGT TCGCCGGGAC GGCCTCGAAC GGTGCGGGCG CACACGGCCC CCGCGGTGAC GTGGGCTTCG TCGAGTACCA CCAGCCGGTC GTCGAGCTCC GCCAGCTGGA GGGGAAAGCC AGCGCGGCCG GCATCGCCCT GGAGACGGAC TCGCGGTTCT ACTCTCGGAT GGGCGCGGCC CGTTCGCCCG AGGGCAAGTA CATCCACTGT ACGCGCATCC CGGACGAGGC GTACCGGATC GACAGCGACC CCGGCGAGAC CGAGGACCGT GCGGGCAGTG ACGACGAACT GCTGGTCGAC CTCGAAGCGG AGCTCTCCGA CTTCGTCGAC CGCGTCGACG CCGACTGGCC CGACGAGGCC GACGGCACCG ACGGCGAGGT GCTGGACTCG ATGGACGACG ACGCGAAGGA TCGCCTGAAA GACCTGGGCT ACATCGACTG A
|
Protein sequence | MTGADGTNVL FVVLDTVRKD HLSAYGYDEP TTPTLEAFAE EAAVFEHAVA PAPWTLPVHA SLFTGLYPSE HEATQEDPYL DGATTLAESL SAAGYDTACY SSNAWITPYT NLTAGFDDHD NFFQIMPSEL LSGPLARIWQ TMNDSDTLRG VADRMVQLGN KFHEYFASEG GGDTKTPAVI DKTMDFIDDS ENFFTFINLM DAHLPYHPPE EYVEQFAPGV DSAEVCQNSK EFNCGARDID DAEWDDIEGL YDAEIRHIDD QLDRLFTHLK ETGQWDETMV VVAADHGELH GEHGLYGHEF CIYDPLVNVP CMVKHPEIEP ERDDETVVEL VDLYHSVLDA TGVAGDGVSL DPARSLLSTE YREFAGTASN GAGAHGPRGD VGFVEYHQPV VELRQLEGKA SAAGIALETD SRFYSRMGAA RSPEGKYIHC TRIPDEAYRI DSDPGETEDR AGSDDELLVD LEAELSDFVD RVDADWPDEA DGTDGEVLDS MDDDAKDRLK DLGYID
|
| |
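The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Hmuk_1102.
start_bp, end_bp = 1053580, 1055100

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1521, matches "Gene Length"
print(protein_length(length_bp))  # 506, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1521 bp) and Protein Length (506 aa) fields. The strand does not affect the calculation, because only the span of the feature is used.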