Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1436 |
Symbol | |
ID | 8410956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1360909 |
End bp | 1362354 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645019767 |
Product | sulfatase |
Protein accession | YP_003177264 |
Protein GI | 257387491 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.314748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAATG TATTCGTAAT TTCTTTTGAT GCGTTGCGAT ACGATCACGT TTCGGCCACG AGAAACGGGA CACAGACGAC CCCGTTCCTC GACTCTATCA AGGAAGATGC GATCGAGTTC ACACAGCACA TCTCGACGGG TTCCGGGACA TCAACGTCGT TTCCCGGTAT TCACGCGAGC AGTCTCCCGT TAGATCACGG ATACGCCGGA CTGAACGAGA ACCACGTCAC CCTCGCCGAG GTTCTCTCCG ATGCATCGAT CCAGACGGTT GGTGTCACTG CCCAGACTTC CTGTTCCAGT ATTTATGACT ATGACCGTGG GTTCGAGGTG TTCGAGGACT GGGTCGACGA CGACCAACAG ACCGGCGAGA GTTTTCCGCG CCGCCTGAAA TCGACGCTGG TCGATGGGAT AGAGAACACA CCGGTTCTCT CCCCGATCGC CTCGGAACTC AAATTGCAGT ACGACGGGCT CAAAGACGTG TACGACACGC CAGCGTGTCC GTACCCCCGT GCCGAAGACG TGACCGATAC GACCATCTCG TTAGTTGATC AGCACGTCGA TACGACGGCG GACGCCTTCG TCTGGACCCA CTACATGGAG CCCCACGCGC CGTACTATCC GCCCAGAGAA GATATCGAGC GGTTCCACGA CGGCCGCTAC GATGTCGGAC GCATTCGACG GGTCGTCCGC AAGGCCAGGC GCGCGCGGCC CGACATTATC GACGGTTCGA TGATTGAGGC TGTCTCCGAG ACCGAAATCG AGGCCCTCCG GGATTTCTAC GCGGCGGCAA CACGGTACGT CGACCGCGAG GCCAAACGGT TGGTCGACGA GCTCGACGCC AGAGGACTGC TCGAAGACAG CGTTGTACTG TTCACTGCGG ACCACGGCGA AGAGCTGTTC GACCGGGGGA CGCTCGGCCA CCGGACGAAA ATGTACGACG AGCTGATACG AGTGCCGCTT TTGCTCTACG ACAACTCCGG CCGGTACGCC GGTGAGACTT CCATCGACGC AGTGAGGAGT CACGTCGACA TCGCGCCAAC GATCGCCGAC TGGTACGGCG TCGACCCCCC GGCGGAATGG CGGGGCGTTT CGCTGCTCGA ACCCCTTCGT GATGAGACGG GACAGATAGA CCGCGACTAC GCAATTGCCG AACTGTGTCA CACGCAGGGG CTCGGAGGAG ACGTGACGCT AGAGACGCTG GTCGCCGCTG TTCGGTCGAA GCGCTGGAAA TACATCCGGA ATCGCCAGCT CGACACAGAG CACCTGTACG ATCTCCGGAC GGATCCCGAC GAGCAGCACA ACATCGCCGC CGACCACGGC GATATCGTCG CAGAACTGAG CACGGTTCTG AACGATCGAC TGGAGGGCGT CTCCGACACC GCACGAGATG TCGACCTGTC CAGCGATGTC GAGAAACAGC TCCGAGAGCT CGGCTACGTA GAGTAA
|
Protein sequence | MSNVFVISFD ALRYDHVSAT RNGTQTTPFL DSIKEDAIEF TQHISTGSGT STSFPGIHAS SLPLDHGYAG LNENHVTLAE VLSDASIQTV GVTAQTSCSS IYDYDRGFEV FEDWVDDDQQ TGESFPRRLK STLVDGIENT PVLSPIASEL KLQYDGLKDV YDTPACPYPR AEDVTDTTIS LVDQHVDTTA DAFVWTHYME PHAPYYPPRE DIERFHDGRY DVGRIRRVVR KARRARPDII DGSMIEAVSE TEIEALRDFY AAATRYVDRE AKRLVDELDA RGLLEDSVVL FTADHGEELF DRGTLGHRTK MYDELIRVPL LLYDNSGRYA GETSIDAVRS HVDIAPTIAD WYGVDPPAEW RGVSLLEPLR DETGQIDRDY AIAELCHTQG LGGDVTLETL VAAVRSKRWK YIRNRQLDTE HLYDLRTDPD EQHNIAADHG DIVAELSTVL NDRLEGVSDT ARDVDLSSDV EKQLRELGYV E
|
| |