Gene Hmuk_1436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1436 
Symbol 
ID8410956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1360909 
End bp1362354 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content60% 
IMG OID645019767 
Productsulfatase 
Protein accessionYP_003177264 
Protein GI257387491 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.314748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAATG TATTCGTAAT TTCTTTTGAT GCGTTGCGAT ACGATCACGT TTCGGCCACG 
AGAAACGGGA CACAGACGAC CCCGTTCCTC GACTCTATCA AGGAAGATGC GATCGAGTTC
ACACAGCACA TCTCGACGGG TTCCGGGACA TCAACGTCGT TTCCCGGTAT TCACGCGAGC
AGTCTCCCGT TAGATCACGG ATACGCCGGA CTGAACGAGA ACCACGTCAC CCTCGCCGAG
GTTCTCTCCG ATGCATCGAT CCAGACGGTT GGTGTCACTG CCCAGACTTC CTGTTCCAGT
ATTTATGACT ATGACCGTGG GTTCGAGGTG TTCGAGGACT GGGTCGACGA CGACCAACAG
ACCGGCGAGA GTTTTCCGCG CCGCCTGAAA TCGACGCTGG TCGATGGGAT AGAGAACACA
CCGGTTCTCT CCCCGATCGC CTCGGAACTC AAATTGCAGT ACGACGGGCT CAAAGACGTG
TACGACACGC CAGCGTGTCC GTACCCCCGT GCCGAAGACG TGACCGATAC GACCATCTCG
TTAGTTGATC AGCACGTCGA TACGACGGCG GACGCCTTCG TCTGGACCCA CTACATGGAG
CCCCACGCGC CGTACTATCC GCCCAGAGAA GATATCGAGC GGTTCCACGA CGGCCGCTAC
GATGTCGGAC GCATTCGACG GGTCGTCCGC AAGGCCAGGC GCGCGCGGCC CGACATTATC
GACGGTTCGA TGATTGAGGC TGTCTCCGAG ACCGAAATCG AGGCCCTCCG GGATTTCTAC
GCGGCGGCAA CACGGTACGT CGACCGCGAG GCCAAACGGT TGGTCGACGA GCTCGACGCC
AGAGGACTGC TCGAAGACAG CGTTGTACTG TTCACTGCGG ACCACGGCGA AGAGCTGTTC
GACCGGGGGA CGCTCGGCCA CCGGACGAAA ATGTACGACG AGCTGATACG AGTGCCGCTT
TTGCTCTACG ACAACTCCGG CCGGTACGCC GGTGAGACTT CCATCGACGC AGTGAGGAGT
CACGTCGACA TCGCGCCAAC GATCGCCGAC TGGTACGGCG TCGACCCCCC GGCGGAATGG
CGGGGCGTTT CGCTGCTCGA ACCCCTTCGT GATGAGACGG GACAGATAGA CCGCGACTAC
GCAATTGCCG AACTGTGTCA CACGCAGGGG CTCGGAGGAG ACGTGACGCT AGAGACGCTG
GTCGCCGCTG TTCGGTCGAA GCGCTGGAAA TACATCCGGA ATCGCCAGCT CGACACAGAG
CACCTGTACG ATCTCCGGAC GGATCCCGAC GAGCAGCACA ACATCGCCGC CGACCACGGC
GATATCGTCG CAGAACTGAG CACGGTTCTG AACGATCGAC TGGAGGGCGT CTCCGACACC
GCACGAGATG TCGACCTGTC CAGCGATGTC GAGAAACAGC TCCGAGAGCT CGGCTACGTA
GAGTAA
 
Protein sequence
MSNVFVISFD ALRYDHVSAT RNGTQTTPFL DSIKEDAIEF TQHISTGSGT STSFPGIHAS 
SLPLDHGYAG LNENHVTLAE VLSDASIQTV GVTAQTSCSS IYDYDRGFEV FEDWVDDDQQ
TGESFPRRLK STLVDGIENT PVLSPIASEL KLQYDGLKDV YDTPACPYPR AEDVTDTTIS
LVDQHVDTTA DAFVWTHYME PHAPYYPPRE DIERFHDGRY DVGRIRRVVR KARRARPDII
DGSMIEAVSE TEIEALRDFY AAATRYVDRE AKRLVDELDA RGLLEDSVVL FTADHGEELF
DRGTLGHRTK MYDELIRVPL LLYDNSGRYA GETSIDAVRS HVDIAPTIAD WYGVDPPAEW
RGVSLLEPLR DETGQIDRDY AIAELCHTQG LGGDVTLETL VAAVRSKRWK YIRNRQLDTE
HLYDLRTDPD EQHNIAADHG DIVAELSTVL NDRLEGVSDT ARDVDLSSDV EKQLRELGYV
E