Gene Hmuk_1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1210 
Symbol 
ID8410730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1153328 
End bp1154764 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content41% 
IMG OID645019545 
Productsulfatase 
Protein accessionYP_003177042 
Protein GI257387269 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.200453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACA TATTGTTAAT TGTGATGGAC AGTGTACGAG CGAAGAATAC CTCATTACAC 
GGATATAAGA GAGAGACAAC TCCTTATCTT AAACAATTCG CTAATGGAGC AACTACCTAT
CAAGAGGCTA AAGCTCCTGG AGTGGATAGT GCGACATCTC ATACATCCAT ATTTACTGGT
TACGATGTCC CACAACATCA AGTTGTATCG GGTGAAGATA CACTCAAAAC GGGCCATTCC
GCGTGGGAAT ATCTTACGAA GGATGGCTTT GAGACTGCGC TTTTTACGCC AAACCCATTT
TTTTCCGATA GAGATATCGG TCTTGGAGAA GGGTTCGATA AGAAGTTCTT CGGAAAAGAC
TACGACTTGC CGTTTAGGCG GGGACTTAAT CCAAGGAAAT ACACAAATAA AGGTGACGAG
AGATGGATAG ATTTCCTATA TGATTCTATA AGAGAGATTC GGCCACTATC TTCGATAGGT
AACGGATTAG TGATGAAAGC AGATGAAATT TCCCCACTTC TTATTGAACC GCACTATAAT
GATCGGCCAG CGAGTATCTT TGCATCTGAA TTTACTAAAT GGGTTAAGGA CCAATCGGAA
CCGTGGGCCG CTTGCATAAA CTTCATGGAT GCACATACAC CGTTTTTCCC AAAAAGAAAA
TTTGATAATT GGTCTGAAAG ATACCACTGG AAGATTCAAG ACAAGATAAA TCCGCCATGG
GATTTTATTT CTAAGGAAGA ATCCTCCTGG AAATTTGAAG CACTAACTCC CCTCTATGAT
GCCAGCATAA GTCAGATAGA CAGCTCAATT GGAAGAATAA TAAATCATCT TAAAAAGGAG
GGAAAGTATG ATGATACCCT TATTATCGTG ACTTCCGATC ATGGGGAGGC ATTCAACGAT
CAAAGCCGAG TCAAACCCGA GCTTAACCTA GTTCACCATA GTATCGGTAT TCATGAGAGT
TTACTTCACG TTCCACTTGT TATTAAATAC CCACACCAAA ATGAGGAAAG ACACGTACAC
AAGCCAGTAT CATTGACTGA CTTATATAAA TTAATTAAAT ATCCCGGAGA CGATATTGGG
ATGGTTGATA GGCTACTAGA TGAATCACCA ATAATTTCAG CGATGCCTAG CTATCAGACT
CGGTCTGATG GTATGATTGA CAGGCTAGAA ACACACGCAC CAAACGCATC AACTTATAAA
GGTTGGGCAT ACGCAATCTA TGAGCAAAAT GGTGATGGAA TTAGAAAACA TATCCAATGG
GAAAACAGAT CTGCTCTTGT AAAAATAGAT GGCACGTATT CTTTTCGTGA TGGACATCCA
GACAAACAGA TTCTCGAACA TGCTACCAAA CTTCAAGATG CTGATGTGGC GAGAAAACAA
ACTAACGAAG TTACTGCGGA CATCGAATCA CGTTTAACAA ACCTTGGATA CAGATAA
 
Protein sequence
MTNILLIVMD SVRAKNTSLH GYKRETTPYL KQFANGATTY QEAKAPGVDS ATSHTSIFTG 
YDVPQHQVVS GEDTLKTGHS AWEYLTKDGF ETALFTPNPF FSDRDIGLGE GFDKKFFGKD
YDLPFRRGLN PRKYTNKGDE RWIDFLYDSI REIRPLSSIG NGLVMKADEI SPLLIEPHYN
DRPASIFASE FTKWVKDQSE PWAACINFMD AHTPFFPKRK FDNWSERYHW KIQDKINPPW
DFISKEESSW KFEALTPLYD ASISQIDSSI GRIINHLKKE GKYDDTLIIV TSDHGEAFND
QSRVKPELNL VHHSIGIHES LLHVPLVIKY PHQNEERHVH KPVSLTDLYK LIKYPGDDIG
MVDRLLDESP IISAMPSYQT RSDGMIDRLE THAPNASTYK GWAYAIYEQN GDGIRKHIQW
ENRSALVKID GTYSFRDGHP DKQILEHATK LQDADVARKQ TNEVTADIES RLTNLGYR