Gene Hmuk_0637 details

Gene Information

Locus tag: Hmuk_0637
Symbol:
ID: 8410144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 597064
End bp: 598437
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 67%
IMG OID: 645018966
Product: sulfatase
Protein accession: YP_003176476
Protein GI: 257386703
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
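
The coordinate and length fields can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene ends with a stop codon that is not counted in the protein length. Both are assumptions, not statements made by the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 597064
end_bp = 598437
listed_gene_length = 1374
listed_protein_length = 457


def gene_length(start, end):
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 1374 457
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.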


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATCGC CGAACGTCTT GCTCGTCGTG CTCGATGCGA CCAGGAAGGA CCACCTCTCG 
TGTTACGGTT ACGACCGGCC GACGACGCCG GAGCTGGACG CGGTGGCCCA GGCCGGGACC
CGCTACGAGC AGGCGATCGC GCCGGCACCG TGGACGCCGC CCTCCCACGC GTCGATGTTC
ACGGGTCGAT ATCCGTCCGG GCACGGGAGT TTCGGCACGC AGCCGCTGGG AGAGTACGAC
GGGGCGACCG TCGCCGAGTT GCTCTCGGCG GCGGGGTACG CGACGTTCGG GTTCAGCAAC
TCCCACCACA CCAGCATCGA GCAGGAGTTC GACCGCGGAT TCGACTACTA CCACGACATC
CTCGCGTTGC CGCGGTTCAT GGACACGATG TACGAGCCCA GCCTGGATTT CCTCCGGTTT
CTCCCGCGGT ACTTCCGGGA CGGCTACGAC ATCTCGGACT TCCAGCGGCG CAAGCTCGAA
ACGCAGGTTC GGCGAGCCGA CGGCCCCTTC TTCGGGTTCA TCAACTTCAA CGCGACACAC
GCGCCCTACA GGCCACCCGA GGAGTTCAGA GCGCCGTTCG AGGAGCGCTT CGACGACTGG
GACGCGGTCG ACGAGACGGC GGCCCGGAAG GTCGGCGACG ACGAGGGATA CGAGTACATC
CTGGGCGACG TGACGATGTC GCCGACCGAG TGGGAGCTCG TGGAGTGCTG GTACGACGCC
GAGATCAGGT ACGTGGACGC GCTGCTGGGC GAGCTGTTCG ACTGTCTGCG ACGGCAGGGG
GTCTACGACG ACACGCTCGT CGTCGTCACC GCGGACCACG GCGAGCACTT CGGCGAGCAC
GGGCTGGCCT ACCACCAGTT TTCGCTGTTC GAGGAGCTGC TCAACGTCCC GCTGCTGGTC
AAGTGGCCCG AGGGCGACCG GCCGTCGCCG GCTCCCGGGA CGGTGTCGGA CCGGCTCGTG
TCGCTGGTCG ATCTCGTCCC GACGATCTGT GAGTGGGCCG GCGTCGCGGT GCCAGACGAG
GTCGACGGCC GGGCACTGAC CGGCGACGAC GACCGCGACG CGGTCTTCGC GGAGTACGAC
CGGCCGTATC CGCCGCTGCG CGAGCGGCTC CAGCAGTACG ACAGCTTCGA AGCCTACGAC
AGGGGCTTGC AGGCGGTCCG GACCGAGACG CACAAGCTGA TCCGACCGAC AGTCGGCGAA
GCGACGCTGT ATCGCCTGAC CGGCGACGGC GAGGTCGAGG TCTCCGACGA CGAGCGCGAG
GCGGCACTGG CCCAGCGTCT GGACGAGACG CTCGAACCGC TCCCCGATAC GAGTCGAACC
ACGGAGCTGG ACGATCACGT CAGCGATCAC CTGGAGAAGA TGGGGTACCT GTGA
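
A GC fraction can be computed from a sequence listed in this 10-base block layout by stripping the whitespace and counting G and C. The following is a minimal sketch; the function name gc_fraction is illustrative and not part of any database tool. It is demonstrated on the first 60 bases of the gene sequence only, so its result is not the whole-gene figure.

```python
def gc_fraction(sequence):
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above (60 bases, 37 of them G or C).
first_line = "ATGGGATCGC CGAACGTCTT GCTCGTCGTG CTCGATGCGA CCAGGAAGGA CCACCTCTCG"
print(round(gc_fraction(first_line), 3))  # prints: 0.617
```

Passing the full gene sequence to the same function gives the whole-gene fraction.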
 
Protein sequence
MGSPNVLLVV LDATRKDHLS CYGYDRPTTP ELDAVAQAGT RYEQAIAPAP WTPPSHASMF 
TGRYPSGHGS FGTQPLGEYD GATVAELLSA AGYATFGFSN SHHTSIEQEF DRGFDYYHDI
LALPRFMDTM YEPSLDFLRF LPRYFRDGYD ISDFQRRKLE TQVRRADGPF FGFINFNATH
APYRPPEEFR APFEERFDDW DAVDETAARK VGDDEGYEYI LGDVTMSPTE WELVECWYDA
EIRYVDALLG ELFDCLRRQG VYDDTLVVVT ADHGEHFGEH GLAYHQFSLF EELLNVPLLV
KWPEGDRPSP APGTVSDRLV SLVDLVPTIC EWAGVAVPDE VDGRALTGDD DRDAVFAEYD
RPYPPLRERL QQYDSFEAYD RGLQAVRTET HKLIRPTVGE ATLYRLTGDG EVEVSDDERE
AALAQRLDET LEPLPDTSRT TELDDHVSDH LEKMGYL
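
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This sketch assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). It does not handle alternative start codons or ambiguous bases, and the names used are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna):
    """Translate in-frame DNA, ignoring whitespace and stopping at a stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGGATCGC CGAACGTCTT GCTCGTCGTG"))  # prints: MGSPNVLLVV
# Last line of the gene sequence (in frame, ends with the TGA stop codon).
print(translate("ACGGAGCTGG ACGATCACGT CAGCGATCAC CTGGAGAAGA TGGGGTACCT GTGA"))
# prints: TELDDHVSDHLEKMGYL
```

Both fragments match the corresponding start and end of the protein sequence listed above.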