Gene Hmuk_2755 details

Gene Information

Locus tag: Hmuk_2755
Symbol:
ID: 8412306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2645012
End bp: 2646433
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 67%
IMG OID: 645021100
Product: sulfatase
Protein accession: YP_003178567
Protein GI: 257388794
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
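
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the terminal stop codon; neither convention is stated in this record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2645012, 2646433)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Under those assumptions the coordinates, the 1422 bp gene length, and the 473 aa protein length agree with one another.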


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.8787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.694021
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCC CTAACGTCCT CCTGGTGATA CTGGACAGCG TCAGAGCCAG AAACACCGGT 
CTGCACGGGT ACGGGGACCG GACGACGCCG TTTCTCGACT CGTTCGCCGG CGAGGCCGAG
TGGTACCGGC AGGCACGGGC CCCGAGCATC CACAGCGTCG CCAGCCACGC AAGCCTCTTT
AGCGGGCTCC ACGTGGACCA GCACGGCGTC ACGAAACACG AGTCGCAGCT GGATCCCGAC
GCCACTGTCT GGTCAGACCT GTCCGAACGG GGCTACGAGA CCGGTCTGTT CACGCCCAAC
GTCGTCGTCA CGGAGTCGTC GAACCTCGCG GAGCCGTTCG ACACTGTCGA CGGCCCGCGG
CGGGATCCGA AACACCGTTA CTTCGAGGAC GCCCTCTCGC CGACGGACGT GGAGGGCCAC
CAGACGAACG TCGAGTACCT GCGGCGGTGT GTCGCCAGCG GCAAACCGCT CCGATCGGCG
TTCAACGGCC TGTACTTCAT CTCCAGCAAC AAGGACGCGT ACGATCCAGA GCAGGAGGCG
GGCACCGAAT ACGTCGAGAG CTTCCTGCAA TGGTCGGACG AGCGAGACGG CCCGTGGGCT
GCCTGTCTGA ACTTCATGGA CGCCCACTTC CCGTACGAAC CCGTTCGGGA ATTCCGGTCG
GCGGGGACGG AGGAACTGCT GGACGTACAC GATTCGCTGT CGTCGCCGAT CTCGAAAGAC
GTCGCCGCGT CCGGGGAGTG GTGGAAGCTC CGAGCGATCG AAGCCCTCTA CGACGACTGT
ATCCGGCAGG CCGACGACGC CGTGGCGACG CTGGTCGACG CGCTGCGAGA GCGCGGTGTT
CTCGACGACA CGCTCCTGGT CGTCACCAGC GACCACGGCG ACGGCTTCGG CGAGTGGAGC
CGCGTCGACC CGCGGGTCCG TGCCGCCTAC CACAGCTGGA GCGTCCACGA GGTGCTGACC
CACGTGCCGC TCCTCGTGCG CCGTCCGGGC GGAGAGGACG GCGGAGCGAA CGACTCGCTC
GCCAGTCTCA CCCGCTTCCC GGCCGTCGTC GAAGCGACGC TCGACGGCGA GACGGAGAGT
TTCGCCGTCG ACGACCGCGC GTTCGCCTCG ACCCACCGGC TCGAACGGCC GACGGTCATG
CTACCGGACG CCTGCGAGGA TCCCGAGCGG TACGCCGGAC CGTGGCGGGC GGTCTACGAG
CAGGACGACG ACGGCGTGTA CAAGTACGGG ACCCACGATT CGGCGTCGGC GACGATACGC
GTCAGAGACG CACAGGTGTC CTACCGCGTC GCCGACGACG ACGGCGGTGT CGTCGCGCAG
TCGTACGGCG AGCTGGAAGG GGCCGACGTG AGCGACGGCG AGTCGGACTC GCTGGAGGAC
TCCGTCGAAG ACCAGCTAGA GAGTCTCGGC TACATCCGGT AG
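
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured. A minimal sketch, with hypothetical helper names, of how the length and GC percentage could be recomputed from the text above:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGACTGCCC CTAACGTCCT CCTGGTGATA CTGGACAGCG TCAGAGCCAG AAACACCGGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence to `clean_sequence` and `gc_percent` gives values that can be compared with the Gene Length and GC content fields. How the database rounds the percentage is not stated in this record.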
 
Protein sequence
MTAPNVLLVI LDSVRARNTG LHGYGDRTTP FLDSFAGEAE WYRQARAPSI HSVASHASLF 
SGLHVDQHGV TKHESQLDPD ATVWSDLSER GYETGLFTPN VVVTESSNLA EPFDTVDGPR
RDPKHRYFED ALSPTDVEGH QTNVEYLRRC VASGKPLRSA FNGLYFISSN KDAYDPEQEA
GTEYVESFLQ WSDERDGPWA ACLNFMDAHF PYEPVREFRS AGTEELLDVH DSLSSPISKD
VAASGEWWKL RAIEALYDDC IRQADDAVAT LVDALRERGV LDDTLLVVTS DHGDGFGEWS
RVDPRVRAAY HSWSVHEVLT HVPLLVRRPG GEDGGANDSL ASLTRFPAVV EATLDGETES
FAVDDRAFAS THRLERPTVM LPDACEDPER YAGPWRAVYE QDDDGVYKYG THDSASATIR
VRDAQVSYRV ADDDGGVVAQ SYGELEGADV SDGESDSLED SVEDQLESLG YIR
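
The protein sequence can be checked against the gene sequence by translating the codons. The sketch below builds the codon table by hand instead of using a bioinformatics library. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code and treats the first codon like any other, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSSS"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons containing anything other than
    A, C, G, or T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGACTGCCC CTAACGTCCT CCTGGTGATA"))  # MTAPNVLLVI
```

The output matches the first ten residues of the protein sequence above. The final codons of the gene, `TACATCCGGTAG`, translate to `YIR` followed by a stop, which matches the end of the protein sequence.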