Gene Hmuk_1626 details

Gene Information

Locus tag: Hmuk_1626
Symbol:
ID: 8411149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1546042
End bp: 1547472
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 69%
IMG OID: 645019953
Product: sulfatase
Protein accession: YP_003177447
Protein GI: 257387674
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
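
The gene length and protein length in this record can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1546042
end_bp = 1547472

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "1431 476"
```

Under those assumptions the result matches the listed Gene Length (1431 bp) and Protein Length (476 aa).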


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.773199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.470838
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTCCG AAACCGGCGT CTCGAACGTC GTCCTCGTCA CGGTCGACTC GCTCCGTGCG 
GACGCCATCG GTCCGTACGA CAGCGACCGC CACACACCCG TCATGGACGA ACTCGCCCAG
CGGGGGACGG TCTTCGAGCG CGCCTTCGCC AACGGAAACT GGACTCCCTT CTCGTTTCCC
TCGATGCTGG CCTCCTCGCC GGTGTTCGCC GACGACGGCC AGATCGGCGT CACGGGCGTC
GAGACGCTCG CGGAGACGCT CGCGGACGCG GGCTTCGACA CCGGCGGTTT CAACGCCGCC
AACGGATTCC TGACCGAACA CTGGGGATAC GACGACGGCT TCGACGAGTT CGACTCCTTC
GTCGCCAGCG TGGGCTCCAG CATCTACAGT CGCTATCTCG CGACCCATCC GACCGTGGAG
GCCTGGCTCC AGCTGGCGAC CTCGCCGTTT CGCCGCGCCC GCTCGTGGCT CTCGTCGGCG
GACAGCCACC GTCCGTTCCT CGACACCTCG CGGATGTTCG ACGTGGAACA GGGGGCGACC
TCGTTCCTCG AATCGGCCTC GGAGCCGTTC TTCCTGTGGA TCCACTACAT GGACGCCCAC
ACGCCGTACG TTCCGGCCCC GCGGTACATC CGGGAGGTGT CTGCCGACCG CTTCGGGACC
CACCGGATGG TCCTGGCGCA CCTCCACGCC GGGCTGGGCT GGGAGGTCGG CGACCGGACG
CTTGGCAACC TCCGGACGCT CTATCAGAGC GCCGTCCGGC AGGTCGACGA CAGCCTCGGA
CGAGTGCTCG ACGCGCTGTC GGAACACGGC CACGACGACG ACACCGCCGT CGTACTGGCC
GGCGACCACG GCGAGGAGTT CCAGGACCAC GGCCACCTGG CACACTACCC GAAGCTGTAC
GACGAGCTGA TCCGCGTGCC GCTGATCGTC GACGTACCGG GCGCAGAGAG TCGGCGCGTC
GAGAGACAGG TCGGTCTCGA CTCGCTCCCA CCGACGGTGG CCGACCTCGC GGGCGTCTCT
CCGCCAGCGG AGTGGCGCGG GGACTCGCTC GCCCCCGCCG TCCTCGACGG CGAGGAACCG
GCCGACGAGC CGGTCGTCTC GGTCACGGTC AGGGGCGAGG ACGTCACCGA CCAGCCCATC
CCGCGGTCGC TCGACGACGG CGACCTCCTC GTCAGCGTCC GCGACCGCGA CTGGAGTTAC
ATCGAGAACG TCGACACCGG CGACCGGGAG CTGTACCACC GGCCCTCGGA CCCCGGCCAA
CAGGACGACC GCTCCGACGG TCCCGACGCC GAGGCCAGGG CCGTGATCGA GGCGTTCGAG
CCCCTCGTCG ACGCCCACGC AGACCTGCTG CACGCGGCAG AGCAAAGCGA GATCGACGAC
GAGATGGACG AGGATCTGGA CGCCAGACTG GAGGCGCTCG GCTACAAGTA G
 
Protein sequence
MHSETGVSNV VLVTVDSLRA DAIGPYDSDR HTPVMDELAQ RGTVFERAFA NGNWTPFSFP 
SMLASSPVFA DDGQIGVTGV ETLAETLADA GFDTGGFNAA NGFLTEHWGY DDGFDEFDSF
VASVGSSIYS RYLATHPTVE AWLQLATSPF RRARSWLSSA DSHRPFLDTS RMFDVEQGAT
SFLESASEPF FLWIHYMDAH TPYVPAPRYI REVSADRFGT HRMVLAHLHA GLGWEVGDRT
LGNLRTLYQS AVRQVDDSLG RVLDALSEHG HDDDTAVVLA GDHGEEFQDH GHLAHYPKLY
DELIRVPLIV DVPGAESRRV ERQVGLDSLP PTVADLAGVS PPAEWRGDSL APAVLDGEEP
ADEPVVSVTV RGEDVTDQPI PRSLDDGDLL VSVRDRDWSY IENVDTGDRE LYHRPSDPGQ
QDDRSDGPDA EARAVIEAFE PLVDAHADLL HAAEQSEIDD EMDEDLDARL EALGYK
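
Both sequences above are displayed in space-separated blocks of ten letters, so the whitespace has to be removed before any downstream use. The following is a minimal sketch of that cleanup with a simple GC-fraction helper; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first and last display lines of the gene sequence are used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so ten-letter display blocks become one string."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGCACTCCG AAACCGGCGT CTCGAACGTC GTCCTCGTCA CGGTCGACTC GCTCCGTGCG"
last_line = "GAGATGGACG AGGATCTGGA CGCCAGACTG GAGGCGCTCG GCTACAAGTA G"

first = join_blocks(first_line)
last = join_blocks(last_line)

print(len(first), len(last))   # bases on a full line and on the final line
print(first[:3], last[-3:])    # first codon and final codon
print(gc_fraction(first))      # GC fraction of the first line only
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare with the GC content listed in the record. The final codon, TAG, is a stop codon in translation table 11, which fits the protein ending one residue before it.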