Gene Hmuk_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1453 
Symbol 
ID8410974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1377273 
End bp1378625 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content45% 
IMG OID645019783 
Productsulfatase 
Protein accessionYP_003177279 
Protein GI257387506 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCA ATTCTAACAT CATACTCATA ACCGTGGATT CACTGCGGTA TGATACTATA 
TTTTCGGACG AAGCGGTATC AGCTCCGACA ATTGAACAAT TAATTGAAGA AGGGCTAACG
TTCAAGCGAG CGTATGCGAA CGGTACCTCA ACGGGAATGT CTTTTCCATC TATTTTTTCA
GGGATGTATC CCTGGGACTA CTATGGGTCT TATTATTCCC CCGATCGTCC ACATCTCATT
GAAGAGTTCA AAAACACGAA TTATGGAACA GCAGCGTTCC ACTCAAACCC CCATCTAAGT
GCCTCATTCG GATATGACCG TGGTTTCGAC ATTTTCTCCG AGGGTTCTGA TTCACCGTCT
ACAGTCTCGA AATTACGTAA GAAGGCTGGG AATAACATAC CGAAAGATAG CATTCTCTAT
GATGTTTTGC GGAAAGCTTG GAAGAATCTA GAAAAAGCTA CTGGAACTGG TGTCGGGACT
CCTTACGTGG ATGGAACCGA ACTCAATGAG TACGTATTCG ACTGGTTAGA ATCCGCTTCA
GCTCCAGTAT TTGGGTGGGT TCACTATATG GACGTACACC ATCCGTATCT GCCCCACAAA
AATACGGTTA GTCACGATAT TGGAGAAAAA GAAGCGATTC GACTTCGGCA AAAATTCATC
CATTCTCCAG ATGACTTGGT GAATAGCGAG ATTGAAATCC TGCGTCAATT GTATCGGGGA
GAAGTGGAAT ACTTTGATCG ATGCCTCAAT TCGCTCCTTT CCAAGGTGCA GTCTGAGCTC
GGCTTCGACA ATACGGTCCT TATCTTGGTG TCCGATCACG GTGAAGCATT TGGGGAAAAT
GATAATTACG GTCACGGGGG CGATGCACTC GGTGACGAAG TCACCCGGGT CCCGCTTATT
ATCCGCGGAC CGAACATAGA GCCTAGCAAA GTAGAGTCAC CCGTGTCGTG CGTTGATATT
TATCCGACAA TTACAGATAT CAGGGGTGAG TCTGCACCGA GTAAATGTCG AGGGAAGTCG
TTACTCGCAG TAGATGAAAA TGAACGACAG GAAAAGGACG AGAAACAATA TGTATTCGCA
CACGCCAACG CGCCGTCAGA TGGACAGGCT ATGATCTGTG ATGGAAGGAG AAAGCTCATT
CACGATCGAG GTGAGAATAC AGATACCTTG TATGACTTAT GTGACGGACA AGAAAAAGAA
CGGATCAACT ATCGGGCGGA CCATGTGAAG GATCTAGTTG ACGAAATCGA AACGCACATA
TCCAGTATAA AAAGCAGCGA TGACTCTGCA ATTCAACCTG ATATAGACGA TGAAATGAAA
AATAAGTTAC GGGATCTAGG GTATATAGAT TAA
 
Protein sequence
MTGNSNIILI TVDSLRYDTI FSDEAVSAPT IEQLIEEGLT FKRAYANGTS TGMSFPSIFS 
GMYPWDYYGS YYSPDRPHLI EEFKNTNYGT AAFHSNPHLS ASFGYDRGFD IFSEGSDSPS
TVSKLRKKAG NNIPKDSILY DVLRKAWKNL EKATGTGVGT PYVDGTELNE YVFDWLESAS
APVFGWVHYM DVHHPYLPHK NTVSHDIGEK EAIRLRQKFI HSPDDLVNSE IEILRQLYRG
EVEYFDRCLN SLLSKVQSEL GFDNTVLILV SDHGEAFGEN DNYGHGGDAL GDEVTRVPLI
IRGPNIEPSK VESPVSCVDI YPTITDIRGE SAPSKCRGKS LLAVDENERQ EKDEKQYVFA
HANAPSDGQA MICDGRRKLI HDRGENTDTL YDLCDGQEKE RINYRADHVK DLVDEIETHI
SSIKSSDDSA IQPDIDDEMK NKLRDLGYID