Gene Hmuk_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1102 
Symbol 
ID8410621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1053580 
End bp1055100 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content67% 
IMG OID645019438 
Productsulfatase 
Protein accessionYP_003176936 
Protein GI257387163 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0852605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.561012 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGTG CGGACGGGAC GAACGTGCTC TTCGTCGTGC TCGATACGGT CCGCAAGGAC 
CACCTCTCGG CGTACGGGTA CGACGAGCCG ACGACTCCGA CCCTGGAGGC GTTCGCCGAG
GAAGCGGCGG TCTTCGAACA CGCCGTTGCG CCGGCCCCCT GGACCCTGCC GGTCCACGCC
TCCCTGTTTA CGGGGCTGTA CCCCTCCGAA CACGAGGCCA CACAGGAGGA CCCCTATCTC
GACGGGGCGA CCACGCTGGC CGAGTCGCTG TCGGCGGCCG GCTACGACAC GGCCTGTTAC
TCCTCGAACG CCTGGATCAC GCCCTACACG AACCTCACCG CGGGGTTCGA CGACCACGAC
AACTTCTTCC AGATCATGCC CAGCGAACTC CTCTCGGGGC CGCTGGCCCG GATCTGGCAG
ACGATGAACG ACAGCGACAC CCTGCGGGGC GTCGCCGATC GGATGGTCCA GCTGGGCAAC
AAGTTCCACG AGTACTTCGC CTCCGAGGGC GGGGGCGACA CGAAGACCCC CGCCGTGATC
GACAAGACGA TGGACTTCAT CGACGACTCG GAGAACTTCT TCACGTTCAT CAACCTGATG
GACGCCCACC TCCCCTACCA CCCGCCCGAG GAGTACGTCG AGCAGTTCGC GCCCGGCGTC
GACTCGGCCG AGGTGTGCCA GAACTCCAAG GAGTTCAACT GCGGCGCTCG CGACATCGAC
GACGCGGAGT GGGACGACAT CGAGGGGCTG TACGACGCCG AGATCCGCCA CATCGACGAC
CAGCTCGACC GGCTCTTTAC CCACCTCAAG GAGACCGGCC AGTGGGACGA GACGATGGTC
GTCGTAGCGG CCGACCACGG CGAACTGCAC GGCGAACACG GCCTCTACGG CCACGAGTTC
TGCATCTACG ATCCGCTGGT GAACGTCCCC TGCATGGTCA AGCACCCCGA GATCGAGCCC
GAACGCGACG ACGAGACGGT CGTCGAACTC GTCGACCTGT ATCACTCCGT GCTGGACGCG
ACCGGCGTCG CGGGCGACGG CGTCTCGCTC GATCCCGCTC GCTCCCTGCT CTCGACGGAG
TATCGCGAGT TCGCCGGGAC GGCCTCGAAC GGTGCGGGCG CACACGGCCC CCGCGGTGAC
GTGGGCTTCG TCGAGTACCA CCAGCCGGTC GTCGAGCTCC GCCAGCTGGA GGGGAAAGCC
AGCGCGGCCG GCATCGCCCT GGAGACGGAC TCGCGGTTCT ACTCTCGGAT GGGCGCGGCC
CGTTCGCCCG AGGGCAAGTA CATCCACTGT ACGCGCATCC CGGACGAGGC GTACCGGATC
GACAGCGACC CCGGCGAGAC CGAGGACCGT GCGGGCAGTG ACGACGAACT GCTGGTCGAC
CTCGAAGCGG AGCTCTCCGA CTTCGTCGAC CGCGTCGACG CCGACTGGCC CGACGAGGCC
GACGGCACCG ACGGCGAGGT GCTGGACTCG ATGGACGACG ACGCGAAGGA TCGCCTGAAA
GACCTGGGCT ACATCGACTG A
 
Protein sequence
MTGADGTNVL FVVLDTVRKD HLSAYGYDEP TTPTLEAFAE EAAVFEHAVA PAPWTLPVHA 
SLFTGLYPSE HEATQEDPYL DGATTLAESL SAAGYDTACY SSNAWITPYT NLTAGFDDHD
NFFQIMPSEL LSGPLARIWQ TMNDSDTLRG VADRMVQLGN KFHEYFASEG GGDTKTPAVI
DKTMDFIDDS ENFFTFINLM DAHLPYHPPE EYVEQFAPGV DSAEVCQNSK EFNCGARDID
DAEWDDIEGL YDAEIRHIDD QLDRLFTHLK ETGQWDETMV VVAADHGELH GEHGLYGHEF
CIYDPLVNVP CMVKHPEIEP ERDDETVVEL VDLYHSVLDA TGVAGDGVSL DPARSLLSTE
YREFAGTASN GAGAHGPRGD VGFVEYHQPV VELRQLEGKA SAAGIALETD SRFYSRMGAA
RSPEGKYIHC TRIPDEAYRI DSDPGETEDR AGSDDELLVD LEAELSDFVD RVDADWPDEA
DGTDGEVLDS MDDDAKDRLK DLGYID