Gene Hmuk_1173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1173 
Symbol 
ID8410693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1115513 
End bp1117159 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content70% 
IMG OID645019509 
ProductO-sialoglycoprotein endopeptidase/protein kinase 
Protein accessionYP_003177006 
Protein GI257387233 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.3624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTTC TCGGCATCGA AGGCACAGCC TGGGCAGCCA GTGCTGCGAT CTTCGAGGCG 
GACGAAAGTG AGCTTCGAGA TCCCTCGGCG GCCGCCAGTG GCGACCACGT CTTCATCGAG
ACCGACGCCT ACCAGCCCGA CAGCGGCGGC ATCCACCCCC GCGAGGCCGC CGAACACATG
GGCGAGGCGA TCCCCAAGGT CGTCGAGCGC GCGCTCGACC ACGCCCGCGA GCGAGCGCCC
GACACGGAGA CGGGCCCGCC CATCGACGCC GTCGCCTTCT CGCGTGGCCC CGGCCTGGGT
CCGTGTCTGC GCATCGTCGG GACGGCGGCC CGAGCCGTCG CCCAGCGGTT CGACGTGGCG
CTGGTCGGCG TCAACCACAT GGTCGCACAC CTCGAAGTCG GACGCTACTT CTCGGGGTTT
TCCTCGCCCA TCTGCCTGAA CGCCTCGGGT GCCAACGCGC ACGTCCTCGG GTACCGCTCG
GGACGGTACC GCGTGCTCGG CGAGACGATG GACACCGGCG TCGGCAACGC CATCGACAAG
TTCACGCGCC ACGTCGGCTG GTCTCACCCC GGCGGTCCCA AGGTCGAAGA TCACGCGACG
CGCGGGACGT ACGTCGATCT CCCCTACGTC GTCAAGGGGA TGGACTTCTC GTTCTCGGGA
ATCATGTCCG CCGCCAAGCA GGCCACCGAC CGTGGTACGC CGGTCGAGGA CGTCTGTCGC
GGGCTCGAAG AGACGATCTT CGCGATGCTG ACCGAAGTCG CCGAGCGCGC CCTCTCGCTG
ACTGACGCCG ACGAACTCGT CCTCGGGGGC GGCGTCGGCC AGAACGAGCG CCTCCGATCG
ATGCTCGCGG AGATGTGTAC GCAGCGCGGT GCGGAGTTCT ACGCGCCGGA ACCGCGCTTT
CTCCGGGACA ACGCCGGGAT GATCGCGATT CTGGGCGCGC GGATGTACGC GGCCGGCGAC
ACGCTGTCGA TACCCGACTC CGGCATCGAC TCGGACTTCC GGCCCGATCA GGTCGAGGTG
ACCTGGGATG CGGGCGAACC GGTCGCTCGC GTCGGCGGCG ACGCCGACGA GATTCAGGGG
GCCGAGGCGC TCGTCCGCTT CGAGGGCGAC CGCGTGATCA AGGAGCGCCG TCCCCGCTCG
TATCGCCACC CGAAGCTGGA CGAACGGCTG CGTTCGGAGC GCACCAGACA GGAGGCCCGC
CTCACCAGCG AGGCCCGCCG GCACGGCGTT CCGACGCCAG TGATACACGA CGTCGACCCA
CGGGACGCCC GGATCGTCTT CCAGCGAGTC GGCGATCGAG AACTGCGCGA CGGGCTGACC
GAGGAGCGGG TGCGGGCGGT CGGGCGACAG CTCGCTGCGA TCCACGACGC CGGCTTCGTC
CACGGCGATC CGACGACGCG GAACGTCCGC GTCGGGGCGG GCGATCCGGG CGTCTTCCTC
ATCGACTTCG GACTGGGCTA CTACACGCGA GACACCGAGG ACCACGCGAT GGACCTGCAC
GTCCTCGATC AGTCGCTGGC CGGCACCACT GACGACGCCG AGCGACTCCG TGCCGCCGTG
GCCGACGCCT ATCGGACCGC GAGCGAACGC GACGACACCG TCCTCGACCG TCTCGACGCG
ATCGAGGATC GCGGCCGATA CCAGTGA
 
Protein sequence
MRVLGIEGTA WAASAAIFEA DESELRDPSA AASGDHVFIE TDAYQPDSGG IHPREAAEHM 
GEAIPKVVER ALDHARERAP DTETGPPIDA VAFSRGPGLG PCLRIVGTAA RAVAQRFDVA
LVGVNHMVAH LEVGRYFSGF SSPICLNASG ANAHVLGYRS GRYRVLGETM DTGVGNAIDK
FTRHVGWSHP GGPKVEDHAT RGTYVDLPYV VKGMDFSFSG IMSAAKQATD RGTPVEDVCR
GLEETIFAML TEVAERALSL TDADELVLGG GVGQNERLRS MLAEMCTQRG AEFYAPEPRF
LRDNAGMIAI LGARMYAAGD TLSIPDSGID SDFRPDQVEV TWDAGEPVAR VGGDADEIQG
AEALVRFEGD RVIKERRPRS YRHPKLDERL RSERTRQEAR LTSEARRHGV PTPVIHDVDP
RDARIVFQRV GDRELRDGLT EERVRAVGRQ LAAIHDAGFV HGDPTTRNVR VGAGDPGVFL
IDFGLGYYTR DTEDHAMDLH VLDQSLAGTT DDAERLRAAV ADAYRTASER DDTVLDRLDA
IEDRGRYQ