Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1173 |
Symbol | |
ID | 8410693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1115513 |
End bp | 1117159 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645019509 |
Product | O-sialoglycoprotein endopeptidase/protein kinase |
Protein accession | YP_003177006 |
Protein GI | 257387233 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.3624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTTC TCGGCATCGA AGGCACAGCC TGGGCAGCCA GTGCTGCGAT CTTCGAGGCG GACGAAAGTG AGCTTCGAGA TCCCTCGGCG GCCGCCAGTG GCGACCACGT CTTCATCGAG ACCGACGCCT ACCAGCCCGA CAGCGGCGGC ATCCACCCCC GCGAGGCCGC CGAACACATG GGCGAGGCGA TCCCCAAGGT CGTCGAGCGC GCGCTCGACC ACGCCCGCGA GCGAGCGCCC GACACGGAGA CGGGCCCGCC CATCGACGCC GTCGCCTTCT CGCGTGGCCC CGGCCTGGGT CCGTGTCTGC GCATCGTCGG GACGGCGGCC CGAGCCGTCG CCCAGCGGTT CGACGTGGCG CTGGTCGGCG TCAACCACAT GGTCGCACAC CTCGAAGTCG GACGCTACTT CTCGGGGTTT TCCTCGCCCA TCTGCCTGAA CGCCTCGGGT GCCAACGCGC ACGTCCTCGG GTACCGCTCG GGACGGTACC GCGTGCTCGG CGAGACGATG GACACCGGCG TCGGCAACGC CATCGACAAG TTCACGCGCC ACGTCGGCTG GTCTCACCCC GGCGGTCCCA AGGTCGAAGA TCACGCGACG CGCGGGACGT ACGTCGATCT CCCCTACGTC GTCAAGGGGA TGGACTTCTC GTTCTCGGGA ATCATGTCCG CCGCCAAGCA GGCCACCGAC CGTGGTACGC CGGTCGAGGA CGTCTGTCGC GGGCTCGAAG AGACGATCTT CGCGATGCTG ACCGAAGTCG CCGAGCGCGC CCTCTCGCTG ACTGACGCCG ACGAACTCGT CCTCGGGGGC GGCGTCGGCC AGAACGAGCG CCTCCGATCG ATGCTCGCGG AGATGTGTAC GCAGCGCGGT GCGGAGTTCT ACGCGCCGGA ACCGCGCTTT CTCCGGGACA ACGCCGGGAT GATCGCGATT CTGGGCGCGC GGATGTACGC GGCCGGCGAC ACGCTGTCGA TACCCGACTC CGGCATCGAC TCGGACTTCC GGCCCGATCA GGTCGAGGTG ACCTGGGATG CGGGCGAACC GGTCGCTCGC GTCGGCGGCG ACGCCGACGA GATTCAGGGG GCCGAGGCGC TCGTCCGCTT CGAGGGCGAC CGCGTGATCA AGGAGCGCCG TCCCCGCTCG TATCGCCACC CGAAGCTGGA CGAACGGCTG CGTTCGGAGC GCACCAGACA GGAGGCCCGC CTCACCAGCG AGGCCCGCCG GCACGGCGTT CCGACGCCAG TGATACACGA CGTCGACCCA CGGGACGCCC GGATCGTCTT CCAGCGAGTC GGCGATCGAG AACTGCGCGA CGGGCTGACC GAGGAGCGGG TGCGGGCGGT CGGGCGACAG CTCGCTGCGA TCCACGACGC CGGCTTCGTC CACGGCGATC CGACGACGCG GAACGTCCGC GTCGGGGCGG GCGATCCGGG CGTCTTCCTC ATCGACTTCG GACTGGGCTA CTACACGCGA GACACCGAGG ACCACGCGAT GGACCTGCAC GTCCTCGATC AGTCGCTGGC CGGCACCACT GACGACGCCG AGCGACTCCG TGCCGCCGTG GCCGACGCCT ATCGGACCGC GAGCGAACGC GACGACACCG TCCTCGACCG TCTCGACGCG ATCGAGGATC GCGGCCGATA CCAGTGA
|
Protein sequence | MRVLGIEGTA WAASAAIFEA DESELRDPSA AASGDHVFIE TDAYQPDSGG IHPREAAEHM GEAIPKVVER ALDHARERAP DTETGPPIDA VAFSRGPGLG PCLRIVGTAA RAVAQRFDVA LVGVNHMVAH LEVGRYFSGF SSPICLNASG ANAHVLGYRS GRYRVLGETM DTGVGNAIDK FTRHVGWSHP GGPKVEDHAT RGTYVDLPYV VKGMDFSFSG IMSAAKQATD RGTPVEDVCR GLEETIFAML TEVAERALSL TDADELVLGG GVGQNERLRS MLAEMCTQRG AEFYAPEPRF LRDNAGMIAI LGARMYAAGD TLSIPDSGID SDFRPDQVEV TWDAGEPVAR VGGDADEIQG AEALVRFEGD RVIKERRPRS YRHPKLDERL RSERTRQEAR LTSEARRHGV PTPVIHDVDP RDARIVFQRV GDRELRDGLT EERVRAVGRQ LAAIHDAGFV HGDPTTRNVR VGAGDPGVFL IDFGLGYYTR DTEDHAMDLH VLDQSLAGTT DDAERLRAAV ADAYRTASER DDTVLDRLDA IEDRGRYQ
|
| |