Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2844 |
Symbol | |
ID | 8412395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2728853 |
End bp | 2730772 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645021189 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003178656 |
Protein GI | 257388883 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0124345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTCG AGAGTAAACT TAGAGCCAGT GATAAGTATG TGGTGTCTGA TTTACAGGAC GCCATGACAC ATGGTACACA GAGCTGGCGA CGCAGAGGAG TATTGAAATC TATCGGCGCA CTCGGGGCAC TCGCCGGTGT CGGCGTGACG GGCGCGACTC CCGGACGGAG CCCCGGGCCG AAGCCGGACG AACTGATCGT CGGCGCGAAG CGGGGCGTGA GCACTGCCGA TGTCGAGTCG GAAGTCTCGG CCGCGACGAC GGCGAACACG TCGGTGGTCC ACCGAAACGA GGCGCTTGGC TATCTCGCCG TCGAGCTTCC GGAGGTGAGC ACACAGTCCG AACGCGAGTC GGTCCGACAG CAGTTCGAGA GTCAGCCGAA CGTAGCCTAC GTGGAAGACA ACGTCACCTA CGAAACCCAG CTGACGCCCA ACGATCCGCA GTTCGGTGAC CAGTACGCGC CCCAGCAGGT CAACGCCGAG GCGGCCTGGG ACACGACGCT TGGCAGTACG GACGTGACCG TCGCCATCGT CGACACCGGC GCACAGTACG AACACCCGGA CCTGACGAAC CTGTTCGGGA GCAATCCCGG CCGGGACTTC GTCGACGGCG ACGGCGACCC CGCCCCGGGA TCCGCCGGTG AATCCCACGG GACCCACGTC AGCGGGTGTG CGTCGGCAGA CACCGACAAC GGCGTCGGCG TCGCGGGCGT GAGCGACTCG CGACTGTTGA GCGCCCGAGC ACTCGGTGGC GGCGGCGGCG GCGCGCTGTC GGACATCGCC GACGCGGTTC GATGGGCGAC CGACCAGGGT GTGGACATCA TCAACATGTC GCTGGGCGGG GGCGGCTACA CCCAGACGAT GAAGCGAGCG GTGGAGTACG CCTACGACCA GAACGACGTG CTGGTTGTCT GTGCGGCGGG CAACGACGGC GGTTCCGTCT CGTATCCCGC GGCCTACGAC GAGTGTGTCG CCGTCTCGGC GCTGGACCCG AACGAAGAAC TCGCGAACTT CTCGAACCGC GGGCCAGAGA TAGAGGTTGC CGCACCGGGA GTCAACGTCC TCTCGACGGT TCCATACGAC GGGTACGACT CCTTCTCCGG GACCTCGATG GCCTCTCCCG TTGCCGCGGG GGTCGCGGCA CTGGGGAAGG CCGCCGAGCC GGGCCTGTCG GCGAGTCAGC TCCGCGAACG GCTCAAGTCG ACGGCCGACG GCGTCGGACT ACCCGGCGAC CAGCAGGGCT CGGGCCGGGT CGATGCCGCC GACATCGTCC GTGCCAGCGG CGACCCGCCG GACAACGAGA CACCGTCGGC CTCCGCCGCC GCCGATCCGA CGGATCCCAG CGTCGGCGAG AGCGTGACCT TCGACGGGAG CGCCTCGTCC GACCCCGACG GCACGATCGA GAGCTACCAG TGGGACTTCG GGGACGGGAA CACCGGATCT GGCGTGACTG TGGAACACAG TTACGACGCT GCCGGGGAGT ACCAGGCGAC CCTGACCGTG ACCGACGACA GCGGTTCCTC GACGACCGAC GGCGTCGTCG TGAACGTCGC AAGCGGCGGC GGGGACTGCA GTCAGAGCGC CTCGGGGAGT GCCGACGGCC GGCTCACCGG CTGGCGAGAC AGCGACAGTT ACACCTGGGC GAGTCAGTTC TCGTCGACCT GTGAACTGAC GGTCGATCTC TCGGGAGCGT CGGGGACAGA CTTCGATCTC TACGTCACCG CGGACGGCCG GACGCCGACG ACCAACGACT ACGACGCACG GTCGGTGTCG AGCGACAGCG AGGAGTCGGT GACGCTGTCG GAGATCGGTG ACTCGGTCGG CATCCTCGTC GACTCCTATC GGGGCAGCGG CTCCTACACG GTCAGTGTCG AGGAGACCGG CGCGGGCACT CAGGCGACCG CCAGTTCGGA GGGACTGTAA
|
Protein sequence | MILESKLRAS DKYVVSDLQD AMTHGTQSWR RRGVLKSIGA LGALAGVGVT GATPGRSPGP KPDELIVGAK RGVSTADVES EVSAATTANT SVVHRNEALG YLAVELPEVS TQSERESVRQ QFESQPNVAY VEDNVTYETQ LTPNDPQFGD QYAPQQVNAE AAWDTTLGST DVTVAIVDTG AQYEHPDLTN LFGSNPGRDF VDGDGDPAPG SAGESHGTHV SGCASADTDN GVGVAGVSDS RLLSARALGG GGGGALSDIA DAVRWATDQG VDIINMSLGG GGYTQTMKRA VEYAYDQNDV LVVCAAGNDG GSVSYPAAYD ECVAVSALDP NEELANFSNR GPEIEVAAPG VNVLSTVPYD GYDSFSGTSM ASPVAAGVAA LGKAAEPGLS ASQLRERLKS TADGVGLPGD QQGSGRVDAA DIVRASGDPP DNETPSASAA ADPTDPSVGE SVTFDGSASS DPDGTIESYQ WDFGDGNTGS GVTVEHSYDA AGEYQATLTV TDDSGSSTTD GVVVNVASGG GDCSQSASGS ADGRLTGWRD SDSYTWASQF SSTCELTVDL SGASGTDFDL YVTADGRTPT TNDYDARSVS SDSEESVTLS EIGDSVGILV DSYRGSGSYT VSVEETGAGT QATASSEGL
|
| |