Gene Hmuk_2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2844 
Symbol 
ID8412395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2728853 
End bp2730772 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content67% 
IMG OID645021189 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003178656 
Protein GI257388883 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0124345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCTCG AGAGTAAACT TAGAGCCAGT GATAAGTATG TGGTGTCTGA TTTACAGGAC 
GCCATGACAC ATGGTACACA GAGCTGGCGA CGCAGAGGAG TATTGAAATC TATCGGCGCA
CTCGGGGCAC TCGCCGGTGT CGGCGTGACG GGCGCGACTC CCGGACGGAG CCCCGGGCCG
AAGCCGGACG AACTGATCGT CGGCGCGAAG CGGGGCGTGA GCACTGCCGA TGTCGAGTCG
GAAGTCTCGG CCGCGACGAC GGCGAACACG TCGGTGGTCC ACCGAAACGA GGCGCTTGGC
TATCTCGCCG TCGAGCTTCC GGAGGTGAGC ACACAGTCCG AACGCGAGTC GGTCCGACAG
CAGTTCGAGA GTCAGCCGAA CGTAGCCTAC GTGGAAGACA ACGTCACCTA CGAAACCCAG
CTGACGCCCA ACGATCCGCA GTTCGGTGAC CAGTACGCGC CCCAGCAGGT CAACGCCGAG
GCGGCCTGGG ACACGACGCT TGGCAGTACG GACGTGACCG TCGCCATCGT CGACACCGGC
GCACAGTACG AACACCCGGA CCTGACGAAC CTGTTCGGGA GCAATCCCGG CCGGGACTTC
GTCGACGGCG ACGGCGACCC CGCCCCGGGA TCCGCCGGTG AATCCCACGG GACCCACGTC
AGCGGGTGTG CGTCGGCAGA CACCGACAAC GGCGTCGGCG TCGCGGGCGT GAGCGACTCG
CGACTGTTGA GCGCCCGAGC ACTCGGTGGC GGCGGCGGCG GCGCGCTGTC GGACATCGCC
GACGCGGTTC GATGGGCGAC CGACCAGGGT GTGGACATCA TCAACATGTC GCTGGGCGGG
GGCGGCTACA CCCAGACGAT GAAGCGAGCG GTGGAGTACG CCTACGACCA GAACGACGTG
CTGGTTGTCT GTGCGGCGGG CAACGACGGC GGTTCCGTCT CGTATCCCGC GGCCTACGAC
GAGTGTGTCG CCGTCTCGGC GCTGGACCCG AACGAAGAAC TCGCGAACTT CTCGAACCGC
GGGCCAGAGA TAGAGGTTGC CGCACCGGGA GTCAACGTCC TCTCGACGGT TCCATACGAC
GGGTACGACT CCTTCTCCGG GACCTCGATG GCCTCTCCCG TTGCCGCGGG GGTCGCGGCA
CTGGGGAAGG CCGCCGAGCC GGGCCTGTCG GCGAGTCAGC TCCGCGAACG GCTCAAGTCG
ACGGCCGACG GCGTCGGACT ACCCGGCGAC CAGCAGGGCT CGGGCCGGGT CGATGCCGCC
GACATCGTCC GTGCCAGCGG CGACCCGCCG GACAACGAGA CACCGTCGGC CTCCGCCGCC
GCCGATCCGA CGGATCCCAG CGTCGGCGAG AGCGTGACCT TCGACGGGAG CGCCTCGTCC
GACCCCGACG GCACGATCGA GAGCTACCAG TGGGACTTCG GGGACGGGAA CACCGGATCT
GGCGTGACTG TGGAACACAG TTACGACGCT GCCGGGGAGT ACCAGGCGAC CCTGACCGTG
ACCGACGACA GCGGTTCCTC GACGACCGAC GGCGTCGTCG TGAACGTCGC AAGCGGCGGC
GGGGACTGCA GTCAGAGCGC CTCGGGGAGT GCCGACGGCC GGCTCACCGG CTGGCGAGAC
AGCGACAGTT ACACCTGGGC GAGTCAGTTC TCGTCGACCT GTGAACTGAC GGTCGATCTC
TCGGGAGCGT CGGGGACAGA CTTCGATCTC TACGTCACCG CGGACGGCCG GACGCCGACG
ACCAACGACT ACGACGCACG GTCGGTGTCG AGCGACAGCG AGGAGTCGGT GACGCTGTCG
GAGATCGGTG ACTCGGTCGG CATCCTCGTC GACTCCTATC GGGGCAGCGG CTCCTACACG
GTCAGTGTCG AGGAGACCGG CGCGGGCACT CAGGCGACCG CCAGTTCGGA GGGACTGTAA
 
Protein sequence
MILESKLRAS DKYVVSDLQD AMTHGTQSWR RRGVLKSIGA LGALAGVGVT GATPGRSPGP 
KPDELIVGAK RGVSTADVES EVSAATTANT SVVHRNEALG YLAVELPEVS TQSERESVRQ
QFESQPNVAY VEDNVTYETQ LTPNDPQFGD QYAPQQVNAE AAWDTTLGST DVTVAIVDTG
AQYEHPDLTN LFGSNPGRDF VDGDGDPAPG SAGESHGTHV SGCASADTDN GVGVAGVSDS
RLLSARALGG GGGGALSDIA DAVRWATDQG VDIINMSLGG GGYTQTMKRA VEYAYDQNDV
LVVCAAGNDG GSVSYPAAYD ECVAVSALDP NEELANFSNR GPEIEVAAPG VNVLSTVPYD
GYDSFSGTSM ASPVAAGVAA LGKAAEPGLS ASQLRERLKS TADGVGLPGD QQGSGRVDAA
DIVRASGDPP DNETPSASAA ADPTDPSVGE SVTFDGSASS DPDGTIESYQ WDFGDGNTGS
GVTVEHSYDA AGEYQATLTV TDDSGSSTTD GVVVNVASGG GDCSQSASGS ADGRLTGWRD
SDSYTWASQF SSTCELTVDL SGASGTDFDL YVTADGRTPT TNDYDARSVS SDSEESVTLS
EIGDSVGILV DSYRGSGSYT VSVEETGAGT QATASSEGL