Gene Hmuk_2735 details


Gene Information

Locus tag: Hmuk_2735
Symbol:
ID: 8412286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2626349
End bp: 2628358
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 69%
IMG OID: 645021081
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003178548
Protein GI: 257388775
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
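
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name check_lengths is hypothetical and used only for illustration.

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) from inclusive coordinates.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # complete codons in the span
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values copied from the Gene Information table.
gene_length, protein_length = check_lengths(2626349, 2628358)
print(gene_length, protein_length)  # 2010 669
```

Under these assumptions the result agrees with the listed Gene Length (2010 bp) and Protein Length (669 aa).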


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.637714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACA AAGACATTGG GGGGAGTACC AGACGAACCT TCCTCGGGAC AGTAAGCGGT 
ATCGTCGGAT CAACAGCAGT CGGCACTGCA GCCGCGGTCG AAGGACGCGA TCCAGACGGA
AACGGCGGGG GGCACCGCGA GGCGGGAGAA CTCATCGTCG GAATGGAGCC GACGGCAAAC
GCCGCCGAGA CGAAGGCGAC GATCCAGTCG GGGCTTCCCG AAGGAGCCTC GGTCGTCAGC
GAGAACGACA CGCTGGGGTT CATGGAAGTT CAGCTGCCTC AGCAGACCGG CCCACAGGCA
CAGTCGGCGG CCAAGAAAGA CCTGGAGAAC CAGCCCGGCG TCGCGTACGT CGAGCCAAAC
GCGATCTACT ATCCGATGGT CATCGACGTC GACGACCCGC GGGTGGGAGA GCAGTACGCC
CCGGAGCTGG TGAACGCTGG TGGGGCGTGG GAGACGACGC TGGGATCGAC CGACGTGACG
ATCGGCGTCG TCGACCAGGG AACCCAGTAC ACACACCCCG ACCTCCAAGC GCAGTTCGGA
GAGGTCAAGG GACGGGACTT CGTCTCCGAC GACGACGACC CGAGCCCGAG AGGCGGGAAC
GGCCACGGGA CCCACGTCTC GGGCATCGCG GCCGGAACGA CGAACAACGC CACGGGGATC
GCGGGCATCT CGAACTCCTC GCTGCTGGCC GCACGCGCGC TTGGCGGTCA GGGCGGCGGT
GGGACACTGA GCGCGATCGC AGACGCGATC GTCTGGTGTA CGGACAACGG TGCAGACATC
ATCAACATGT CCCTCGGTGG CGGCGGTGCC AACCGGACGA TGCGGAACGC GTGTGACTAC
GCGTTCGACA ACGGCGCACT GCCGATCGCA GCGGCGGGCA ACAGCGGCCA GCGAGGGATT
TCGTATCCGG CAGGCTACGA CTCCGTCGTC GCTGTCTCCG CGGTCGGTCC CAACGAGTCA
CTGACCGACT TCTCGCAGTA CGGCCCTGGC GTCGACGTTG CCGCACCGGG ACTGAACGTG
CTCTCGACGT ATCCGACCGA CGGCTACAAC TCGCTGTCGG GAACGTCGAT GGCCTGCCCG
GCGGCTGCGG GCGTCGCGTC GCTGGCCCTC GCGATCGATC CGAGCCTGTC GCCACAGGAG
CTCAAAGACG TGCTCACCGA GAGCGCTCGC GACATCGGTC TCCCGTCGGA CCAGCAGGGC
AGCGGTCTCG TCGATGCCGG TGCGCTCGTC GACGCCGTGA GCGACGACGA CGGCGGTGGC
GGCGGCGACG AAGACGCGAC GAGCGGCTCG CTCACCTTCG GCGATCAGGT GCTGGGCGAC
GACGGGAACG TCACGGTCAG CGACGTGACG ACGAACGGCG ACGCGACGGT CGTGGTGACC
TATCCCGACG GCGACCAGAA CGTGGTCGCG GGCGTCTCGT CCGCAGACGA CGCCAGCGGC
ACGTCCGTGC CGGTGTCGGT CCAGGACGAC GGCGGATTCC CCGGCGAGCA CACGGCCTGG
GTGTTCGGCG ACGGAGACGT CGAGGGCGTC GCGATCGGCG ACGACGCGAC GCCGGTGGCC
GGTAGCGCGC TCGATTCCGA CACCGCGGTC GTGTCGGACG GCGACGACGG CGGGGGCGGC
GGCAGCGAGT ACCCGCAGTG GTCCGCCGAC GAGGTCTACA CGAGCGGCGA CCGCGTCGTC
TACGAAGGAA CGATCTACGA GGCCCAGTGG TGGACTCAGG GCGACGAGCC CGGTTCGAGC
CAGTGGGGTC CGTGGGAAGA AGTCGGTCCG GCAGACGGCG GTGACGGCGG TGACGGCGGT
GACGGCGGTG ACGGCGGTGA CGGCGGTGAC GGCGGTGACG GCGGTGACGG CGGCGACGGC
GGCGACGGCG AGTACCCACA GTGGGACGCC GACACGGCCT ACACCGGCGG CGACCGCGTC
GTCTACGACG GCTCCGTCTG GGAGGCCCAG TGGTGGACCC GAGGCGACGA GCCCGCCGAA
GGGAAAGCGG TCTGGGAACG CGTCTCCTGA
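
The gene sequence is printed in blocks of ten bases, so it needs whitespace removed before any analysis. The following is a minimal sketch of how the blocked text could be cleaned and its GC fraction computed, for comparison with the GC content field in the Gene Information table. The helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGACGGACA AAGACATTGG GGGGAGTACC AGACGAACCT TCCTCGGGAC AGTAAGCGGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to clean_sequence in the same way would give the value to compare with the listed GC content.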
 
Protein sequence
MTDKDIGGST RRTFLGTVSG IVGSTAVGTA AAVEGRDPDG NGGGHREAGE LIVGMEPTAN 
AAETKATIQS GLPEGASVVS ENDTLGFMEV QLPQQTGPQA QSAAKKDLEN QPGVAYVEPN
AIYYPMVIDV DDPRVGEQYA PELVNAGGAW ETTLGSTDVT IGVVDQGTQY THPDLQAQFG
EVKGRDFVSD DDDPSPRGGN GHGTHVSGIA AGTTNNATGI AGISNSSLLA ARALGGQGGG
GTLSAIADAI VWCTDNGADI INMSLGGGGA NRTMRNACDY AFDNGALPIA AAGNSGQRGI
SYPAGYDSVV AVSAVGPNES LTDFSQYGPG VDVAAPGLNV LSTYPTDGYN SLSGTSMACP
AAAGVASLAL AIDPSLSPQE LKDVLTESAR DIGLPSDQQG SGLVDAGALV DAVSDDDGGG
GGDEDATSGS LTFGDQVLGD DGNVTVSDVT TNGDATVVVT YPDGDQNVVA GVSSADDASG
TSVPVSVQDD GGFPGEHTAW VFGDGDVEGV AIGDDATPVA GSALDSDTAV VSDGDDGGGG
GSEYPQWSAD EVYTSGDRVV YEGTIYEAQW WTQGDEPGSS QWGPWEEVGP ADGGDGGDGG
DGGDGGDGGD GGDGGDGGDG GDGEYPQWDA DTAYTGGDRV VYDGSVWEAQ WWTRGDEPAE
GKAVWERVS
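
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce that translation in plain Python. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The function name translate is hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order (standard assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, copied from the Gene sequence block.
print(translate("ATGACGGACA AAGACATT"))  # MTDKDI
```

The output matches the first six residues of the protein sequence (MTDKDI). Translating the full gene sequence the same way is the natural check against the complete 669-residue protein.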