Gene Hmuk_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1994 
Symbol 
ID8411523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1898433 
End bp1899650 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content71% 
IMG OID645020326 
Productpeptidase M48 Ste24p 
Protein accessionYP_003177814 
Protein GI257388041 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.235795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCCC CCAACTCCCT CGGTACGCGG TTCCGGCTGG CCGCGCTGGT CGCGCTGCTG 
GTCGCCTTCG ACGCGGTCTT CGTCGTCGCC GTCTACTGGC TGGGTCATCT GGCCGCCGGC
CTGTTCTCGT GGGCCATCAC CTACGAGTCC GCCGGGCTGG TGGTCGACGC GTTGCAGTTC
CGGACGCTCC CGTCGCCGCT GGTCGTCGCC GTCGGGACCG CGGCGACGCT GGCGGGACAG
TCGGTGTTTG GCTACCGGAT GAGCGCCAAC ACCGTCCCTG GCCAGCGACA GACGTTCCGG
GACCTCTTCG CGCCGGTCGC CGATCCGGTC GAACCCGGAC AGCCTTTCGG CGGCCTGCTG
GCCGACCGCT CGCACGCGGT CTCTGCGGAC GATGAGGACG AACGCGATCC ACGCGTCGTG
CCGACGTTCG ACTACGGCGA GTACCGCGAG CGCGTCGCGG CCCGGCTCAC CGGGCTGGCA
CAGCTCTCGG ACACACCGAT TCCGGACGTG CGGGTCGTCG ACAGCGAGAC GCCGAACAGC
TACGTCGCCG GGCGGCCCGG CGAGCAGATC CTCGTCGTCA CGACGGGGCT GCTTCGCACG
CTGGACGACG AGGAACTCGA CGCCGTCCTG GCTCACGAGC TCGCCCACAT CAAGCACGGC
GACGCGTTCG TGATGACGGC CGCTGGCTTC CTGCCGACGG TCACGGCCCG CGTCAACGGA
GGGACGATCG AGTTCCTCCG CCGATCCGGA CTCGGATACC TCCCCGGCGT CGAGCAACCC
GAGGACAGTG ACACCGTCGC GTTCGGCCAG TTCCACGTCG CGATGGTCGC GCTCTCGCCG
ATCGTCCTCG CGCTCTCCTC GGCGCTGTGG CTCGCGAGCA CGGCCTGTTA CCGATCGCTC
TCGCGAGTGC GCGAGTTTCA CGCCGACGCC GGGGCGGCCG CGATCAGAGG GTCGCCGGCC
GCCCTCGTCG GCGCACTGGA GACGCTCGAG GACCTGCGTC CGTCCGAGGA CCTGCGGACC
GCCCAGACCG GCGTTCGCGA ACTGTGCGTG TTGCCCGACG CGATCGACGA CGAGGACGGG
ATCGATGGCG ACGATGCCAT CGCCCGGACG CGGCGTCGGT GGCGCTCGGT GGCCGCTCGC
GCGCTCCCCT CCTCGCATCC GCCGATCGAA TCACGGGTCG CCGCGCTCCG AGAGCAGGAG
CGCACGGGCC GCCGATAG
 
Protein sequence
MPSPNSLGTR FRLAALVALL VAFDAVFVVA VYWLGHLAAG LFSWAITYES AGLVVDALQF 
RTLPSPLVVA VGTAATLAGQ SVFGYRMSAN TVPGQRQTFR DLFAPVADPV EPGQPFGGLL
ADRSHAVSAD DEDERDPRVV PTFDYGEYRE RVAARLTGLA QLSDTPIPDV RVVDSETPNS
YVAGRPGEQI LVVTTGLLRT LDDEELDAVL AHELAHIKHG DAFVMTAAGF LPTVTARVNG
GTIEFLRRSG LGYLPGVEQP EDSDTVAFGQ FHVAMVALSP IVLALSSALW LASTACYRSL
SRVREFHADA GAAAIRGSPA ALVGALETLE DLRPSEDLRT AQTGVRELCV LPDAIDDEDG
IDGDDAIART RRRWRSVAAR ALPSSHPPIE SRVAALREQE RTGRR