Gene Hmuk_1032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1032 
Symbol 
ID8410550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp982985 
End bp984172 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content66% 
IMG OID645019367 
Productpeptidase M50 
Protein accessionYP_003176866 
Protein GI257387093 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0615226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAT TCCGTATCGG CAGTGCCTTC GGCATTCCGA TCCAGTTGGA CCTGACCTTC 
CTGCTGGTGT TGCCACTGTT CGCGTGGATC ATCGGCTCGC AGGTCGAGCC GACCGTCGAG
TTGCTCAACC GGTTCGGTGG GAACCTCGAC CCGGCAGTCC TCACCACCGG TCTCCTGCCG
TGGCTGCTGG GCATCGCGGC CGCGATCGGC CTGTTTACCG GCGTCGTCCT CCACGAGCTG
GGCCACTCAC TCGTGGCGAT CCGCTACGGC TACCCGATCG AGTCGATCAC GCTGTGGCTG
TTCGGCGGCA TCGCACAGTT AGACGAGATG CCCGAAGACT GGCGACAGGA GTTTCTCATC
GCGCTGGCCG GCCCCGTCGT CAGCGTCCTC GTCGGGATCG TCTCGCTGGT CGGGTTCGTG
CTGGTCCCCA GCGGGACGAC GACGTTCGCT GCCGTCCGCT TCGTCCTCGG CTACCTCGCG
CTGATGAACG TCGCGCTGGC CGTCTTCAAC ATGTTACCGG GGTTCCCGAT GGACGGCGGC
CGCATCCTCC GGGCTCTGCT CGCGCGGTCG AACCCCTACG CTCGCGCCAC GGAGATCGCG
GCCGAGGTCG GGAAGGGGTT CGCGATCCTG CTGGCGCTGT TCGGACTGTT CCCACCCTTC
AATCCGCTGC TGATCGGCCT GGCCTTCTTC ATCTACATCG GCGCGGCCGG CGAGTCCCGC
CAGACGGTCA TGCGAGCGGC CTTCGAAGGC GTCACCGTCG CCGACGTGAT GACGCCCGCC
GACCGCGTGA CCACCGTCGA TCCCGACACG AAGGTCCGCG AACTCATCCG GACGATGTTC
GAGGAGCGCC ACACCGGCTA CCCCGTCGAG CGCAACGGCG AGATCGTCGG CCTCGTCACG
CTTGAAGACG CCCGCGCCGT GCGGGAAGTC GAACGGGACG CCTACACCGT CGGCGACATC
ATGACCACGG AACTCATCGC CGTCGCCCCC GACGAGGACG TGATGACCGC GCTCTCGGAA
CTGGAAGGGA ACAACGTCGG CCGCCTGATC GTCCTCGACG AGGCCGACGC GTTCCGGGGA
CTGCTCACCC GCAGCGACAT CATGACGGCA CTGACGATCA TCAAGGAGAA TCCCGACTAC
AGGGCCGACG ACGAGGGCGA GGCGACCGTC TTCGAACCGC AGACGTGA
 
Protein sequence
MRRFRIGSAF GIPIQLDLTF LLVLPLFAWI IGSQVEPTVE LLNRFGGNLD PAVLTTGLLP 
WLLGIAAAIG LFTGVVLHEL GHSLVAIRYG YPIESITLWL FGGIAQLDEM PEDWRQEFLI
ALAGPVVSVL VGIVSLVGFV LVPSGTTTFA AVRFVLGYLA LMNVALAVFN MLPGFPMDGG
RILRALLARS NPYARATEIA AEVGKGFAIL LALFGLFPPF NPLLIGLAFF IYIGAAGESR
QTVMRAAFEG VTVADVMTPA DRVTTVDPDT KVRELIRTMF EERHTGYPVE RNGEIVGLVT
LEDARAVREV ERDAYTVGDI MTTELIAVAP DEDVMTALSE LEGNNVGRLI VLDEADAFRG
LLTRSDIMTA LTIIKENPDY RADDEGEATV FEPQT