Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1032 |
Symbol | |
ID | 8410550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 982985 |
End bp | 984172 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645019367 |
Product | peptidase M50 |
Protein accession | YP_003176866 |
Protein GI | 257387093 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0615226 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGAT TCCGTATCGG CAGTGCCTTC GGCATTCCGA TCCAGTTGGA CCTGACCTTC CTGCTGGTGT TGCCACTGTT CGCGTGGATC ATCGGCTCGC AGGTCGAGCC GACCGTCGAG TTGCTCAACC GGTTCGGTGG GAACCTCGAC CCGGCAGTCC TCACCACCGG TCTCCTGCCG TGGCTGCTGG GCATCGCGGC CGCGATCGGC CTGTTTACCG GCGTCGTCCT CCACGAGCTG GGCCACTCAC TCGTGGCGAT CCGCTACGGC TACCCGATCG AGTCGATCAC GCTGTGGCTG TTCGGCGGCA TCGCACAGTT AGACGAGATG CCCGAAGACT GGCGACAGGA GTTTCTCATC GCGCTGGCCG GCCCCGTCGT CAGCGTCCTC GTCGGGATCG TCTCGCTGGT CGGGTTCGTG CTGGTCCCCA GCGGGACGAC GACGTTCGCT GCCGTCCGCT TCGTCCTCGG CTACCTCGCG CTGATGAACG TCGCGCTGGC CGTCTTCAAC ATGTTACCGG GGTTCCCGAT GGACGGCGGC CGCATCCTCC GGGCTCTGCT CGCGCGGTCG AACCCCTACG CTCGCGCCAC GGAGATCGCG GCCGAGGTCG GGAAGGGGTT CGCGATCCTG CTGGCGCTGT TCGGACTGTT CCCACCCTTC AATCCGCTGC TGATCGGCCT GGCCTTCTTC ATCTACATCG GCGCGGCCGG CGAGTCCCGC CAGACGGTCA TGCGAGCGGC CTTCGAAGGC GTCACCGTCG CCGACGTGAT GACGCCCGCC GACCGCGTGA CCACCGTCGA TCCCGACACG AAGGTCCGCG AACTCATCCG GACGATGTTC GAGGAGCGCC ACACCGGCTA CCCCGTCGAG CGCAACGGCG AGATCGTCGG CCTCGTCACG CTTGAAGACG CCCGCGCCGT GCGGGAAGTC GAACGGGACG CCTACACCGT CGGCGACATC ATGACCACGG AACTCATCGC CGTCGCCCCC GACGAGGACG TGATGACCGC GCTCTCGGAA CTGGAAGGGA ACAACGTCGG CCGCCTGATC GTCCTCGACG AGGCCGACGC GTTCCGGGGA CTGCTCACCC GCAGCGACAT CATGACGGCA CTGACGATCA TCAAGGAGAA TCCCGACTAC AGGGCCGACG ACGAGGGCGA GGCGACCGTC TTCGAACCGC AGACGTGA
|
Protein sequence | MRRFRIGSAF GIPIQLDLTF LLVLPLFAWI IGSQVEPTVE LLNRFGGNLD PAVLTTGLLP WLLGIAAAIG LFTGVVLHEL GHSLVAIRYG YPIESITLWL FGGIAQLDEM PEDWRQEFLI ALAGPVVSVL VGIVSLVGFV LVPSGTTTFA AVRFVLGYLA LMNVALAVFN MLPGFPMDGG RILRALLARS NPYARATEIA AEVGKGFAIL LALFGLFPPF NPLLIGLAFF IYIGAAGESR QTVMRAAFEG VTVADVMTPA DRVTTVDPDT KVRELIRTMF EERHTGYPVE RNGEIVGLVT LEDARAVREV ERDAYTVGDI MTTELIAVAP DEDVMTALSE LEGNNVGRLI VLDEADAFRG LLTRSDIMTA LTIIKENPDY RADDEGEATV FEPQT
|
| |