Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2108 |
Symbol | |
ID | 8411646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2013104 |
End bp | 2014414 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645020449 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003177928 |
Protein GI | 257388155 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0520788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGC AGTCGACGCG CTGGCGGGCG ACGGTGGCGG CTGCGACGCT GTTCGCTGCG GCCGGGCTCG TGGCCCGCAG CGGCGCGTTA CTCCTCGCCG CGGTCGTTCC GCTGGTGTAT CTCGCGTACT CGCTGGTCAC GAGCGGTCCC GGTTCGGTCT CGCTCTCGGT CACTCGCCAC GTCGAGCCGG AGCTGTCGCC GCCGGGGACG CCGGTCCACG TCACGCTCGA ACTGACCAAC GAGGGCGACC GGCCGGTGTC CGACCTCCGC GTCGTCGACG CCGTCCCGGC GGACCTGGCG GTGACGCGTG GCTCTCCTCG GACGGGCGTG GCGCTCGAAG CCGGCGCGAC GACGACGATA GAGTACGTGC TGATCGCTCG CCGCGGCGTC CACGAGTTCG AACCCCCGCG CGTCAGGGTC CGGAGCCTCG GGGGCACGAG CGTCACGACG CTGCGCCCGA CTGTGTCGGG CGACACACGG CTCGTCTGCC GCCTCGACGC GGACGCGCCA CCGATCGACG ACCAGGGCGC GCAGTTCGTC GGCGACCTGA CGACCGACGA ACCGGGCGAG GGACTCACCT TCCACTCGAC TCGGGAGTAC CGGCGCGGCG ACGACGCGCG CCGGATCGAC TGGCGGACGT ACGCCAAGAC CGGCGAGCTG ACGACGATCG ACTACGAGCG CCGCCGGGCG GCCTCGGTCG TCCTCGTCGT CGACGCTCGC CGCATCAGTC GCGTCTCGGC GGGTCCCGGC CGACCCACGG CCGTCGAACT GTCCGCCTAC GCGGCGACCC ACGCGGTCAC CGACTTCGTC GCCAGCGGTC ACGACGTCGG CGTCGCGGTG ATCGGAGCCG ACGGTCCGGG GCCGGCCGGA CTCCACTGGC TGGCCCCCGA GAGCGGCGAC GGGCAGCGGT CCCGAGCGAT CGATTACTTC CGGACGGCGA CGGAGGTGAC CGGCGGCGCG CCCGACATCG ACCGCCAGCT GGCCGAGCTG CTCGACCTGG TACCGCCGGG CGCACAGCTG GCACTGTTCT CGCCACTGCT CGACGATCTG CCCGTCGAGG CCGTACAGGC GTGGCGCTCG CAGGGGTACC CCGCCGTCGT CCTCTCGCCG GACGTGGTGA CGGACAACAC CGTCGGCGGA CAGTTCGAAC AGGTCCAGCG GTACACGCGA CTGGCTCGCT GTCAGGCGAC CGGCGCGCGC GCGACGGACT GGCGACGGGG GACGCCGCTC CCGATGGCGC TGGCGTACGC GTTCGCGGCC GACGCACGGC TGCCGAGTCA GCGTCCGACC GGCACTGGGG GGGTCCCGTA G
|
Protein sequence | MSRQSTRWRA TVAAATLFAA AGLVARSGAL LLAAVVPLVY LAYSLVTSGP GSVSLSVTRH VEPELSPPGT PVHVTLELTN EGDRPVSDLR VVDAVPADLA VTRGSPRTGV ALEAGATTTI EYVLIARRGV HEFEPPRVRV RSLGGTSVTT LRPTVSGDTR LVCRLDADAP PIDDQGAQFV GDLTTDEPGE GLTFHSTREY RRGDDARRID WRTYAKTGEL TTIDYERRRA ASVVLVVDAR RISRVSAGPG RPTAVELSAY AATHAVTDFV ASGHDVGVAV IGADGPGPAG LHWLAPESGD GQRSRAIDYF RTATEVTGGA PDIDRQLAEL LDLVPPGAQL ALFSPLLDDL PVEAVQAWRS QGYPAVVLSP DVVTDNTVGG QFEQVQRYTR LARCQATGAR ATDWRRGTPL PMALAYAFAA DARLPSQRPT GTGGVP
|
| |