Gene Hmuk_0092 details


Gene Information

Locus tag: Hmuk_0092
Symbol:
ID: 8409589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 90799
End bp: 92220
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 71%
IMG OID: 645018418
Product: protein of unknown function DUF58
Protein accession: YP_003175938
Protein GI: 257386165
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
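
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(90799, 92220)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both results agree with the Gene Length and Protein Length fields of the record.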


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGTGA CGCGTCGGTA CTGGACGGTC GCCGCCCTCG GGACGGCGCT TGCCGTCTGG 
GGCGTCGTCG TCGCGGGGCC GCTGCCCGTA CTCGGAAGCG GTGCCATCGG AGCCTGGTTG
CTCGTCCGAC AGTACCGGTT CGTCCGATCG GCGACACGGG TACCCCAGAG AGTCAGCGTC
GACCTCTCGA CGCGGGACCG AGTCACGGCC GAAGAGACCG CCGAGCTGAC CGTCGGTGTG
ACTCTCGACA CCGCGATGCC GGTCCCGCTC ACCGTCACCG TCGAGCCACC GGTCGGTGCG
ACCGGCGACC CGGCGACGGT CGCGCTCGCG CCCGGCGAAC GCGAGGCGGT CGCGTCGACG
GCCGTGACCT GGCCGGTCGC GGGCTCGTTC ACCGTCGACG ACGTGTCGCT GTCGATCTCC
GACGCGGACG GGCTCTTCGC CCAGGTGGTC GATCTCGACG CCGGCCGCGA AGTCGCGGTC
AGCCCGCGGG CACCCGACGA TCTCCACGTC GGAGAGGGCG GAAATCCGAT CGCGACCGGC
TTCGGCGAGC AGGCGTCGAG CCAGATCGGT ACCGGCATCG AGCCCGCCGA GATCCGGGAG
TACGTGCCCG GCGACGCCGT CCGACAGATC GACTGGAAGG CGACGGCACG GCTCGACCAG
CCACACGTCA TGGAGTTCGA GGGCGGCGCG GACCGGCAGA CTATCCTGGT GTTCGACCAC
CGCGCGTCGA TGGGCGACGG ACGAGCCGGG ACGACCAAAC TCGACTACGC GAGACAGCTC
GCGGTCGCGA TCGTCGACCG AGCACGCACC GACGACGACG GGCTCGGCTA CTACGCCGTC
GGCGACGACG GCGTGACGAC GGCGATCAGC CCGCTGGTCA GAACCGACGC CTGGGCCACG
GTCCGAACGA AGCTGCTCGC GGCCGCGACG ACCGACGGTC GGTCCGGCGG CGTCGGACCC
GGACGGTCGC CGAACACACG GCTCGCCAGC CGAGTCGCGA GCGACGATTC GGCGTTTGGA
CGGACGCTCG AACCGTTTCT GTCGGAAGGT GGGGCGTACC AGCAGCGCTT CGACGCACAG
CCGCTGGCGG GCGCGATCAG GACGGCCGAG TACCCCAGTG ACATCGTCGT CCGGACGGTC
ATCGTGACCG ACGACAGCCG CCCGGACGAA CTGCGAGCGG CGGTCCGGCG TGCCAGCCGG
AACGGCAGCA CCGTCGCGGT CTTCATGACG CCGACGGCGC TGTTCGAGTC GAGCGAACTC
ACCGATCTCG CTGGGGCCTA CAGCGACTAC GTCGCCTTCG AGGAGTTCCG GCGCGAACTC
GCCGGCATGG AGGGCGTCGA CGCCTACGAG ATCGGCCCCG GTGACCGACT CTCGGCGGTG
CTGTCGGCGG GCCGAACGCG GCGACGGAGG GAGTCGGCGT GA
 
Protein sequence
MQVTRRYWTV AALGTALAVW GVVVAGPLPV LGSGAIGAWL LVRQYRFVRS ATRVPQRVSV 
DLSTRDRVTA EETAELTVGV TLDTAMPVPL TVTVEPPVGA TGDPATVALA PGEREAVAST
AVTWPVAGSF TVDDVSLSIS DADGLFAQVV DLDAGREVAV SPRAPDDLHV GEGGNPIATG
FGEQASSQIG TGIEPAEIRE YVPGDAVRQI DWKATARLDQ PHVMEFEGGA DRQTILVFDH
RASMGDGRAG TTKLDYARQL AVAIVDRART DDDGLGYYAV GDDGVTTAIS PLVRTDAWAT
VRTKLLAAAT TDGRSGGVGP GRSPNTRLAS RVASDDSAFG RTLEPFLSEG GAYQQRFDAQ
PLAGAIRTAE YPSDIVVRTV IVTDDSRPDE LRAAVRRASR NGSTVAVFMT PTALFESSEL
TDLAGAYSDY VAFEEFRREL AGMEGVDAYE IGPGDRLSAV LSAGRTRRRR ESA
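
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so it does not reproduce the record's 71% figure by itself.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string.

    Removes spaces and line breaks; assumes no other non-sequence characters.
    """
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned nucleotide string."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCAGGTGA CGCGTCGGTA CTGGACGGTC GCCGCCCTCG GGACGGCGCT TGCCGTCTGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence through `clean_sequence` and comparing the result's length and GC fraction with the Gene Length and GC content fields is a quick way to confirm the record was copied without loss.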