Gene Hmuk_2108 details


Gene Information

Locus tag: Hmuk_2108
Symbol:
ID: 8411646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2013104
End bp: 2014414
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 73%
IMG OID: 645020449
Product: protein of unknown function DUF58
Protein accession: YP_003177928
Protein GI: 257388155
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID: [TIGR01451] conserved repeat domain
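
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2013104
end_bp = 2014414

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1311 0 436
```

Under these assumptions the result agrees with the reported gene length of 1311 bp and protein length of 436 aa.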


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0520788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGGC AGTCGACGCG CTGGCGGGCG ACGGTGGCGG CTGCGACGCT GTTCGCTGCG 
GCCGGGCTCG TGGCCCGCAG CGGCGCGTTA CTCCTCGCCG CGGTCGTTCC GCTGGTGTAT
CTCGCGTACT CGCTGGTCAC GAGCGGTCCC GGTTCGGTCT CGCTCTCGGT CACTCGCCAC
GTCGAGCCGG AGCTGTCGCC GCCGGGGACG CCGGTCCACG TCACGCTCGA ACTGACCAAC
GAGGGCGACC GGCCGGTGTC CGACCTCCGC GTCGTCGACG CCGTCCCGGC GGACCTGGCG
GTGACGCGTG GCTCTCCTCG GACGGGCGTG GCGCTCGAAG CCGGCGCGAC GACGACGATA
GAGTACGTGC TGATCGCTCG CCGCGGCGTC CACGAGTTCG AACCCCCGCG CGTCAGGGTC
CGGAGCCTCG GGGGCACGAG CGTCACGACG CTGCGCCCGA CTGTGTCGGG CGACACACGG
CTCGTCTGCC GCCTCGACGC GGACGCGCCA CCGATCGACG ACCAGGGCGC GCAGTTCGTC
GGCGACCTGA CGACCGACGA ACCGGGCGAG GGACTCACCT TCCACTCGAC TCGGGAGTAC
CGGCGCGGCG ACGACGCGCG CCGGATCGAC TGGCGGACGT ACGCCAAGAC CGGCGAGCTG
ACGACGATCG ACTACGAGCG CCGCCGGGCG GCCTCGGTCG TCCTCGTCGT CGACGCTCGC
CGCATCAGTC GCGTCTCGGC GGGTCCCGGC CGACCCACGG CCGTCGAACT GTCCGCCTAC
GCGGCGACCC ACGCGGTCAC CGACTTCGTC GCCAGCGGTC ACGACGTCGG CGTCGCGGTG
ATCGGAGCCG ACGGTCCGGG GCCGGCCGGA CTCCACTGGC TGGCCCCCGA GAGCGGCGAC
GGGCAGCGGT CCCGAGCGAT CGATTACTTC CGGACGGCGA CGGAGGTGAC CGGCGGCGCG
CCCGACATCG ACCGCCAGCT GGCCGAGCTG CTCGACCTGG TACCGCCGGG CGCACAGCTG
GCACTGTTCT CGCCACTGCT CGACGATCTG CCCGTCGAGG CCGTACAGGC GTGGCGCTCG
CAGGGGTACC CCGCCGTCGT CCTCTCGCCG GACGTGGTGA CGGACAACAC CGTCGGCGGA
CAGTTCGAAC AGGTCCAGCG GTACACGCGA CTGGCTCGCT GTCAGGCGAC CGGCGCGCGC
GCGACGGACT GGCGACGGGG GACGCCGCTC CCGATGGCGC TGGCGTACGC GTTCGCGGCC
GACGCACGGC TGCCGAGTCA GCGTCCGACC GGCACTGGGG GGGTCCCGTA G
 
Protein sequence
MSRQSTRWRA TVAAATLFAA AGLVARSGAL LLAAVVPLVY LAYSLVTSGP GSVSLSVTRH 
VEPELSPPGT PVHVTLELTN EGDRPVSDLR VVDAVPADLA VTRGSPRTGV ALEAGATTTI
EYVLIARRGV HEFEPPRVRV RSLGGTSVTT LRPTVSGDTR LVCRLDADAP PIDDQGAQFV
GDLTTDEPGE GLTFHSTREY RRGDDARRID WRTYAKTGEL TTIDYERRRA ASVVLVVDAR
RISRVSAGPG RPTAVELSAY AATHAVTDFV ASGHDVGVAV IGADGPGPAG LHWLAPESGD
GQRSRAIDYF RTATEVTGGA PDIDRQLAEL LDLVPPGAQL ALFSPLLDDL PVEAVQAWRS
QGYPAVVLSP DVVTDNTVGG QFEQVQRYTR LARCQATGAR ATDWRRGTPL PMALAYAFAA
DARLPSQRPT GTGGVP
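
The gene and protein sequences above are related by codon translation, and the GC content field is a simple base count. The sketch below shows one way to check both with only the Python standard library. It assumes the sequence is pasted as plain text with whitespace between the 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference is not exercised here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases).
first_line = "ATGAGCCGGC AGTCGACGCG CTGGCGGGCG ACGGTGGCGG CTGCGACGCT GTTCGCTGCG"
print(translate(first_line))  # prints: MSRQSTRWRATVAAATLFAA
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein sequence fields.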