Gene Hmuk_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3052 
Symbol 
ID8412605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2937749 
End bp2940856 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content72% 
IMG OID645021399 
Producthypothetical protein 
Protein accessionYP_003178864 
Protein GI257389091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.18746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGATC GCGGCCGCGT GCCGTTCGCA CTGATCGGCG TCCTCCTGCT CGTCAGCAGC 
GCGACGCTGG CGACGACGAT CGATCCGGGA TCGCTTCCGT CCGACAGCGA GACCGAGGTG
GTCACTGAAC GGACGACTGC GACCGCACAG ACGGAACTGC GCGAGGCCGT GACGACCGCC
AGCCGGGCGG CCGCAGCGGA CCCCGTCGTC GATCCGGCAG ACACCGCCGC CGGTCGCCTT
CTCGACGAAG AGACGGCGTT TCGCGACGCG CTCCGGCTCC GGATCTACCT GCGGGCCCGC
GACCGCCTCT CTCGGGTCGC GGTCCGTCGT GGCGAGGTCA CCGGCTCGGT CTCGCTGCCC
TCGACGGAGA CGCGGGCCAA GCGCCGCGCC GCCATCGACC GCGTGACAAT CCAGCGGGCA
GACGACGACG GCACCGCGAT CCGGGTGACC GTCGAGAACG TGACGGTCCG TACCCACCGT
GGCGGGCAGG TGCGCTCCCG GACGACGATC TCTCCGACTG TCACCGTCGT GACGCCGGTG
CTCGCGGCCC ACGATCGAGT CAGCACCTAC CAGCGGCGAC TCGACGCCGG CGTGACGGAG
CGAGGATTGA GCCAGCGACT GACGACTCAG CTGTACGCGC TGGCGTGGTC GCGAGGCTAC
CTCCAGTACG GCGAGGTGCC GATCAGCAAC GTCGTCTCGA ACCAGCACGT GGGCGTCGTC
ACCAACGAGG CACTGCTGGA CCTCCAGCGC GAGACGATCG GCCACGCCGA TCCACGGGGC
CGTCGCACGC TCGCGGTCGC GGCGGCACGG ACGGCGGCAC GGGATCTCAC CGTCGCCACA
GGGACCGACT CGCGCGTGAC CGACGCCGTT CTGAGCGGAC CGACGAAGCC AGCGGCGAGC
GACATCGAGG GACTGGAGCC ACCGCGCCGG TCGAGCCCCG ACGAGCGGCG CGAGGTCGCG
GTCAACGAGA CGGCAGACAG AGCGTTCGTC GACGTGCTCG ACGACGGTGC CATCGACAGC
ACGATACGGG ACGCCTACAG CGTCGAGGTC CGGACTGTCG GCCGCGTCGA GGGCGATCGC
CACGTCGGCG TGTCGGCACC GCGCCCCGCC GGCTCGAACT GGACGCGCGT CGACAAGCGC
CGTGAGCGAT CGATCCGCCA CCGAAACGTG TCCGTCGCGC CGCCGCCGAT CCCCGACGGC
TGGCACGAGT TCGAGACCTA CGGGCGAGAG ACGGTCGTCA CGGAGCGAGC GGTGGGGGTC
TGGGAGCGCC AGGTCGCGGG GCCAAACGGG AGCGTCGAGG TCCAGCGGAG GACCACCGAC
AGGACGGGCA CGAGTCGACA GACCGTCACC CTCGCGGTCG TCGGTCGTCA CGACCGCACC
TCGCCGGCTC CCGTTCGACC GATCCGACGG GCCCACCAGC GCGGTGCCGG CCCCCTGGAG
GGACCGAACC TCGCCGACGC TCGCGAACGC GCCAGAGAGC GCCTGATCGA CAGCCAGGGC
GGCCGGTCCG CGGCCCTCGA ATACGCGGTC CACAGCGGGC CGAACTCCGA CGTTCACACG
ATCGAGCTAG AGGTACCCGC GAACGCGTCC GAGTGGGCGT ATCGCGATCT GATGGGGGTC
CGAGAACGGG TCCGGTCCGT CGCCGTCGAG GTGCGGCAGG GACGAGTCGG GTCGTACGAA
TCGAACCCGC CGGCAGCGCT CGCACGGGCC GTCGAGCGAG AGCGGACGCG GCTGATCGAC
GCACCGTCCA GCTACGACGG CGCGGCGACC AAGGCGCGGA TCGCCGTCCG GGTCGCGTAC
CTCGATCGCG TACAGGCCCG CCTCCGTGCG CGGGCAGACG ACCGGCGTGG CAGGGCCGAC
GCGTTCGGCG ATCGGCTCGA CGAGGCCGGT ACGTCGATCG AGACGCTGCG CGAGGGGCTC
GACGCTCGCG GGCGACCACC GTCGGACCGC CAGCCGCGGC TGGACGGTGT CGGCGGGCCG
GTCGCGCTGA CGGCCGACGG CGCACCGGCC TACCTCACGC AGGCCTCGCT CAGCCACGAC
GACGACCCGG CGATCGAGAA CAGCTCTCGT CCGCTGGTCG CGCGCAACGT CAACGTCTTT
ACCGTACCCC ACCAGACCGT CTCGGACTCG CTGGTCGACG GTCTGTTCGG CGATCGATCG
GGGGTCAGAC TCGACACGGC GGCCCGGACG TTGAACGCGA CCAACGCGAC CCTCGCCGAG
GCGAACGCCG CTGCTGTGGA CGGCGAGGCG GTTCGACGGG CGGACGCGCG CCGACCCGAA
CGCCACGCCG AGAACGTCTC GGCGCTGACC CGCGAACGCG ACGCCCTCCG CCGAGAAGTC
GCGTCGGCGA ACGAGCACGT GATCGACGGC CAGCGGTCGG TGCTGTCCCG ACGGGCCGTC
GCCGGCAGTG CCAGCGAGCG CGAGGCGATG CTTCGAGACG CGCTCGCGCC GTGGCAGACG
ACCCACGATC GGGCCCTGGC GCTGGCGAAC GGCTCGGTCA GCCGTCGACT CGTCGCTCTC
GCCGGCCGGC GGACCGACCT CTCGGTGGCT GCACGCGACC GGCTCGCCAT TCGACTGAAC
GCCACGCGTC GCGCGGCGTT ACGGGAGCCG GGCGGGCGAC CGGACACCGA CGCCGTCGAC
GCGAGCCGAT CGCGCACGCA GACGGTCGCG CGGGAGCTGG CTCGGGAGGC CGCTGCCGCC
GGTGCGGAAC GGGCCACGAA GCGCGGCTAC GGCGCGGTGG TGAACGACAC GTTCGAGGCG
ATGCCGTCCG GACTCCCGCT GGCACCCGTG CCCGGCTCGT GGTACGCCAC GACGAACGTC
TGGCACGTCA CGGTCCGTGG CGAGTACGCC CGCTTCGGCG TGCGCGTCTC CCAGGGGCGG
CCGACGACGC CCGGCGGCGA GTTCGTCTAC GCCAGGGACG GCGAGAACGT CAGCCTCGAC
GTCGACGACG ACGGGTCTCC CGAGCGGATC GGCCGGTCAA CCCGCGTCGA CTTCGAGGCC
ACGGCGACGG TCCTCGTCGT CGTGCCGCCG GGCAAGACCG GCGTCGGCGA CACCAACGGC
GTCGCGATCG AGGAGTCGGA GGGGTGGCCC GATCCGGGGC CGGAGTGA
 
Protein sequence
MDDRGRVPFA LIGVLLLVSS ATLATTIDPG SLPSDSETEV VTERTTATAQ TELREAVTTA 
SRAAAADPVV DPADTAAGRL LDEETAFRDA LRLRIYLRAR DRLSRVAVRR GEVTGSVSLP
STETRAKRRA AIDRVTIQRA DDDGTAIRVT VENVTVRTHR GGQVRSRTTI SPTVTVVTPV
LAAHDRVSTY QRRLDAGVTE RGLSQRLTTQ LYALAWSRGY LQYGEVPISN VVSNQHVGVV
TNEALLDLQR ETIGHADPRG RRTLAVAAAR TAARDLTVAT GTDSRVTDAV LSGPTKPAAS
DIEGLEPPRR SSPDERREVA VNETADRAFV DVLDDGAIDS TIRDAYSVEV RTVGRVEGDR
HVGVSAPRPA GSNWTRVDKR RERSIRHRNV SVAPPPIPDG WHEFETYGRE TVVTERAVGV
WERQVAGPNG SVEVQRRTTD RTGTSRQTVT LAVVGRHDRT SPAPVRPIRR AHQRGAGPLE
GPNLADARER ARERLIDSQG GRSAALEYAV HSGPNSDVHT IELEVPANAS EWAYRDLMGV
RERVRSVAVE VRQGRVGSYE SNPPAALARA VERERTRLID APSSYDGAAT KARIAVRVAY
LDRVQARLRA RADDRRGRAD AFGDRLDEAG TSIETLREGL DARGRPPSDR QPRLDGVGGP
VALTADGAPA YLTQASLSHD DDPAIENSSR PLVARNVNVF TVPHQTVSDS LVDGLFGDRS
GVRLDTAART LNATNATLAE ANAAAVDGEA VRRADARRPE RHAENVSALT RERDALRREV
ASANEHVIDG QRSVLSRRAV AGSASEREAM LRDALAPWQT THDRALALAN GSVSRRLVAL
AGRRTDLSVA ARDRLAIRLN ATRRAALREP GGRPDTDAVD ASRSRTQTVA RELAREAAAA
GAERATKRGY GAVVNDTFEA MPSGLPLAPV PGSWYATTNV WHVTVRGEYA RFGVRVSQGR
PTTPGGEFVY ARDGENVSLD VDDDGSPERI GRSTRVDFEA TATVLVVVPP GKTGVGDTNG
VAIEESEGWP DPGPE