Gene Hmuk_2141 details

Gene Information

Locus tag: Hmuk_2141
Symbol:
ID: 8411680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2049129
End bp: 2052626
Gene Length: 3498 bp
Protein Length: 1165 aa
Translation table: 11
GC content: 57%
IMG OID: 645020483
Product: hypothetical protein
Protein accession: YP_003177961
Protein GI: 257388188
COG category:
COG ID:
TIGRFAM ID:
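
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2049129
end_bp = 2052626

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3498
print(protein_length)  # 1165
```

Both results agree with the listed gene length (3498 bp) and protein length (1165 aa).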


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.515545
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.041224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTACG AGGAGGATGG CACGGTAGTC GGCTACGAAG GAGGGGGCGT TTGGGAGTCC 
AGAGAGGGTA GTACAGTGAT GGAGTCGCCG CCGGCACTCA CGTACAAGGA CGGAAGCCTC
AACGTCCAGT TCCAGCGTGT GTCTGGAAAC ATCACTGACT CCAACGAAAT TACCGCACAA
GTCAACGCTA CGGACGAACG CGCGCTCAAT GACGAACTCT CGGTACTACT GTACACGAAT
CTCACACCGT CGAACATCAG CAGTGCAACG TCGTACAGTC TGACGTGTGC GCCGGCACGA
CTTCAGAACG CCACTGTGTT CATCAATGAC AGTCAGTTCG CTGGCGGGTG GGCACGCTAC
GCAGAGGACC GTTTTAATGA CCGCCGTGTC GATGTCCAAC CCAGCAACTC AGTGAGTCCA
CGCGAGAACG TTTCGATCAC CTTCGAACTC GGTGATGTGT CGCTGCCAGA ATTCTCTGCT
CGGAACGTGA CCACGAAACG ATACAGTGAT ACGGGCCCAC TGTACGTCAA CGCCACCGTC
CAGAACGAGG GTGGACTGAG CAGTGAGCAA ACGGTGACGT TCACCTTCGA AGACCCGGGC
GGTAACGAAG TCTCGACCGA TTCGAGGGAT GTTTCGCTCA ACGGTGGTGA GTCAAAGCGA
ATGAAGTTCG AAGTCCCTGC TAGCGACTTG AGTTCGGTCA CCGCTGGGAC GTACGACGCA
ACGATCGAGA CGGAAGACGA GTCAGAATCC AGGGCAGTTC AGTTCACGTC TGGGAATCAC
GTGCCAGAGT TCCAGATCAC TTCTGTTTCA GCCCCCTCAC AGGGACAGGT CGGCGACGAC
CTGACAGTTG ACGTGACTGT CGAGAACCGC GGAAGTATGA CCGGGACGCG AGCAGTCGCC
TACGGGTTCG ACGGGTCGAC AGAGAATACG ACAATGTTGC GCCTCCATCC CGGCCAGACG
AAGACTCTGA GTTACCCCCT GTCGACGACT ACAGACGGAA GCCACGACTG GACGGTCCAG
ACCGATGGCA CTTCCGAGCA AGGAACTGTC TTTGTGGGGA CGGGACCGAC GTTCACCATC
AGTGATACGG ACGCACCGAA ACATGCGGGC GCTGGCAACA CCTTCTGGCT CAATGCCACG
ATTGGCAATA CTGGTCAACT CAACGGCACG CGTGATATCA ACCTCACGAT CCGGAACGAC
ACTAACGGGA ATGTCGTCAC GGACGAAGAT CAAACTGTCA GCATCAACGG CACGATCGCT
GACAGCAGCG ACGAGGCGAC GGTCAACGCC TCGGGAACAA TCACCACACC AGGGACTTAC
GAGTACAAAA TCGACACGGG AGATATGGTC CAGACCGGCG GTTTCACTGT CGGACCGGCA
CGGTACCCGA ATTTCATCGT CACGTCCCTG CGCTTCTCCG ACGATCCCGT CGTGAAAGGT
GATCAAGTCA CCATCGAAGC AGACATCAAG AATACTGGCC GTGTAGATGA CACACAGCCT
GTCAAGATCT CGACCGAGCG CGATATCTTG ACGGCACCTA ACCGACACGT CGATCCCGGC
GACGTAGTGA CGGTTAGTAC TACTGTCGAC GTCGCGTCAC CGACGTTCGT CCTCGGAGAG
GCCAACGAAA TCACCGTCGA GACCGACAAC AATACCGTCG CTCAGAATCT GGAGGTATTG
GCCCAAGAGC CCATCGGCGA GACCGACGAC GGACAGATCA TCGCCAACGA ACGTCTAGAC
GTTAGCGTCA GACTCCTCGG AGCAGAACTG GAAGGGAGCA ACAGAGAATG GTGCAATCCC
AGATACCGCT CGTGTCTCGG CAGCTTCCGG ACAAACGGTG GCTACCAGCG TTACGGTATC
ATCAACGATC CCGTCGAAAT GTCACTCCTG ATCAACGGCC AAACGGAGAG TGATCTCTGG
CAGTCACTGC CGGGTGATGG CGATGTGAAT CACCCTGAAG CCGAACAGGA ACTTCTGAAC
GGTCAGAACC CGTACAACGA GACAGCCACC CTAGAGAAAG ACGATCGGGC AATCGTGACT
GCAACATCGT ACCGCTGTAG TTATTACCAG TACACGGATG TACGGTTCCC CGACCTCATC
ACGTTGGGTA ACACCGAGCA AGACGCCGTC GGCAAAGAGT GCGCCAACCG TGGGAGGACA
CGGCTTTCGA TAAACGGGAA TGGCGAAGAC GACAGAGTTG TCATAAGGAA GGACGGAGAG
AGCATCCGCG GCGGCGAGGA GTTTGAGCCT CAGGCGGCAA ACTACCAGCG CAACGTCAAG
CAGATGTTGC AGGGTCGCCT GAACGAGACC GGCCACCTTG ACCTGTCGCC CGGCGAACGA
GTGCTGATTT ACGAACTCTC TGACCGAGAC GCGACCATTG AACAAGCAAG CCAGTCAGGC
GATCCCGACT ACAACGACGC CCTCGTACTG TTCAAAGTCC TGTCGAAGAA CCGCACTCTC
TCGCCGCCGG CATCCTACGA GATTACCGAT CTTAGCGCAC CGGCGAGCGT TCGACGCGGC
GACTCCGAGA CGATTACAGC GACCATCAAC AACACCGGTG GACAGGAAGG TGAGACAGAC
ATCCAGTTTG CATTCGGCGG TACGACTGTC GAGACAGAAT CGACTGGAAA ACTCGAACCC
GGTGAGAAGA CGACCGTAAC CTTCGACGTT CCCACGGGAT CGACTGGGAC CTTCGGATAC
ACCGTAACTG TCGCAGACGA GCCGACGGAA AATCGGGCAG GACAATTGAC CGTCGGTGTG
CCACGATCGC CCCACTTCCA AGTCACCCAG TTCACTCCCG ACGCAACCGC TGTTGAACGG
GGTGAGACGG TCGGGATCGA GACGACGATC ACCAATGTCG GCGCCACAAG CGACACTCAA
ACTGTCGAAC TGCGCAACGT CACTGGTAGC AACGACACGG TCGTCGACAC AGTATCTGGC
GTGTCTCTGG CGGGCAACGC CGACACCACA CTCCCTCTTG ACCTTCCGAC ACACACCAAT
GGGACCATCC GGTATACGGT ATCGACTGAG GACGACCGTG CGATCCCACA GCGAGCTTAC
GTCAACGAAT CGCACGTTAA GATCAATCAG ACTGAGGTTG GGCTGAGTAC CTACAACGAG
AGTGAACTCA TCGAACGAAA GTCAGTGCCA CGGATGACCG TGAAACTGAA CAATCGTGGC
GGACTGGGAG ACGACCGTGA TGTCAAATTA ACAATCACGA ACCAGTCCGA CGGCACTACT
CACTCCGACA CGACTACCGT AAGTGTCGGA GACGGGGAAA TAAAGCCACT GTACCCCGGA
TATGCTACGT TCGACACTTC CTCGATGGGG ATCTCTGAAG GCTACTACAG TTACTCGATC
GAGGTTGAAG ACGACGGGGC TCGCGAGGAC TACTGGACGG GTGAACTATT CGTTCACGAA
GATGGGACGG TCGAACAATC GTCTGACTCT GACTCGCCAA TCAACGTCGA CTCTGGACAG
ATCGAAGTCG GCAGCTGA
 
Protein sequence
MQYEEDGTVV GYEGGGVWES REGSTVMESP PALTYKDGSL NVQFQRVSGN ITDSNEITAQ 
VNATDERALN DELSVLLYTN LTPSNISSAT SYSLTCAPAR LQNATVFIND SQFAGGWARY
AEDRFNDRRV DVQPSNSVSP RENVSITFEL GDVSLPEFSA RNVTTKRYSD TGPLYVNATV
QNEGGLSSEQ TVTFTFEDPG GNEVSTDSRD VSLNGGESKR MKFEVPASDL SSVTAGTYDA
TIETEDESES RAVQFTSGNH VPEFQITSVS APSQGQVGDD LTVDVTVENR GSMTGTRAVA
YGFDGSTENT TMLRLHPGQT KTLSYPLSTT TDGSHDWTVQ TDGTSEQGTV FVGTGPTFTI
SDTDAPKHAG AGNTFWLNAT IGNTGQLNGT RDINLTIRND TNGNVVTDED QTVSINGTIA
DSSDEATVNA SGTITTPGTY EYKIDTGDMV QTGGFTVGPA RYPNFIVTSL RFSDDPVVKG
DQVTIEADIK NTGRVDDTQP VKISTERDIL TAPNRHVDPG DVVTVSTTVD VASPTFVLGE
ANEITVETDN NTVAQNLEVL AQEPIGETDD GQIIANERLD VSVRLLGAEL EGSNREWCNP
RYRSCLGSFR TNGGYQRYGI INDPVEMSLL INGQTESDLW QSLPGDGDVN HPEAEQELLN
GQNPYNETAT LEKDDRAIVT ATSYRCSYYQ YTDVRFPDLI TLGNTEQDAV GKECANRGRT
RLSINGNGED DRVVIRKDGE SIRGGEEFEP QAANYQRNVK QMLQGRLNET GHLDLSPGER
VLIYELSDRD ATIEQASQSG DPDYNDALVL FKVLSKNRTL SPPASYEITD LSAPASVRRG
DSETITATIN NTGGQEGETD IQFAFGGTTV ETESTGKLEP GEKTTVTFDV PTGSTGTFGY
TVTVADEPTE NRAGQLTVGV PRSPHFQVTQ FTPDATAVER GETVGIETTI TNVGATSDTQ
TVELRNVTGS NDTVVDTVSG VSLAGNADTT LPLDLPTHTN GTIRYTVSTE DDRAIPQRAY
VNESHVKINQ TEVGLSTYNE SELIERKSVP RMTVKLNNRG GLGDDRDVKL TITNQSDGTT
HSDTTTVSVG DGEIKPLYPG YATFDTSSMG ISEGYYSYSI EVEDDGARED YWTGELFVHE
DGTVEQSSDS DSPINVDSGQ IEVGS
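
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they relate, the code below strips that layout, computes a GC fraction, and translates codons with the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs from the standard table only in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and do not come from the source database.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGCAGTACG AGGAGGATGG CACGGTAGTC GGCTACGAAG GAGGGGGCGT TTGGGAGTCC"
last_line = "ATCGAAGTCG GCAGCTGA"

print(translate(clean_sequence(first_line)))  # MQYEEDGTVVGYEGGGVWES
print(translate(clean_sequence(last_line)))   # IEVGS
```

The two printed fragments match the first twenty and the last five residues of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the listed GC content.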