Gene Hmuk_1951 details


Gene Information

Locus tag: Hmuk_1951
Symbol:
ID: 8411479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1859934
End bp: 1862858
Gene Length: 2925 bp
Protein Length: 974 aa
Translation table: 11
GC content: 62%
IMG OID: 645020282
Product: hypothetical protein
Protein accession: YP_003177771
Protein GI: 257387998
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
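
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming a complete CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1859934, 1862858)
print(length)                  # 2925
print(protein_length(length))  # 974
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.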


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGAA CTAACGATAA GGTTCGCAGC CTGTTCCTCT CTGCGCTGAT GGTCATGTCC 
GTCGTCGCGA TGGGCACGGC TTTCACTGCA GGAGCAGTTG CGGACACTAG CGGTAATGCG
ACTGTCGAAA GCCTCGAGGT TACGCAAAAC CGCGTCCTCG AAGCTGACGA CTCTCGTAGT
AGCGACATTC AAACACAGAC CGTCGAATTC GACGCATATC TCGAATCCGG CGACACGGAA
GAGATCACGA TCGACACGGC CGACGCTGAA GACAACGGCC TGACGATCGA CGACGTTTCC
GTCGACGGCT CCACGTCTAG CGACGTTTCC GTCGAGGACG TCGACTTCGA CGACGACCAG
ATCACCTTCG ATCTGAGCGA GGACGCCGGT GGCTCCACGG TCAATGATGC CACCGTCGCC
CTCAACGTCG ACTTCGACGT GAGTGACGGT GAGAACGCAA ACCGTCTCGA ACACGTCGTC
ACGGCAACTG GTGACGACGG GTCCACGACG CTCTCCCCGG TCACCGCCAC GTACAAGATT
GCCGGCGATT ACTCGAAGAC CGTCAACGGC GAGGATTCGG TCTTCGCTGG TGAGACGATC
CGATTCACCC CCGGTGCCGA AAACACCCAG ATCGAGGTCT ACACGACCGA CGACGAGGGT
GCTCCGACGA ACAACCGTGT TGCAAACTTC AACACGGCGT TCGACACCGC GATCAACTTC
GACACGTCGA ACCTCGACAC TGGCGAGACG TACAACGTTC AGATCAGTGG CAGTGGTGTC
ACCGATACTG ACTACGATCT CGCTGTCAAC GACCTCGGCC TCGACGCCGA AGCACGTGAC
ACGCAGATCT CCACGGAAGA CCGCATCGTC GCGGACATCA CGTCGAACGC GAACCTCGAC
GACCCGTGGA CGGCTACGCT CCTCGACAGC GACGGTGAGG CCGTCGAGGA CGACAACGGT
GATGCAATCG AACGAACTGG TAGCTTCTCC GGCTCCGGTG CAGCAACCGT TCGATTCACC
GCTCCGACCG ACTCTGACCA CTACGGCACT GGCAACTACT CGGTCGAAGT CACGCACGAC
GACACCGGCA TCACGTCGGA AACCGACTCG ATCGAAGTCT CGGAAGCCGG TGACGGCGAC
GTCTCCTTCG CTGGCGACGG CGTCTTCACC GAAGAAGCCG GTGACGTCGC GAACGTCACG
GTCGAGATGA CCAACACCGA CGAAGCGACG GTCACCATCG GTACCGAAGA TCAGGGTTAC
TACATCGTCT ACCAGGTCTC TGACGAGAGC GGTGACGGTC AGGTCTCCCT CGAATTCAAC
AGCTACACCG CAGGCCGCAC GTCCCAGGAC AACGTCGTCA GCGTCGCTAA CTCTGACGAC
GAGATCGAGT TCCTCACGCA GGGTGGCGAC TTCACCAACA CCTCCGAGGC TGTCGGCTCC
GACACCCTCG ACCCGACGGA ATACGAGATG AACGCTTCCG TCGGCCACGT CGACATGGGC
ACCGACGACT ACACTGACTC CGACGCGCTC GGCTCGCTCT CGCTGCAGCC GCGCTCGACC
GACGGCATGC AGTCGTGGGT CACGCCGAAG GGCACCGACA TCAGCGACCT CAGTGATGAC
GTTGGCAACA TCTACGACGA GATCGGCTCG TCGATCACCG AGTCCGGCGA AGTCGCCTAC
GACGACGGTG ACGCCGACGG CGACACCGTC ATCCTCCAGC TCGAAGCTGG CGGTATCGAG
GGTGCCATCG ACCACGCAAA TGGTCTGAAG ACCCTCATCG ACAACAACCA GGACAACGAC
GACGATGAAG CGATCGACCT CACGATCGAG CAGACCAACG AGGATGCAAA CCGGAACGCA
AAGACGCTCG ACTTCGTGAA CGACGACGCG ATCTCGGTCA TCGAAGACCC CGAGAACAAC
TCGTACTTCA TCGCCGTTGA GCTGTCGAAG GTCGACCAGC CGACCCGCGA CCTCGCGGAA
GGCGACAACT TCGAGGCGAC ATTCACCATC GAGGACGACA ACGTCCTCAA CACGATCGAC
GACAACGAGG AAGTCACCAG TGACTTCACG CTCGTCGAAC CGAACACTGA GCTCGTCACG
AACTCTGACG ACCTCATCCT GATCGAGTCC GCGTCCGGTC AGACCATCTC GGGTAGCACC
AACTACGCGC CTGGCACCGA CCTCAACGTT CGGGTCAAGT CCAGCGACAC CGCGAGTCCG
TTCCAGACTC GCCCCGAGGC AACCGTCCAG ACGGACGGTA CCTTCACGAC CGAAGGTGCT
GACTTCAGCG AGGTCTCGCC CGGGACCAAC CTGACGCTGC AGACGCGTCG CGGTGGCTCC
GCGGTCGGTG ACGAGTACGA CGGTCGCATC GGCGAAGTCC CGACTGCCTC CGTCAGCATC
AGCAACCAGA CCACCGACGG CTCCACCGTG ACGGTCGACT CGGTTACGAC CGAGAACGGT
GGCTTCGTCG CGATCCACCT CAACAACGCC AGCGGTGAGG TCATCGGCAA CTCCGAGTAC
CTCGACTCTG GCACCCAGCA GGACGTCGAG ATCAGCCTCG ACTCCGCTCT GGACGAGAAC
GCGACGGTCG TCGCGATGCC CCACCAGGAC ACCAACGACA ACGAAGAGTA CGACTTCGGC
GATGGTGACG GTGCGGACGG TCCGTACACC GAGAACGGCT CGGCAGTGAC CGACAGCGCG
TCGATCACGA TCCAGACCAC CGAGGACACG CCGACGCCGA CGGACACGCC GGAAGACACG
CCGACGGACA CGGCGACGCC TGACGAAGGC GAAGACATGA CGGACACGCC GACTGACGAC
GGCGGCGACG AGACCACCAC CGGCGACGGT GCTGGCTTCG GCGCAGTCGT CGCACTCGTC
GGCCTCCTCG CTGCTGCGCT GCTCGCAACG CGGCGCAACA ACTGA
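
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the fields in the Gene Information section. The following is a minimal sketch of that cleanup. The helper names are hypothetical, and only the first line of the sequence is used as input to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a small example input.
first_line = "ATGACAGGAA CTAACGATAA GGTTCGCAGC CTGTTCCTCT CTGCGCTGAT GGTCATGTCC"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Applying the same two functions to the full sequence text should give a length and GC fraction that can be compared with the Gene Length and GC content fields.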
 
Protein sequence
MTGTNDKVRS LFLSALMVMS VVAMGTAFTA GAVADTSGNA TVESLEVTQN RVLEADDSRS 
SDIQTQTVEF DAYLESGDTE EITIDTADAE DNGLTIDDVS VDGSTSSDVS VEDVDFDDDQ
ITFDLSEDAG GSTVNDATVA LNVDFDVSDG ENANRLEHVV TATGDDGSTT LSPVTATYKI
AGDYSKTVNG EDSVFAGETI RFTPGAENTQ IEVYTTDDEG APTNNRVANF NTAFDTAINF
DTSNLDTGET YNVQISGSGV TDTDYDLAVN DLGLDAEARD TQISTEDRIV ADITSNANLD
DPWTATLLDS DGEAVEDDNG DAIERTGSFS GSGAATVRFT APTDSDHYGT GNYSVEVTHD
DTGITSETDS IEVSEAGDGD VSFAGDGVFT EEAGDVANVT VEMTNTDEAT VTIGTEDQGY
YIVYQVSDES GDGQVSLEFN SYTAGRTSQD NVVSVANSDD EIEFLTQGGD FTNTSEAVGS
DTLDPTEYEM NASVGHVDMG TDDYTDSDAL GSLSLQPRST DGMQSWVTPK GTDISDLSDD
VGNIYDEIGS SITESGEVAY DDGDADGDTV ILQLEAGGIE GAIDHANGLK TLIDNNQDND
DDEAIDLTIE QTNEDANRNA KTLDFVNDDA ISVIEDPENN SYFIAVELSK VDQPTRDLAE
GDNFEATFTI EDDNVLNTID DNEEVTSDFT LVEPNTELVT NSDDLILIES ASGQTISGST
NYAPGTDLNV RVKSSDTASP FQTRPEATVQ TDGTFTTEGA DFSEVSPGTN LTLQTRRGGS
AVGDEYDGRI GEVPTASVSI SNQTTDGSTV TVDSVTTENG GFVAIHLNNA SGEVIGNSEY
LDSGTQQDVE ISLDSALDEN ATVVAMPHQD TNDNEEYDFG DGDGADGPYT ENGSAVTDSA
SITIQTTEDT PTPTDTPEDT PTDTATPDEG EDMTDTPTDD GGDETTTGDG AGFGAVVALV
GLLAAALLAT RRNN
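
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one hedged way to do that without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it assumes the sequence contains only A, C, G and T and starts in frame. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
# Table 11 uses the same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence.
print(translate("ATGACAGGAA CTAACGATAA GGTTCGCAGC"))  # MTGTNDKVRS
print(translate("GGCCTCCTCG CTGCTGCGCT GCTCGCAACG CGGCGCAACA ACTGA"))  # GLLAAALLATRRNN
```

Both outputs match the corresponding start and end of the protein sequence listed above. A sequence with ambiguous bases such as N would raise a `KeyError` in this sketch.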