Gene Hmuk_2922 details


Gene Information

Locus tag: Hmuk_2922
Symbol:
ID: 8412474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2808060
End bp: 2810033
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 65%
IMG OID: 645021268
Product: glycoside hydrolase family 18
Protein accession: YP_003178734
Protein GI: 257388961
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3325] Chitinase
TIGRFAM ID: [TIGR01634] phage tail protein, P2 protein I family
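
The start, end, gene length, and protein length listed above agree with each other. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(2808060, 2810033)
print(length)                  # 1974
print(protein_length(length))  # 657
```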


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.183514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGTC GCAACTATCT ACAATCGCTT TCGGCGCTGG CCGGTCTGGC CGGCGTCTCC 
GCGGTAACAG CACAGGAAGA GTATCCGGCG TACGATTCGA GCGCGACCTA CAACGGTGGC
GATCGAGTCG TCTACGAGGG ATACATCTGG GAAGCACAGT GGTGGACCAA AGGAACGGCA
CCGAGCGCAG ACAAGGCTGT CTGGGAGAAA GTCGGACCCG CTGACGGAGG CGGCGGGGAC
GACGGCGGCT CCGACGACGG CGGCAGCAGC GACATTCCTG CCTACGATTC GAGCGCCACC
TACACCGGTG GCGACCAGGT CACCTACGAC GGGTTCGTCT GGGAGGCCGA GTGGTGGACC
AAGGGTACCG AACCCTCCGA GAGCGCGAAC GTCTGGACGA AGGTCCGCGC CGTCGACGAC
GGCGACAACG GCGGCGACGA CGGTGGATCC TCCGACCTCA ACGCCGTCAT CGACGCCAGC
GCGACGCGCG TCGACGTCGG TGAGGACGTC ACGCTGGACG CCAGCGGCTC CGAGGGCGAC
ATCGAGTCCT ACGAGTGGAT GGTCGGCGAC CAGGGTCCGA TCTCCGGCGT CGAGAACACG
GTCACGCTCG ACGAGGAAGG CACCTACGAG GTCACGCTGA CCGTCACCGA CGCAGACGGC
AACGAGGCGA CCGCGACCCG ATCCGTGTTC GTCGGCACCG CGGGCGGCAC GCAGCCCGGC
GACAAGCGAG TCGTCGCCTA CTACCGACAG TGGGCACAGT ACGACCGCGA GTACACCCCG
TCCGACATGC CCCTGGACAA CATCACGCAC GTCCAGTACG CGTTCGCGCG CCCGGAGGAG
GACGGCTCCG TCAACCTCGT CGGCGACAGT CACGGCCAGC AGGCGTTCTG GGACCAGAAC
ACCGACTGGC GTGACGCACC CGGCGGAAAG AGCATCGCCG AGCTCGCAGA AGAGAACGAA
GACACCAAGT TCACGCTCTC GATCGGTGGC TGGGGCGACT CCGAGTACTT CTCGTACGCC
GCAGAAACCG AGGAGAACCG CCAGCGCTTC GCCGACCAGT GTGCCGAGTG GGTCGACCGA
GGCAACCTCG ACGGCATCGA CATCGACTGG GAGTTCCCCC ACGGCGGGGG CTGTCAGGGC
GACGGCGGCG AGGCGTGTAA CAAGGAGAAC GTCGAACGTC CCGAAATCGA CATTCCGAAC
TTCACGAAGC TGTGTCAGGC GGTCCGCGAT CGCCTCGACG AGAAGGCGGC AGAGGAAGGT
CGCGAAGAGC CCTACGAGGT CACCGCTGCG GTCAACGCGG ACCCCGAGGC GATGGCCGAC
TACGAGCACG AGGCCCTGTC GGACATCCTC GACTTCATCC TCGTGATGAC CTTCGACTAC
GCGGGTATCT GGAGCGAGTA CACCCGCCAT CACGCCCCGC TCAAGGAGAA CCCGGACAAC
CCGTTCGAGA AGTCCGACAG CTGGAACGCC TCCTACGCTC TCAGCTGGTT CGAACAGCAG
GGCTGGTCGC CGGACCAGCT CAACATGGCC GTCCCGTTCT ACGGGCGTAG CTGGAGCAAC
GTCAACGACC CCGACGGCGA GGGCAACGGC GAGGACGACG GTCTCTTCCA GAAGTTCGAC
GGAGAGGACG GCAACGCCAG CGGCGACGGT AGCTTCGGTA CTATCGGTGG TATCTACGAG
TACTACGACC TCGCCGGTGG CTCCCGTGGC GGCTCCAGTA TCATCGACGG CGACGACTAC
GAGACCTACA TCGACGAGGA CGCCATGACG GCCTACAGCT ACAACCCCGA CAAGGGCGGT
GGCTACAACA AGGCCAGCGG CGAGATGATC TCCCACGACA CCGTCGAGAC CATGGAGATG
AAAGCCCAGT GGCTCCGCGA CTCGCCGTAC GGCGGGACGA TGCTGTGGGC CATCGGTGGC
GACACGAAGG ACGGCGAACT GATCAGCACG CTCTGGAACA CGCTCAACGA ATAG
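
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any analysis. The sketch below is one possible way to strip the layout whitespace, confirm the ATG start and TAG stop codons visible in the listing, and estimate GC content. The helper names are hypothetical, and the short string used here is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq: str, start: str = "ATG", stop: str = "TAG") -> bool:
    """True if the sequence begins with `start` and ends with `stop`."""
    return seq.startswith(start) and seq.endswith(stop)


# First two blocks of the listing, used only to illustrate the cleaning step.
example = clean_sequence("ATGCAACGTC GCAACTATCT")
print(example)
print(gc_fraction(example))
```

Pasting the full listing into `clean_sequence` and passing the result to `gc_fraction` should give a value comparable to the GC content reported in the Gene Information section.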
 
Protein sequence
MQRRNYLQSL SALAGLAGVS AVTAQEEYPA YDSSATYNGG DRVVYEGYIW EAQWWTKGTA 
PSADKAVWEK VGPADGGGGD DGGSDDGGSS DIPAYDSSAT YTGGDQVTYD GFVWEAEWWT
KGTEPSESAN VWTKVRAVDD GDNGGDDGGS SDLNAVIDAS ATRVDVGEDV TLDASGSEGD
IESYEWMVGD QGPISGVENT VTLDEEGTYE VTLTVTDADG NEATATRSVF VGTAGGTQPG
DKRVVAYYRQ WAQYDREYTP SDMPLDNITH VQYAFARPEE DGSVNLVGDS HGQQAFWDQN
TDWRDAPGGK SIAELAEENE DTKFTLSIGG WGDSEYFSYA AETEENRQRF ADQCAEWVDR
GNLDGIDIDW EFPHGGGCQG DGGEACNKEN VERPEIDIPN FTKLCQAVRD RLDEKAAEEG
REEPYEVTAA VNADPEAMAD YEHEALSDIL DFILVMTFDY AGIWSEYTRH HAPLKENPDN
PFEKSDSWNA SYALSWFEQQ GWSPDQLNMA VPFYGRSWSN VNDPDGEGNG EDDGLFQKFD
GEDGNASGDG SFGTIGGIYE YYDLAGGSRG GSSIIDGDDY ETYIDEDAMT AYSYNPDKGG
GYNKASGEMI SHDTVETMEM KAQWLRDSPY GGTMLWAIGG DTKDGELIST LWNTLNE