Gene Hmuk_3148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3148 
Symbol 
ID8412701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3035898 
End bp3037589 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content69% 
IMG OID645021495 
Productglycoside hydrolase family 18 
Protein accessionYP_003178960 
Protein GI257389187 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00268374 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGAAA CACGACGCGA CGTTCTTCGG AACGCCTCGG CGCTGGTCGC AGCGTTGACC 
GGCGCTGGCA CAGCAGCGGC ACAGCAATCA CCCCCGGCCT ACGACGACGG CACGGTCTAC
ACCGGCGGGG ATCGAGTGAC CTACGACGGT TCCGTCTGGG AGGCCAGTTG GTGGACCCAG
GGGACGCCCC CGTCGACGGA CGCGGCGGTC TGGGAACGCA TCGACGCCGA CGACGGTGGC
GGTGACGACT CCGGCGACGG CTCGTACCCG GCCTGGGACG CCAGCGTCGC CTACAGTGGC
GGGGACCGGG TGACTCACGA CGGCTCCGTC TGGGAGGCGG CGTGGTGGAC GAAAGGGACC
GAGCCCGCCG CCGACGCGAC CGTCTGGGCA CTGGTCGAGA GCGACGACGG CTCCGACGGC
GACGAGAACG GGGCACCGAC GGCCGTCGTC GGCGTCACTC CCTCGGCACC GACGCCGGGC
GAGACGGTGA CCTTCTCCGG GGCCGACTCC TCGGACCCCG ACGGCGACGC GCTGGACTAC
GAGTGGACCA TCGACGGCGA CACGGTGACC GGCGTCGAAC ACACGCGGTC GTTCGAGTAC
GCCGGGGAGT ACGCCGTCAG CCTGACCGTC ACCGACGGCT CGGGGGCCAG TGACTCGGCC
ACGACGACGG TGTCGGTCTC CGAGGGAGAC GGGTCCTCGC CGGGCTGGGA CGAGCACCCG
GTCATGGCCT ACGAGTTCCA GGGGGCAGCC CCGGCCGACA AGCTCACCCA CGTCATCCAG
ACGTTCGGCA GCGTCAGCGC CGACGGCTCC GTCTCGGCAC CGTGGGACGC CGGGGGTATC
GACGGCGTCA CCACGCTCCT CTCGATCGGC GGCTGGAAGA ACTCCCAGGG GTTCCCGGAA
CTGGCGAGCG ATCGGGAGAG CCGGGAGACG TTCGCCAGCG AGTGTGTCGC GCTCCTGCGT
GACCGCGATC TCGACGGCAT CGACATCGAC TGGGAGTTCC CCGGTCCGTA CGGTCCCGAC
GGACTCACGT CGTACCCCGA CGACGAGGAG AACTTCGCCG CGCTGATCGA GGAGTGTCGC
CGACAGTTCG ACGCGGCCGC CGCCGAGGAC GACACCGAGT ACTACCTCAC AGCGGCGCTG
AACCACGCGG AGTCACATCT CTCCGGACTC CCCCACGACC GGCTCGCCGA CGCGCTGGAC
TACGCGAAGA TGATGACCTA CGACATGCAC TCGCCGGTCT GGGTCGACGA GACCAACCAC
AACTCGCCGC TGTACGCCAC CTCCGGAGCC GACAGCGACG ACTCGATCCA CAACACCCTC
ACGTACATGC GCGAGCAGGG ATGGCCCGCG GAGAAGCTCG TCATGGGACT GGCCTTCTAC
GGCCGCGAGT TCACCGGCGT CGAGAGCACG GCCAACGACG GGCTGCTCCA GCCCTTCGGA
GGCAGCGGCG GGGCCACCGG CTTCGCCGAC ATCGACCAGC AGTTCGGGGG CCACACCCGC
TACTGGGACG ACGAGGCGAA GGTGCCCTAC AAGTTCGACG GCAGTAGCCT GCTGAGCTAC
GACGACGAGG AGTCGGTCGC GGTCAAGGCC GACTACGCAG ACGAGCGGGG CCACCCCATC
ATGTACTGGG CCTCCGGTCA CGACCCCAAC GAGACGCTGA TCGACGCGGT CAACGACGCG
CTCGGGAAGT GA
 
Protein sequence
MRETRRDVLR NASALVAALT GAGTAAAQQS PPAYDDGTVY TGGDRVTYDG SVWEASWWTQ 
GTPPSTDAAV WERIDADDGG GDDSGDGSYP AWDASVAYSG GDRVTHDGSV WEAAWWTKGT
EPAADATVWA LVESDDGSDG DENGAPTAVV GVTPSAPTPG ETVTFSGADS SDPDGDALDY
EWTIDGDTVT GVEHTRSFEY AGEYAVSLTV TDGSGASDSA TTTVSVSEGD GSSPGWDEHP
VMAYEFQGAA PADKLTHVIQ TFGSVSADGS VSAPWDAGGI DGVTTLLSIG GWKNSQGFPE
LASDRESRET FASECVALLR DRDLDGIDID WEFPGPYGPD GLTSYPDDEE NFAALIEECR
RQFDAAAAED DTEYYLTAAL NHAESHLSGL PHDRLADALD YAKMMTYDMH SPVWVDETNH
NSPLYATSGA DSDDSIHNTL TYMREQGWPA EKLVMGLAFY GREFTGVEST ANDGLLQPFG
GSGGATGFAD IDQQFGGHTR YWDDEAKVPY KFDGSSLLSY DDEESVAVKA DYADERGHPI
MYWASGHDPN ETLIDAVNDA LGK