Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2923 |
Symbol | |
ID | 8412475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2810321 |
End bp | 2812225 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645021269 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003178735 |
Protein GI | 257388962 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3325] Chitinase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00791055 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACAGA AGGCAACGGC CCTGTCGGCA CTGGCAGTCG GAGCCAGTGC GACAGCAACA GCAGCAGACT GCAGTGGCGT CTCCGAGTGG GACGCGAGCG CCACCTACAA CGGCGGCGAT CAGGTCACCT ACGACGGTGC GCTCTGGACC GCCGAGTGGT GGACCAGCGG CACTCAGCCG GCCGAGGACG CTTCGGTCTG GACCAAGGAG GGTGCCTGTG GTGACACTCC ACCGGGCGAC GGCGACGAAG GGACGGACTG CAGCGAAGTC TCCGCGTGGG AGTCCGACGT TGCCTACACC GGCGGCGACC AGGTCACCTA CGACGACTCG CTGTGGACCG CCGAGTGGTG GACCAAGGGC ACCGAACCCG CAGAGAGCGA GAACGTCTGG ACCCTGGAGG GTCCCTGCGG TGACGGCGGC GGCGGTGGTG GCGGCGGTGG CGACGAGAAC CAGTCGCCCG ACGCCTCCTT TACCGTCTCG CCGTCTTCGC CCGAGCCGGA TGAGGAAGTG ACCCTCGACG CCAGCGGGTC CTCCGATCCC GACGGCGATT CCCTGAGCTA CGAGTGGGAG ATCGAGACCG TCGGCACCGT CGAGGGTGCG GAGAACTCGC TGACCTTCGA CGAGGCCGGT GACTACGAGG TCACGCTGAC CGTCACGGAC GCCGAAGGCG CGTCCAGTTC GGTCACCGAG ACCCTCAGCG TCGCCGAAGC ACCCGATCCG CCGAGCGACG AGTTCAAAGT CATCGGTTAC TACCCCGGCT GGAAGGCCAC GCCGGAGTAC GACTACTACC CCGAGGACAT CCCCTTCGAC AAGGTCACGC ACGTCCAGTA CGCGTTCCTC GGCGTCGACG CGGACGAGGC AGTGCCGACG ATCATGAGCG ACCAGGACCG CGAGAACCTC GAACGGTTCA AGGAGCTCAA GGACGGAGCG GCCTCCGACA CCAAGATCAC GCTCTCGATC GGTGGCTGGG CGGACTCGAC CGGCTTCTCC GAGATCGCCG CGACCGAGAG CAATCGACAG TCCTTCGCCG ACCGGTGTGT CGAGATCCTC CGGAAGTACA ACCTCGACGG CATCGACATC GACTGGGAAC ACCCAGGTAG CTCCCAGGGC AAGTGCCAGT GTGGGAGCAA CGAGGACTAC GAGACTCACG TCGACCTCCT GCAGGCGCTT CGAGACACGC TCGACGCGGC CGCCGAGGAA GACGGCAAGT ACTACGAGCT GTCCGTCGCG AACGGTGGCT CGGACTGGAA CGCCGGTGGC CTCCGCCACG GCGACATCGG CGAGATCTGT GACTTCGCCT CCATCATGGC CTACGACTTC ACGGGCTCGT GGATGGACGT AGTCGGTCAG AACGCGCCGC TGTACGGCGA CTCTCACCCC ACGGAGAACA GCCAGTACGG CGAGACCTAC ACCGCCCAGT ACTTCGTCGA GTACTCGGTC GACAAGCTCT ACGCTGGCGA CCACGGCGAG ACGGGCTACT GGCCCGGCCA GTGGGAGTAC CCGCCGGCCG AGCCCGCAGA GTACGACGAA CTGGTCCTCG GCCTGCCGTT CTACGGCCGC GGCTTCAACG GCACCGAGAT GTACGGCAAC TACAGCGGCC TCCCGGAGGG CACGTGGCAC GACCAGCTCG AAGACGGAGC CGACCCGACC GGCGCGTTCG ACTTCGGTGA CCTCGAAGAG AACATCGAGG GCGCGGACGG CTGGACGAAG AAGCGCCACG ACCCCGGTGC GGTCCCCTAC ATCGTCAACG AAGACGAAGA GACGATCATC AGCTACGACG ACGAACAGGC CATCGAGGAG AAGGTCGAGT TCGCGAAGGA ACGAGGCATG CAGGGCGTCA TGTTCTGGGA ACTCTCCCAG GACTGGAACC AGACGCTGCT CGACGCGATC AACCGGACCG CGTAG
|
Protein sequence | MLQKATALSA LAVGASATAT AADCSGVSEW DASATYNGGD QVTYDGALWT AEWWTSGTQP AEDASVWTKE GACGDTPPGD GDEGTDCSEV SAWESDVAYT GGDQVTYDDS LWTAEWWTKG TEPAESENVW TLEGPCGDGG GGGGGGGDEN QSPDASFTVS PSSPEPDEEV TLDASGSSDP DGDSLSYEWE IETVGTVEGA ENSLTFDEAG DYEVTLTVTD AEGASSSVTE TLSVAEAPDP PSDEFKVIGY YPGWKATPEY DYYPEDIPFD KVTHVQYAFL GVDADEAVPT IMSDQDRENL ERFKELKDGA ASDTKITLSI GGWADSTGFS EIAATESNRQ SFADRCVEIL RKYNLDGIDI DWEHPGSSQG KCQCGSNEDY ETHVDLLQAL RDTLDAAAEE DGKYYELSVA NGGSDWNAGG LRHGDIGEIC DFASIMAYDF TGSWMDVVGQ NAPLYGDSHP TENSQYGETY TAQYFVEYSV DKLYAGDHGE TGYWPGQWEY PPAEPAEYDE LVLGLPFYGR GFNGTEMYGN YSGLPEGTWH DQLEDGADPT GAFDFGDLEE NIEGADGWTK KRHDPGAVPY IVNEDEETII SYDDEQAIEE KVEFAKERGM QGVMFWELSQ DWNQTLLDAI NRTA
|
| |