Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0210 |
Symbol | |
ID | 8409708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 207969 |
End bp | 210194 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645018535 |
Product | alpha amylase catalytic region |
Protein accession | YP_003176054 |
Protein GI | 257386281 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0538024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGC TTGGGACCGC TGCAGCGGCC TTCGGAATGA GCCGGAGTGT CGAGGCAGAG ATCAACGGAT ATCATACACT GGCTACCGAA GACAACCGTC TGGGTCCGGT CACACCACGT GATACGGTGT ATCAGATCCT GACAGATCGA TTCTACGACG GCGACACGAG CAACAACGAC TTCGAGGATT TCGATTCCTC ACTCTACGAC GGCAGTGGAG AGGACCTCAA GCTGTACCAG GGCGGTGACT GGCAGGGGAT CATCGACAAG ATACCGTATC TCGAGGAGAT GGGTGTGACA GCGGTCTGGA TTTCCGCACC CTACGACAAC CGAGAGAGCA AGATCGAAGA CTACCAGGAC GACGGTACGG TCGACGTCTG GACCAGTTAT CACGGCTACC ACGCGCGCAA CTACTTCCGG ACCCACCGGT TTTTCGGCGG GATGCAGGAC TTCTACGCGA TGCGCGACGC GTTGCACAAC AACGGCATCA AACTCGTCAT CGACTTCGTC TCGAACCACA CGAGCCGCTG GCGGAATCCG ACGAAGAACA ACGAGCCCGA GGAAGGAGAA CTCTACGAAC CGGACACGGA TGCAGACGGC AATTACGTCT TCGACGAGAA CGGGGACGCC GTCGGTGAGA ACTTGCTGGC AGATCCACAC GACGATGTGA ACGGCTGGTT CCACGGACTG GGTGACCGTG ACGGGGACTC GTCGAAGTTC GGCTATCGCC ACAAGGAACT GGGCTCGCTG GCCGACTTCG CACACGAAAA CGGGGGGGTC GTCGACCACC TCGAACGAGC GACGCAGTTC TGGAAGTCGA AAGGGATCGA CGGAATTCGT CACGACGCAA CGTTGCACAT GAACCCGGCA TTCGCAAAGA ACCTCAAGAC CGCGACCGAC AGTTCGCAAG GCGGTCCGAT CACGCACTTC GGGGAGTTCT TCATCAGCCG ACCCGACCCG AAATACGAGG AGTACCGGAC GTTTCCCGAC AGAACCGGCA TCAACAACCT CGACTTCGAG TTCAACCGGG CGGCGACGAA CGCGTTCGGC GACTTCTCCG AGACGATGTC CGACTTCGGA GACATGCTGA TCAAGACCCA CGGCGACTAC ACGCACGAAC ACCAGACGAT CACTGCCGTC GACAATCACG ACCTCACTCG GTTCCGGTAC ATCCAGCCAA ACGACAAGCC GTACCACGCG GCGATCGCCG CGTTGATGAC CTGCCGCGGA ACGCCGAAGA TCTACTACGG GACCGAGCAG TACATGAATC CCGGATCGAG CGGTGCGAAC GCAGGTCGGC TGTTCATGCA AACCGACAGC GATTTCGACA CGTCGACCAC GGCCTATCAG TTGATCAGCG ACCTCGCACA GCTCCGGCGT GACAACCTCG CGCTCGCGTA CGGTCAGACG AGTATTCTCC ACTCGACAGA CGACGTGATC GTCTACGAGC GGACCTTCTA CGATCACGTG GTCGTGACGG CGATCAATCG ACAGCCCGAC CAGTCCGAGA GCGTCCCGTC GGTCGGTACG AGCCTGCCGG ACGGTAGTTA CGGCGACTAC CTCTCGGGGG ACCTCTACGG ACAGGGCATC TCGGTCGACA GCGGTGCACT GGACTCCTTC ACGTTGGGCG GTGGAGAGGT CAGCGTCTGG ACAGCGAGCC CCGACCTGGG GAATGCGCCG AAGATCGGAA CGACGGTCAG CACGATGGGA CAGGCCGGCG ACTCCGTCCA CATCTACGGA TCAGGTCTCG ACGGCGACGT GTCAGTCACG TTCGGCGGTA CGCCCGCGAA CGTCGTGTCC AACAGCGCGA CCCGTATGAC GGCGACAGTC CCCGATGGGC CCGCCGGACT CGTCGACGTG GTCGTCGAAA AGAACGGCCA GACGAGCAAC GCCGCGAAAT ACGACATCCT CACGGACGAG CCGGTACAGG TGATCTTCCA CGTCGAGGCC GAGACCGAAC CCGGCGAGAC GATCCACGTC GTCGGTGATC CACCGGAACT GGGTAGCTGG GATGCCCTGT CGGGCAGCGA ATCCTTCATG AACCCGGACT ATCCGGAGTG GTTCCTCCCA GTGAGCGTGC CAAAGGACAC GACCTTCGAG TTCAAATTCG TCAAGATCGA CGAGAGCGAC AACGTGACCT GGGAAAGCGG GAGCAACCGG ACCTTCACGT CACCCACGGA TTCGACCGAG ACCGAAGATA CGCCGCTCTA CTACTGGCAA AGCTAG
|
Protein sequence | MKTLGTAAAA FGMSRSVEAE INGYHTLATE DNRLGPVTPR DTVYQILTDR FYDGDTSNND FEDFDSSLYD GSGEDLKLYQ GGDWQGIIDK IPYLEEMGVT AVWISAPYDN RESKIEDYQD DGTVDVWTSY HGYHARNYFR THRFFGGMQD FYAMRDALHN NGIKLVIDFV SNHTSRWRNP TKNNEPEEGE LYEPDTDADG NYVFDENGDA VGENLLADPH DDVNGWFHGL GDRDGDSSKF GYRHKELGSL ADFAHENGGV VDHLERATQF WKSKGIDGIR HDATLHMNPA FAKNLKTATD SSQGGPITHF GEFFISRPDP KYEEYRTFPD RTGINNLDFE FNRAATNAFG DFSETMSDFG DMLIKTHGDY THEHQTITAV DNHDLTRFRY IQPNDKPYHA AIAALMTCRG TPKIYYGTEQ YMNPGSSGAN AGRLFMQTDS DFDTSTTAYQ LISDLAQLRR DNLALAYGQT SILHSTDDVI VYERTFYDHV VVTAINRQPD QSESVPSVGT SLPDGSYGDY LSGDLYGQGI SVDSGALDSF TLGGGEVSVW TASPDLGNAP KIGTTVSTMG QAGDSVHIYG SGLDGDVSVT FGGTPANVVS NSATRMTATV PDGPAGLVDV VVEKNGQTSN AAKYDILTDE PVQVIFHVEA ETEPGETIHV VGDPPELGSW DALSGSESFM NPDYPEWFLP VSVPKDTTFE FKFVKIDESD NVTWESGSNR TFTSPTDSTE TEDTPLYYWQ S
|
| |