Gene Information |
Locus tag | Hmuk_0843 |
Symbol | |
ID | 8410358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 814031 |
End bp | 815710 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645019179 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003176681 |
Protein GI | 257386908 |
COG category | |
COG ID | |
TIGRFAM ID | |
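
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table for Hmuk_0843.
START_BP = 814031
END_BP = 815710


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values match the Gene Length (1680 bp) and Protein Length (559 aa) fields of the record.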
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATTC TGGTCAACCA ACTCGGTTAC GAGACCGATG GCCCGAAACG AGCGGTCTGT CGCGCGACAG AGCGCCACGA CCTCGATGGC TTCGTGTTGC ACGACGGGGA CAGCGTCGTC TTCGAGGGGA CGCCGGAGTT CGTCGGCGGC GTCGCCGACT GGGGAGAGTG GGTCTTCTGG ACGCTGGAGT TCTCCTCGCT CACCGAGCCC GGCGAGTACA CGCTCCGGGC CGGCGAGGCC CACTCGCGGC GCTTCGAGAT CGGCGAGGAC CTCCACAAGG AGACGCTCCT CTCCGATCTG CTGTACTACT GCAAGACCCA GCGTGCCAGC GGCGAGTACG ACCGGGCCGA CCGATCGGTG CCGTTCGTGG GCGACCGCGA GGGGACGGTC GACGTACGCG GCGGCTGGTA CGACGCCTCG GGCGACATGA GCAAGTATCT GAGCCACCTC AGCTACGCCA ACTACTTCAA CCCACAGCAG ATTCCGATCG TCGCCTGGGG ACTGGCCGAC GCCCGTGACA GACTGCACGA CGGCGACCAC AGGCTGGGCG GGGAACTCGA CGCGCGACTG CGCGAGGAGA TCCACCACGG CGCGGACTTC CTCGTCCGGA TGCAGGACGA CGCGGGCTAC TTCTACATGA CCGTCTTCGA CCAGTGGTCC AAGGACGTGG ACCGCCGAGA GATCTGTGCC TACGAGACCG AAGAGGGACA CAAGACGATC GACTACGAGG CCGGATACCG CCAGGGCGGA GGGGTCGCCA TCGCCGCGCT GGCTCGCGCC AGCCAGGTCG AGGGACCGGG CGCGTTCGAC CGTGAGACCT ACCGCGAGGC CGCGGTCGAG GGGTTCGATC ACCTGCAGGC CCACAACACG GAGTACCTCG ACGACGGGAC CGAGAACGTC ATCGACGACT ACTGCGCGTT GCTCGCCGCC ACGGAACTGG CCGCCGCCAC GGACGACGAG CGGTTCCGGA CGGCCGCCAG AGAGCGCGCT CAGTCGCTGC TGGACCGCCA GACCGGCGAC GACCGCTACG ACGGGTGGTG GCGTGCCGAC GACGACGATC GCCCCTTCTA TCACGCGTCC GACGAGGGAC TGCCGATCGT CGCGCTGTTG CGCTACCGGG CGGTCGACGC CGACGGGCCG CTGGACGACG CCATCGTGAA CGCGATCGAG CGCTTCTGGG GCTTCCAGAC GACGGTCGGC GACGAGGTCA CCAACCCCTT CGACTACCCG CGCCAGTACG CCAGACCGGT CGACGAGGAC GAACCGCGCG CGTCGTTCTT CATGCCCCAC GAGAACGAGA CCGGCTACTG GTGGCAGGGC GAGAACGCCC GGATCGCCTC GCTTGCGACC GCGGCCGCGC GGTCGCGGGC ACAGCTCGAC GCGGAACTGG GCGAGCGCCT CGATCGGTTC GCCCAGGCCC AACTCGACTG GATCCTCGGG TCGAACCCCT TCGGCGTCTG CATGGTCCAC GGCGTCGGCG CGCCGGAGCC GACCTACCAC CGCCAGTTCC GCAACGTCCC CGGCGGCGTC CAGAACGGTA TCACCGCCGG CTTCGAGAAC GAGGCCGACA TCGCCTACTG TCCCGAGCCC TGGGGCGACG ACCACGCGCA CCGCTGGCGC TGGGCCGAGC AGTGGATTCC CCACTCGGCC TGGCTGTTCC TGGCGGTCAG CTCGCTGTAG
|
Protein sequence | MEILVNQLGY ETDGPKRAVC RATERHDLDG FVLHDGDSVV FEGTPEFVGG VADWGEWVFW TLEFSSLTEP GEYTLRAGEA HSRRFEIGED LHKETLLSDL LYYCKTQRAS GEYDRADRSV PFVGDREGTV DVRGGWYDAS GDMSKYLSHL SYANYFNPQQ IPIVAWGLAD ARDRLHDGDH RLGGELDARL REEIHHGADF LVRMQDDAGY FYMTVFDQWS KDVDRREICA YETEEGHKTI DYEAGYRQGG GVAIAALARA SQVEGPGAFD RETYREAAVE GFDHLQAHNT EYLDDGTENV IDDYCALLAA TELAAATDDE RFRTAARERA QSLLDRQTGD DRYDGWWRAD DDDRPFYHAS DEGLPIVALL RYRAVDADGP LDDAIVNAIE RFWGFQTTVG DEVTNPFDYP RQYARPVDED EPRASFFMPH ENETGYWWQG ENARIASLAT AAARSRAQLD AELGERLDRF AQAQLDWILG SNPFGVCMVH GVGAPEPTYH RQFRNVPGGV QNGITAGFEN EADIAYCPEP WGDDHAHRWR WAEQWIPHSA WLFLAVSSL
|
| |
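
The GC content field in the record can in principle be recomputed from the Gene sequence field. The following is a minimal sketch, assuming the sequence is supplied as a string in the same space-separated 10-base blocks shown above. The helper name `gc_content` is hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the Gene sequence field, for illustration only.
first_blocks = "ATGGAGATTC TGGTCAACCA"
print(f"GC content of first 20 bases: {gc_content(first_blocks):.0f}%")
```

Passing the full Gene sequence string to the same function would give a whole-gene value to compare against the GC content field.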