Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0332 |
Symbol | |
ID | 8409830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 326048 |
End bp | 327454 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645018657 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003176176 |
Protein GI | 257386403 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAC CAACTCGCAG AGACTACGTT AGAGGAGTCG GTGCAGCAAC AATTGTCGGA CTCGCAGGCT GTTCCGGAGA CGGCGGCGAC GGCGGTAGCG ACGGCGGTGA CGGCGGCAGC GACGGCGCTT CGACCGGCGA CAGCGGCGGA AGCAGTGACG TGACGTTCGA CTTCTGGCAC ATCCACGGTG ACGAGCTCGG CGAGACGCTG AGCCAGTTCG CTCAGGAGTT CTCCGAGGAG ACAGACGGCG TGACGGTCAA CGCGGTCAAC AAGGACGGCT ACCGCCAGAA CCTCAACCAG TCCCTGCAGG CCTCGCGAGC CGGTGACCCA CCGGGCGTCG CCCAGATCTT CGAGATCGGG ACACAGCTGT GTCTCGACAG CGGAGCCTTC ACGCCCATCG AGCAGGTCAT CCCCGACGGT GCGGTCGACT TCGACGATCT CCTCCCGTCG GTGTCGAGCT ACTACCGCAT CGACGACGAG CTCAATTCGA TGCCGTTCAA CTCCTCGAAC ACGATCATGC TGTACAACAA GACGGCGTTC GAGGAGGCGG GACTGGATCC GGAGGATCCG CCACGGAGCC TCTCCGGCGT GCGCAGCGCC GCAGAGACGA TCGTCGACCA GACCGACATG GAGGCCGGCA TCTCCTGGCC GAACCACTCC TGGATGCAGA TCGAACAGCA GTTCGCGAAG CAAGACCAGG TGCTGCTCAA CAAGGAGAAC GGTCGCGCCG GTCGTGCCGA CAAGACCTAC TACAACAGCG AGGCCGGTCG GAACGTCTAC GAGTGGTGGA AGGGCATGGC CGACGACGAC CTGTACCTGA ACCCGGGTAT CGAGGCGTGG TCCGAGGCAC GACAGGCGTT CCTCACGGGC CAGGTCCCGA TGCTGTGGGA CTCCACGTCG AACATGGTCT CGATGAAGGC CGGTGCCGAG GAGAACGGTT TCGAACTCGG GTCGGCGTAC CTGCCCGCAC CGGACGGTGC CAACAACGGC GTCGTCATCG GCGGCGGCTC GCTGTGGGTG CCCGACGCCC TCGCAGACGA GAAAAAAGAG GCCGCCGGCA AGTTCATCGC CTACATGGTC CAGCCCGAAC AGCAGGCCCG CTGGCACCGC AACAGCGGGT ACTTCCCGGT GAGTCAGGGT GCCGTCGACC AGCTCGAAAG CGACGGCTGG TTCGAGGAGA ACCCCGACTT CCGTACGGCC TTCGATCAGT TGCAGGACAC CAAGGACTCG CCCGCCACGC GTGGCGCGGT CATGGGCGTC TTCCCGGAAG CGCGGAGTAT CAACACGGAG ATCTCGGTCA GTATCCTCAA CGGTGACGTC GGCGTCGAAG AGGGCCTCTC GGAGATGGAC ACCCAGGTCC AGGAGACCCT GTCGAGCTAC TCCGGCAACT ACAGCGGAGA GGAGTAA
|
Protein sequence | MKKPTRRDYV RGVGAATIVG LAGCSGDGGD GGSDGGDGGS DGASTGDSGG SSDVTFDFWH IHGDELGETL SQFAQEFSEE TDGVTVNAVN KDGYRQNLNQ SLQASRAGDP PGVAQIFEIG TQLCLDSGAF TPIEQVIPDG AVDFDDLLPS VSSYYRIDDE LNSMPFNSSN TIMLYNKTAF EEAGLDPEDP PRSLSGVRSA AETIVDQTDM EAGISWPNHS WMQIEQQFAK QDQVLLNKEN GRAGRADKTY YNSEAGRNVY EWWKGMADDD LYLNPGIEAW SEARQAFLTG QVPMLWDSTS NMVSMKAGAE ENGFELGSAY LPAPDGANNG VVIGGGSLWV PDALADEKKE AAGKFIAYMV QPEQQARWHR NSGYFPVSQG AVDQLESDGW FEENPDFRTA FDQLQDTKDS PATRGAVMGV FPEARSINTE ISVSILNGDV GVEEGLSEMD TQVQETLSSY SGNYSGEE
|
| |