Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0206 |
Symbol | |
ID | 8409704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 202032 |
End bp | 203333 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645018531 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003176050 |
Protein GI | 257386277 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.568957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGACT ATTTCATCAA TCGTATATCC ATGGGATTCG ATCGGAGAAC ACTGTTGAAG CACATTGGTG CAACGGGAAC GATCGCAGCA GTCGGCGGCT GTGTTGGCGT CGAGGAACAG GGAACAGATG CTGATGCCGG CGGGGGCGAG AGCAGCAACG GTACCGACAG TACAACAGAG GAGCCCGCCG GATCGGCAAC GGCCTGGTAC GGCCTCTCCG ACACGGAGCT CGAACTCCGC GAAGACATCA TCGCGGCATT CAACGAGGAG TCCAGCCACA CGATCAAGGG AGGGAATATC GCCGAGATGC AAGACCGGAC GACGAGCGCG ATCCCTGCCG GACAGGGACC GGAAACGTTC CAGTGGGCCC ACGACTGGGT CGGTGATTAC TACGAACGCG GGTTCGTCGT CGACCAGAGT GACGAGCTGT CCGTCGACCT CGACCAGTTC ACCAGTGCAG CGGCTGGTGC CGTCCAGACC GACGATGCGA TCGTCGGGCT CCCCTTTTCG GCGGAGACGG TGACGCTAAT CTACAACGCA GACATCGTGG ACAAACCACC GGAGACGTTC GAGGAAATGG CAGCAAGTAT GGAAGCGTAC CACGATTCGG CCAACGGGAA GTACGGGCTA GCCATGCCGT TCAACCCCTA CTTTATCAGC GGGATCGCAC AGGCGTTTGG CGGCCGCTAC TTCGATCCCG AAAGCGACCC AGTGGTTGGT CTCGATTCTG AGGAGACGGT CCGTGGATTC GAGTTTATGC TCGACAATCT CGTCCCATAT ATGCCGAACG ACCCAGGCTT CGAACCCCAG CAGGCAACGT TCGCAGAGGG CAACGCGGCC TTCGCAGTCA ACGGTCCGTG GTATCTTGCC ACACTCAACG ACAGCGACAT CAACTACGAG GTGACGACCT TCCCGTCGAT GGACGGCGGT GAGTTCACTC CACTGAGCGG GATCAAGATG TGGTACTTCT CGAAGGCAAT GGAAGAGGGA GATGTCGACG CGACGGCAGG ACGCGAGTTC ATCGAGTGGT TCGTGACCAA CGAGGACCAC CTACTCACCA GAGCCGAAGA ACAGGGCCAC ATTCCAGTCC TCTCGTCGCT CGCCGGCAGC GACGATCTCC CCGGCCCAGT CCGGGCCTAC TCGGAGGCCG TCGATCAGGG TATCCCGATG CCGACGGATC CTCGTATGAG CGACGTGTTC GCAGCGCTGG AGGAACCAGT CGTCCAGATT TTCAACGGAA GTCAGAGCCC AGCACAAGCA CTTGCCGGGG CCGCCGACGA GGCTCGAAGT AACTGGGAGT AA
|
Protein sequence | MYDYFINRIS MGFDRRTLLK HIGATGTIAA VGGCVGVEEQ GTDADAGGGE SSNGTDSTTE EPAGSATAWY GLSDTELELR EDIIAAFNEE SSHTIKGGNI AEMQDRTTSA IPAGQGPETF QWAHDWVGDY YERGFVVDQS DELSVDLDQF TSAAAGAVQT DDAIVGLPFS AETVTLIYNA DIVDKPPETF EEMAASMEAY HDSANGKYGL AMPFNPYFIS GIAQAFGGRY FDPESDPVVG LDSEETVRGF EFMLDNLVPY MPNDPGFEPQ QATFAEGNAA FAVNGPWYLA TLNDSDINYE VTTFPSMDGG EFTPLSGIKM WYFSKAMEEG DVDATAGREF IEWFVTNEDH LLTRAEEQGH IPVLSSLAGS DDLPGPVRAY SEAVDQGIPM PTDPRMSDVF AALEEPVVQI FNGSQSPAQA LAGAADEARS NWE
|
| |