Gene Information |
Locus tag | Hmuk_0944 |
Symbol | |
ID | 8410460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 906044 |
End bp | 907345 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645019279 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003176780 |
Protein GI | 257387007 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
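The length fields in the record above are consistent with its coordinates, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information table for Hmuk_0944.
length = gene_length_bp(906044, 907345)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both results match the Gene Length (1302 bp) and Protein Length (433 aa) rows.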
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.322748 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGAAG CCATCGGTGC CGGTGGCCTC ATCGCCATCT CCGGCTGTAT GGGCGACGGT GGCGATGGCG GCGATGGCGG TGACGGTGGC AGCGACGGCG GTGACGGTGG CAGCGACGGC AGCGACGGTG GCGATCAGCA GACCATCCAG TTCCTGACGA TGGGGGTCGG CGACAACATC GCCGAGTTCT TCGAGAAGAA CAACGCGGCC TTCGAAGAGG AGTTCGGCGT CACGCTTGAC TTCACGAGCG TCACCTGGGA CAACGCCCAG CAGACGGTCA ACAACCGCGT CGACGGCGGC GAGGCACCTG ACGTAAGTCG CTGGCCGGCC CGCTGGATCC CCCAGCTCGT CGGCAAGGAA GCGCTCGTCC CCATCACCGA CATGATGGAA GGCGAGTTCG GCGACCAGTT CTACCAGGGC ATGGCCGACG GCTGTATGTA CCAGGGCGAG TACTACGCTG CCCCCTGGGC CGCATCCAAC AAGTGCTTCT ACTACAACAA GGACGTGTTC GAGGCGGCGG GCCTCGATCC GGAGGACCCC CAGCTCGACA CCTGGGACGA CATGCTCTCG GCGGCCCAGA CCATCACCGA GGAGACCGAC ACCCCCGCAC TGGGACTGGC CGGTGCCGAC GCCATCGAGA CCGGCTCGCA GTACTACCAC TACCACTGGT CACACGGCGC GGACCTGATC GACGACGAGG GTCAGCCGGT CGTCAACTCC GATGGGGCCG TCGAGGCGCT GAGCTTCTAC TCGGACCTGC ACCTCGAACA CGGCGTCACT CAGTCCTCGC CGCTGTCCTC GACGCGCCAG GACATCCGTC AGCTGTTCGA GTCCGGCTCG CTGGGTATGG TCATCGCCCA CGTCTACACG GGCATCAACA TCGACGACAG CGACGCCGAC TTCGACTACG GGATCGCACA GGTGCCGGAG GGGCCCGCTG GCCGCTACAG CCTGAACACG ATCGACGGCG TTTCGATCTT CGCCCAGACC GAGGTCGAGG ACCTCGCGCG GGACCTGCTA CGGTTCTACT TCGACGAGGA CCGCCACTTC GAGTACGCGA GCAGCAAGGG ATTCATGCCG ACGGTCGAGG CGGTCGGCGA GCGCGACTAC TTCCAGGACT CGGAGAACTG GGCACCGTTC ATCGAGGCCG GTCAGTACGC CCGCGCTCGG CCGAAACTGT CGAACTTCAA CGAGTTCAAC AACCGCATGG TCCAGGCGAT CCAGGAAGCG CTGGGCGACC AGAAGTCCCC CCAGCAGGCC CTGGACGACG CACAGGCGGA CCTCGAAGAG ATGATGCAAT AA
Protein sequence | MLEAIGAGGL IAISGCMGDG GDGGDGGDGG SDGGDGGSDG SDGGDQQTIQ FLTMGVGDNI AEFFEKNNAA FEEEFGVTLD FTSVTWDNAQ QTVNNRVDGG EAPDVSRWPA RWIPQLVGKE ALVPITDMME GEFGDQFYQG MADGCMYQGE YYAAPWAASN KCFYYNKDVF EAAGLDPEDP QLDTWDDMLS AAQTITEETD TPALGLAGAD AIETGSQYYH YHWSHGADLI DDEGQPVVNS DGAVEALSFY SDLHLEHGVT QSSPLSSTRQ DIRQLFESGS LGMVIAHVYT GINIDDSDAD FDYGIAQVPE GPAGRYSLNT IDGVSIFAQT EVEDLARDLL RFYFDEDRHF EYASSKGFMP TVEAVGERDY FQDSENWAPF IEAGQYARAR PKLSNFNEFN NRMVQAIQEA LGDQKSPQQA LDDAQADLEE MMQ
| |
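The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how the two could be cross-checked follows: it strips the block spacing, translates the coding strand codon by codon, and computes GC content. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares with table 1 (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the gene sequence is assumed to be given in the coding (5' to 3') orientation, as its leading ATG and trailing TAA suggest.

```python
from itertools import product

BASES = "TCAG"
# Standard codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last two blocks of the gene sequence shown above.
print(translate("ATGCTCGAAG CCATCGGTGC CGGTGGCCTC"))  # MLEAIGAGGL
print(translate("ATGATGCAAT AA"))                     # MMQ
```

The two fragments reproduce the first block (MLEAIGAGGL) and the last three residues (MMQ) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content row to be compared in the same way.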