Gene Information |
Locus tag | Hmuk_2264 |
Symbol | |
ID | 8411805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2185164 |
End bp | 2186708 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020607 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003178083 |
Protein GI | 257388310 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
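The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length(2185164, 2186708)
print(length, protein_length(length))  # prints: 1545 514
```

Both results match the "Gene Length" and "Protein Length" fields listed in the table.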
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.337461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCAAATA ATCATTCGGA CGACGACGGA CAGCGAAGCA GTGTCTCTCG ACGACGGTTC GTCGCAGCCA GCGGTGCATC GGGCGTCGCG GCGACACTCG CCGGCTGCGC AGATCTGGTC GGTGGCGGGG ACGGCGGCGA CGGTGGCAGC GACGGCGGCG ACGGTACCGG AGCGACGACG GGCGAGGGCG GGAGCGACGG CGCGACGACG ATCCAGTGGG GCATCGGGCC GACCGCCGTC CAGACGGCCG GCGAGGAGAT GAAGACGGCA CTCCACGAAG CGGGTGGCCT CAGAGACGAC ATCGAGATCG AGTGGGTCCC CAGCGCCTCG GACACGGGCG AGGTCCGCTC GAACTACAAC CGCATCCTCA ACGCGGACCA GAGCGACCCC GATATCTTCC AGATGGACAA CGGCTGGGTG AACATCTTCA TCCAGCGCGG GCTGATCCAG AACCTCTCGG AGACGCTCCC GGAGGATCTC CTCTCGGACA TCAACGAGAA CTACTTCAGC GGGTTCACCG ACACGGCCCG AGACCCGTCG TCGGGTGACC TCTACGGCGT GCCGCTGTTC CCCGACTTCC CGACGATGCA GTACCGCAAG GACCTCGTCG AACAAGCGGG CTACGACCCG GAAAGCGAGA ACTGGGCGAC CGAGCCGATG ACGTGGGAGG AGTGGTCCCA CGTCGTCGCC GACGTGAAGG ACAACGCCGA CGCCGAGTAC GGCTTTACGA CCCAGTGGGA CATCTACGAG GGCACCGCCT GCTGTACGTT CAACGAGGTC ATGTCATCGT GGGGCGGCGC GTACTTCGGC GGCCGCGAGA ACCTCTTCGG TCCGGTCGGC GAGCGTCCGA TCACGGTCGA CGAGCCCGAG GTCCACAACG CGCTCAACAT GATGCGGAAG TTCGTCCACG ACGAGGAGTT CGACGGGACC TTCGCGGACT ACGCCGGCAA CATCGCGCCG ACGGACATCC TCGGCTGGAT CGAGGAGCCC TCTCGCTCGC CCTTCGCCGA GGGCGACGCA GTCTTCCACC GCAACTGGCC GTACTCGCTC GCGCTGACCG GCCGGAACCC CGAGGAGACC GACGATCCCG CACTCGGCGA GGACCTGGGT GCGATGCCGA TCCCCTACGC CGTCTCCGAG AGCGAGGCCG CCCAGCCCGG AACCGGTGGC ACGACCTCGG CGCTCGGTGG CTGGCACCTC ACGTTCAACC CCAACAGCGA CAACCTCGAC GTCATCGACG AGGTCGTCTC GGCCGTCATG GAGCCGGACT TCGCCCTCGA ACTGTTCCGA CTGCAGGGGT GGCTCCCGCC GCGTCCCGAG CTGTTCAACT CCGACGAGGC CCGGAACGTC GCACCGGTCG GGCGCTACAT GGACACGCTC CAGGTGGCCG GTGAGAACGC GATGGCACGG CCGGTCACGC CGGTCTGGAG CCAGCAGTCC AGCGACATCG CCCAGTCGGC CAACAGGGTC GTCGGCCAGG AAACCTCGGC CGAGGACGCG ATGGCTTCGC TGACCTCCAG CCTCGAAGCC ACCGAACAGA ACTGA
Protein sequence | MSNNHSDDDG QRSSVSRRRF VAASGASGVA ATLAGCADLV GGGDGGDGGS DGGDGTGATT GEGGSDGATT IQWGIGPTAV QTAGEEMKTA LHEAGGLRDD IEIEWVPSAS DTGEVRSNYN RILNADQSDP DIFQMDNGWV NIFIQRGLIQ NLSETLPEDL LSDINENYFS GFTDTARDPS SGDLYGVPLF PDFPTMQYRK DLVEQAGYDP ESENWATEPM TWEEWSHVVA DVKDNADAEY GFTTQWDIYE GTACCTFNEV MSSWGGAYFG GRENLFGPVG ERPITVDEPE VHNALNMMRK FVHDEEFDGT FADYAGNIAP TDILGWIEEP SRSPFAEGDA VFHRNWPYSL ALTGRNPEET DDPALGEDLG AMPIPYAVSE SEAAQPGTGG TTSALGGWHL TFNPNSDNLD VIDEVVSAVM EPDFALELFR LQGWLPPRPE LFNSDEARNV APVGRYMDTL QVAGENAMAR PVTPVWSQQS SDIAQSANRV VGQETSAEDA MASLTSSLEA TEQN
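The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. As a minimal sketch, the hypothetical helper `gc_fraction` below (not part of the record) removes the spacing and returns the fraction of G and C bases, which is one way the "GC content" field could be recomputed from the gene sequence.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two blocks of the gene sequence: 6 of the 20 bases are G or C.
print(gc_fraction("ATGTCAAATA ATCATTCGGA"))  # prints: 0.3
```

Passing the full gene sequence string instead of the two-block excerpt gives the value for the whole gene.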