Gene Information |
Locus tag | Hmuk_2574 |
Symbol | |
ID | 8412120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2477639 |
End bp | 2478826 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645020915 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003178387 |
Protein GI | 257388614 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
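The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2477639, 2478826)
print(length)                  # 1188
print(protein_length(length))  # 395
```

Both results agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.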
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.577743 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCG GACTGCAGCA GTCACGTGCT GGTTCTTTCG AGCTCGTCCA TCACTACGTC GGGGGCGACG GAGAGGCCGC ACTGAGAGCG CTGATGGAGG GATTCGCCGA CAGCCACAGC TCGATCTCGG TTCAGGAGAC CCACTACGAC AACATGCGGC TCCAGGTCAA GAGCCGCATT CTGGGTGAAG ATCCGCCGGA CGTCTGGACC GGGTGGCCCG GCGGCGAGAT GGCCGGCTAC GCGGAAGTCG ACGCGGTCAA AGACATCACC GACCTCTGGG AGGCCTCGGG GATGGCCGCG AACTTCCGGT CGGTCGCGGC GGACGTGTCA CAGGTCGACG GTCGGTACCA CGCGGTCCCG ATCACCATCC ACCGGGCAAA CGACATGTAT CTCCACACCG AGACCGTCGA GGCGCTCGGG ATCGACCCCG CGCGAGCGTC GGACCCGACC GAACTCGTCG AGATGCTGGA AGCGGTCGAC GACGACCACG ACGGACCGTC GTTCTTGCTC CCGATGGCCG ACCCGTTCAC CGTCCTGCAG CTGTGGGAGA TCACGCTCCT GGGACTGTCC GATCACGCCA CGTTCGAGGC GATGACCGCG GGCAACGCGT CGCGTCACCG CGACGTGATC GTGAGTGCGC TCGAACACAT CCAGCGTTTC TCGGCGCTGT CCAGCGAGGA CTCGCTGTAT CACGGGATGA CCGACGCGAA CGAGCAGTTC ATCGACGGAG CCGCCCCGGT CTACCCGCAG GGCGACTGGG CGGCCGGCGT GTTCGACGAG ACGCCCGAGT TCGACTTCCA GTCCGAGTGG GACCGCGTCG CGTTCCCCGG TACGGAGGAC ATGTTCGCCG TCGTCATGGA TTCGTTCATC CCGTCGTCGA AATCGGACAG CGACGCGCTC GAGACGTTCC TCGAATACGT TGGCTCGTGT GACGCACAGG AGCGGTTCAG CCGGAAGAAG GGGTCGCTGC CGGCACGGAA AGACGCCTCG ATCGACGGGT TCACCGAGTT CGGGAGATCA CAGTACCGGC AGCTCGATCG CTCGGCCGAA CAGCCCCAGA TAATCACGCA CGGCCTGAGC GTCTCCTCGG CACAGCTGGT CGACCTGAAG TCCGCGATCG CGGGCTTCAT CGACGAGTGG GACGCACAGG CGACGGCCGA CGAGATGATC GGCGTCTTCG ATCGCTGA
|
Protein sequence | MSSGLQQSRA GSFELVHHYV GGDGEAALRA LMEGFADSHS SISVQETHYD NMRLQVKSRI LGEDPPDVWT GWPGGEMAGY AEVDAVKDIT DLWEASGMAA NFRSVAADVS QVDGRYHAVP ITIHRANDMY LHTETVEALG IDPARASDPT ELVEMLEAVD DDHDGPSFLL PMADPFTVLQ LWEITLLGLS DHATFEAMTA GNASRHRDVI VSALEHIQRF SALSSEDSLY HGMTDANEQF IDGAAPVYPQ GDWAAGVFDE TPEFDFQSEW DRVAFPGTED MFAVVMDSFI PSSKSDSDAL ETFLEYVGSC DAQERFSRKK GSLPARKDAS IDGFTEFGRS QYRQLDRSAE QPQIITHGLS VSSAQLVDLK SAIAGFIDEW DAQATADEMI GVFDR
|
| |
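The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated is given below. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1; the alternative start codons that table 11 allows are not handled here, which should not matter for this gene since it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGTCCAGCG GACTGCAGCA GTCACGTGCT"
print(translate(first_blocks))             # MSSGLQQSRA
print(f"{gc_fraction(first_blocks):.2f}")  # 0.60
```

The translated prefix matches the first block of the protein sequence; applying the same functions to the full gene sequence would allow a comparison against the listed protein sequence and GC content.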