Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_3222 |
Symbol | |
ID | 8409495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | + |
Start bp | 59 |
End bp | 1432 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645018158 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003175683 |
Protein GI | 257372909 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.950328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.818094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACC AGAATGCGAG CAGCGATCGG CGACAGCTAC TGAAGGCGCT CGGCACGCTC AGTGTCGCAA GTATGGCTGG CTGTATCAGC CAAAGCGGTT CCGACGGGGA CGGCTCGGGC GAGAGCGGAG ACGGGTCCAC CGGTTCGTCC GGGGACGGCT CCACCGAGGA GGGCGGGGAG GCCACACCGG TACCGATGGC CGACTCGTTC AGCTTCTGGG AGCTGAGCAA GACGTGGGGG CCACACATCG AGCGCTACGA GTCCGAGACC GACGCGACGG TCGAGCACAC GAACATGGGT CCGGACGAAC TGCTCGACAA CCTCCAGACG CGGCTGCTCT CGGGCACCGG TGCGCCCGAC GCCGCGATGG TCGAGTACAC CTCGCTCAAG CAGGTCGCCA AGACCGGCGG GCTGCGTGAC GTCTCCGACT GGATCGACGA GGCCGACATC CGCGACGACT TCACGTCGGG GATCTGGGAG GTCGTCAGCG ACGGCGACGC AGTGTACGAG GTCCCCTACG ACATCGGACC GGCGACGCTG TTCTACCGCA AGGACATCTG GGACGAACAC GGGCTCAGCG ACGACATCGA GACCTACGAC GAGTTCATCG AGGAGGGCAA GAAGCTCCCC GACGACGTTT CGCTGCTGTC GCTGCCGGGC AGCGGTCTCT CGGTGTTCTG GCGGATGATG TACCGCCAGC TCGGCGGCGT CGAGTTCGAC GAGGAAGGGC GGCTCGCGTT CGACAACGAC AAAGCGGTGC AGGCGATGGC CCAGCTCCAG GAACTGGCCG ACGCCGGCAT CACGGACGAC ACCGCGAGCT GGAGCCAGCA GTGGTTCGCC GGCTTCAACG AGGGGACCGT CACGGCCTAC CTCTCGGGGG CGTGGTTCAG CGGGACCCTG ATGTCCTCGA TGGACGAGGC GGCGGGCGAC TGGCGGGGGA TGAAGATCCC GGCCCTGGAG TCGGGCGGCA CCCGTGCGAG CAACATCGGC GGCTCCGGCG TCTGCTTCCC GGACCAGAAC GACGAGGCCA CGGCACGCCG GGCCTTCGAC TTCGTCGTCA ACACGACGAC CAACGTCGAG GAGATGGCGA ACCTGTTCGC CGAGGAGGGC AACATCAGCG CGTACATGCC CGCGTGGGAC GACGAGGCCT TCGAGAACCC CCGCGAGTTC TTCGGCGGAC AGGCCCTCGG TACGCTGTGG ACCGACATCG CCGACGACAT TCCGCCGTAC CGCTACACGC TCGACTCGCC GAAGATCATG AACCTGCTCA ACCCGCTCCT GCAGGACGTC GTCTACGGTG ACCTCGATCC CGAGTCGGCA CTGGAGGAGT GGGTCACCCA GTCCGCCAAC GAGACCGGGC GTGAAGTCGC CTAA
|
Protein sequence | MTDQNASSDR RQLLKALGTL SVASMAGCIS QSGSDGDGSG ESGDGSTGSS GDGSTEEGGE ATPVPMADSF SFWELSKTWG PHIERYESET DATVEHTNMG PDELLDNLQT RLLSGTGAPD AAMVEYTSLK QVAKTGGLRD VSDWIDEADI RDDFTSGIWE VVSDGDAVYE VPYDIGPATL FYRKDIWDEH GLSDDIETYD EFIEEGKKLP DDVSLLSLPG SGLSVFWRMM YRQLGGVEFD EEGRLAFDND KAVQAMAQLQ ELADAGITDD TASWSQQWFA GFNEGTVTAY LSGAWFSGTL MSSMDEAAGD WRGMKIPALE SGGTRASNIG GSGVCFPDQN DEATARRAFD FVVNTTTNVE EMANLFAEEG NISAYMPAWD DEAFENPREF FGGQALGTLW TDIADDIPPY RYTLDSPKIM NLLNPLLQDV VYGDLDPESA LEEWVTQSAN ETGREVA
|
| |