Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0217 |
Symbol | |
ID | 8409715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 215308 |
End bp | 217074 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645018542 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003176061 |
Protein GI | 257386288 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.745298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTCTG ACGATCCACT CGGACGACGC GACTTCCTCA GGGCGGCCGG TGCTGGTGCC GTCACGGTAA CGCTCGCCGG CTGTGCCGGC GACGACGGTG ACGACGAGAC GCCGACAGAG GGCGGCGACG AACAGATGGA GACCGACGAG GCCGCGACCA CCGCCGAAGG TGACGACGGC GGCGACGACA CTGCCGGCGA CCAGACGCTC GTCTACGCCC GTGGGGACCA CCCGGAGAAC TACGACCCCC AGCAGACCAC GAGCGGCGAG GTCGCGAAGG TGACGAACCA GATCTTCGAC ACGCTGATCC AGTTCGCCGC CGGCAGCGGG GGCGAACTGG AAGCCGGACT CGCGACCGAC TACTCGCTGG AGGGGACGAC GGCGACGCTC ACGCTTCGGG AGGACGTGAC CTTCCACAGC GGTGAGCCGT TCACCGCCGA GGACTTCGAG GCGACCTTCC GGCGCTTTAC CGATCCCGAA TACGACTACT ACCTCGGCGA CGCCAACCGA TCCGGATACG GCCCCTTCAC GCTCGGCAAC TGGATCGAAT CGGTCGACGC CAGCCAGGAC GGCGAACTGA CGATCGAACT GAGCCAGCGC TACGCGCCCT TCCTGCGCAA CCTCGCGATG TTCGCGGCGG CGGTGCTCTC GAAGGCCCAG ATCGAGAGCT TCGACGCGAG CCCGGACGCG CAGGTCGGGC TCGGCACCGA ACCGATCGGG ACCGGCCCCT TCGCGTTCGA CCAGCTGGAC AACCCCAACG ACCGGATCCG CCTGACGGCC AACGAGTCGT TCTGGGGTCC CGGTCCGAAC GTCGGCTCTG TCGTCTTCAA GACCATCTCC GAGAACAGCA CGCGCGTTCA GGACGTGATC AACGGCGCGT CACACGTCAC CGACAACCTC GACTCCGACG GCTTCCAGCG GGCCGACAGC AGCGACACGG CGACGCTGCT GCGCAAGAAC GGAATCAACG TCGGCTACAT GGCGATGAAC ATGGAACGGA TGGAGCCGTT CCGGGATCGC CGAGTCCGGC GTGCGGTCTC GCTCGCGGTC AACACCGAGG CCATCGTCAA CCAGATCTAC CAGGGCTTTG CCACCCAGGC CTCCCAGCCG CTGCCGCCGG ACGTGCTGGG ACACAACGAC GGTCTCGATC CCTACCCGAC GGACAAAGAC GAGGCCCGGT CGCTGCTGGA GGAAGCGGGC TACGGCGACG GCTTCGAGTT CGAGCTAGCG ACGTTCTCGA ACCCGCGCGG TTACAACCCC AGTCCGGTCC AGACGGCCAA CCAGGTCCGT TCCGATCTGC AGGACATCGG TCTCTCTGTC GAGATCAACC AGTTCTCGGA CTTCGGCCCC TATCTCGATT ACACCGACCA GGGCCGCCAC GACGCCTGCT TCCTCGGGTG GTACACAGAC AACGCCGATC CCGACAACTT CCTCTACGTC CTGCTCGACC CACAGGTCCC GCTCGACGAC GTTCCGGACG GACAGGACTG GATCAGCTTC GACACTGACG GGTACAACAC GCTGAACGTC TCGGCGTGGG CCAACACCGA GTACATGGAA CTGGTCCGGG AGGCTCAGTC GACCTACGAC ACGAACGAGC GCGATACGAT GTACCAGGAG GCCAACAAGC TCGCCCACGA CGAGGCTCCG TGGGTGTTCG TCGACTACGC CGAGACGCTT CGAGCGATCA ACGAGGCCGT CGTCGAGGAC ACCTACACGG TGAGCTCCGT CGGCGGACCG TACCTCAACA CCGTCGAACT GCAGTAA
|
Protein sequence | MQSDDPLGRR DFLRAAGAGA VTVTLAGCAG DDGDDETPTE GGDEQMETDE AATTAEGDDG GDDTAGDQTL VYARGDHPEN YDPQQTTSGE VAKVTNQIFD TLIQFAAGSG GELEAGLATD YSLEGTTATL TLREDVTFHS GEPFTAEDFE ATFRRFTDPE YDYYLGDANR SGYGPFTLGN WIESVDASQD GELTIELSQR YAPFLRNLAM FAAAVLSKAQ IESFDASPDA QVGLGTEPIG TGPFAFDQLD NPNDRIRLTA NESFWGPGPN VGSVVFKTIS ENSTRVQDVI NGASHVTDNL DSDGFQRADS SDTATLLRKN GINVGYMAMN MERMEPFRDR RVRRAVSLAV NTEAIVNQIY QGFATQASQP LPPDVLGHND GLDPYPTDKD EARSLLEEAG YGDGFEFELA TFSNPRGYNP SPVQTANQVR SDLQDIGLSV EINQFSDFGP YLDYTDQGRH DACFLGWYTD NADPDNFLYV LLDPQVPLDD VPDGQDWISF DTDGYNTLNV SAWANTEYME LVREAQSTYD TNERDTMYQE ANKLAHDEAP WVFVDYAETL RAINEAVVED TYTVSSVGGP YLNTVELQ
|
| |