Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0541 |
Symbol | |
ID | 8410043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 513139 |
End bp | 515037 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645018867 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003176382 |
Protein GI | 257386609 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.843572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTTG CGGGCTGTTC GGATCAGCTC GGCGGTGGTG GTGACGGTGG CGACGGCGGC GACGGTGGTG TCGACCCCGT TCAGGATCGT GTCACGGTCG ATCCCGCCGA CATCGAAGAG GGCGGGACGT TCAGAGCCGC GATCGGTGAG GGACCGGACT CGTTCGACTT CGCGTACAGT AGCTCCGCTT CGGCTTCGAT CCTCCACAAC CTCATCTTCG AGGGGATGGT GACGACCGAC GCGAGCGGCG AGATCTATCC GTGGCTCGCC GAGTCGTACG AACAGGTCGA CGTACAGGAC GTATCGCCGG CCGACTACGC GGACTACATG ACCTCCGTCC CCTACACCGA GACCGAAGAC GGCGCGATGG TCATCGACAC GGACGCACAG ATCGTCCTGG AACACCCGGA CAACGATCCC GCGTCCGGCG ACGACGCCCG CGTTCTGACC GTCGAAGAGG CCGGCGACGC CGTCGCGGAT GGCACCTACG GGATGCACTT CCGGTTCGAC CTCCACGAGG GTGTCACGTT CCACAACGGC AACGAGATGA CCGCCGACAA CGTCGTCGAG TCCTACGAGC GCATCCGGAA CTCGACGCTG TCGGGCCAGT ACTACGACTC GATGCTGGAC ATCCAGGCCG ACGGCGACTA CACCGTCCAC CTCTACATTC AGGAACCCGA CGCGGCCGCG GTGCTGGAAC TCGGCGACGC GCCGATCTAC CCCTCCGAGT CGGCGACGCT CCCGCCCGAG GCGATGGACC CCCGACAGGG GAACACGCCG ATGGGGACCG GAATGTTCGA ACTGGACGAG TTCCAGGAAG GCGAGTACGT CGTGTTCACC GCCTTCGACG ACTACTGGTT CGACACCGAG ATGAAAGACT GGTTCGAGGG CTCCTCGGAG TTCCCGAACG GCCCGGTCGT CGACGAGGTC GACGTATCGT TCGTCTCGGA GGACGCTTCA CGGTCCGCGG CCCTCCAGGA AGGCGAGATC GACATGAGCT ACGGGCTGAC TGCGAGCACG CTCAACGACT ACCAGAACTC CGAAGACTTC CGGACGGCCC CGACCGACGG TGCCGGCTAC ACGTTCCTCC AGCACCCCGT CACGGTCGAA CCCTTCACCG ACAAGCGGGT TCGACAGGCG ATCAACCACC TCATCCCGCG TGAGAATATC GCCCAGAACA TCTTCTCCGG GTGGGAAAAT CCGGCTTGGA CGCCGCTGCC GCCGGTCGCC GCCGGGGCCG GGACCGACGA CTACGAGCAG CTCGTCGAGG ACGGCCGCGA GTACAACGAA TACGACCAAG AGCGAGCGGC GGAGCTCGTC GAAGAGGCAA TCGAGGACAA TGGCTGGGAG ACCCCGATCG AGGTCCAGCT GGAGACGAAC TCCGACAACG ACGACCGCGT CCGTACCGTC GAGCTGATCC AGGAAGCGCT CAATCGGTCG GAGTACTTCG AGGCCTCTCT GGAGACCTAC GAGTTCCTCG ACTTCATCGG CCAGCTCCTC AGCGAGGAGT ACTACGACGA CGGCAAGTTC GCTTTCATCG GGCTCTCGGG CGGCTTCAAC CCACACGGCT ACGCGAAGTC CGTCCACTCA CAGGACAACT TCGCTCAGTG TTGTAACTTC CAGAACATCA ACGACGACGA ACTGAGCCAG CTGTTGCGCG ACGCACGATA CGGCGTCGAC GTGGCCCAGG ATCCCGAACT CAGACAGGAG CGGTACAACG CGGTCTGGGA ACGCGTCCTC GAACTCAGCG CCAACTCCTA CGGTACGCAC AGCACGCTCG TCGGTGTCGT CGACGACACC GTCGTCAACG GGTTCAACAC GTATCCGAGC ACGCAGGACA TCATCGGATA CGGCCTGTTC GCTCCACAGG ACGAACAGAT TACGTACCTC AGCAGATAA
|
Protein sequence | MSLAGCSDQL GGGGDGGDGG DGGVDPVQDR VTVDPADIEE GGTFRAAIGE GPDSFDFAYS SSASASILHN LIFEGMVTTD ASGEIYPWLA ESYEQVDVQD VSPADYADYM TSVPYTETED GAMVIDTDAQ IVLEHPDNDP ASGDDARVLT VEEAGDAVAD GTYGMHFRFD LHEGVTFHNG NEMTADNVVE SYERIRNSTL SGQYYDSMLD IQADGDYTVH LYIQEPDAAA VLELGDAPIY PSESATLPPE AMDPRQGNTP MGTGMFELDE FQEGEYVVFT AFDDYWFDTE MKDWFEGSSE FPNGPVVDEV DVSFVSEDAS RSAALQEGEI DMSYGLTAST LNDYQNSEDF RTAPTDGAGY TFLQHPVTVE PFTDKRVRQA INHLIPRENI AQNIFSGWEN PAWTPLPPVA AGAGTDDYEQ LVEDGREYNE YDQERAAELV EEAIEDNGWE TPIEVQLETN SDNDDRVRTV ELIQEALNRS EYFEASLETY EFLDFIGQLL SEEYYDDGKF AFIGLSGGFN PHGYAKSVHS QDNFAQCCNF QNINDDELSQ LLRDARYGVD VAQDPELRQE RYNAVWERVL ELSANSYGTH STLVGVVDDT VVNGFNTYPS TQDIIGYGLF APQDEQITYL SR
|
| |