Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1598 |
Symbol | |
ID | 8411120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1523368 |
End bp | 1525053 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645019924 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003177419 |
Protein GI | 257387646 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0732805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGACA CACGGCACAC GGGACGGATC GCATGGCTCT CACGGCGACA GTACGTCCGC GGCGTCGCCG GCAGTGCTCT CGCGGTCGCT CTGGCGGGCT GTCAGGGCGA CGGCGGTGAG GACGATGGCG GCGACGGCGG CGCACAGCAA CCGGAGGAGA CGACGGCCGA CAGCGAGGAC ACGGCGACGG ACACCGACGA CACAGAGGCC ACGACCGACG GCTCGATCAC GTTCGCACAG GCGAAATCGC CCGTGGAGTT CGATCCCGTC GTCCTGAACG ACGTTCCGTC GGCCGAGGTC GCGATGCTCG TCTTCGACTC GCTGTACACG TACGACGAGG GCACCAATCT CGTTCCGCAG ATCGCCGCCG ACATGCCGGA GGTAGAGCGC GGAGCCCAGC GGTGGATCGT TCCGATACGG ACCGACGCCA CTTTCCAGAA CGGCGACCCG GTCACGGCCG AAGACGTCGC TTACTCCTTC CGGGCACCGG TCGAAGAAGA GACGGAGAAC GCCGGGGAGT TCAACATGAT CGATAGCGTC GAAGTCGTCG ACGAGTCGAC CGTCCAGTTC GACCTGCAGT TCCAGTTCGG GGCATTCGAC TCGTATCTCC CGTGGGAAAT CGTCGACAAG TCCGTCCGCG AATCGGACAG AGACGCCTAC AACACGTCAA GCCCCGTCGG AGCCGGCCCG TTCACGTTCG ACGACTGGCA GGAGGGCGAG TACGTCCGGC TCAGCCGCTG GGACGACTAC TGGGGAGAGC CACTCCCGAA CCTCGCCGAG ATCGAATTCG TCCCGGTCGA AGAGCCGACG ACCCGGATCA CGACGCTGCG AACCGGTGAG AACGACGTGG TAAAGAACAT TCCACCGGCA AACTGGGAGA CTGTCGAGAA CATGGGCGAG GCCAGCATCG AGTCGGTTCT GGGAACGAGC TACTTCTATC TCGCCTTCAA CTGCAACGAG GGACCCACCG CCGACCCCGA GGTGCGGGAG GCGATCGACT ACGCCTTCTC GATGGACGAC GCGGTCGGCC AGTACGTCGA ACCGACCGGC GAGCGACAGT ACGCGCCGGT TCCCAGAGCG ATCTCCGAAG ACTGGGAGTT CCCCGTCGAG GAGTGGCAGC AGATCCCCCA CGAGCCGGAT CTGGATCGGG CCAAGTCGCT GCTCGACGAC AACGACAGCG TCCCGGACGA CTGGCAGCCC CGGATCATCG TCCCGCCGGA CGACAAGCGC GAACAGATCG GGGTCTCCGT CTCGAACGGG CTCAGCGAGG CCGGCTACGA CGCGACGGTC CAGCGCCTCG ACTGGGGTGC CTTCCTCGAA CAGTACGTCA CCGGCAGCGC CGACGACTAC AACATGTTCA CGCTTGGCTG GGCCGGTTCG CCCGATCCGG ACACGTTCAT GTACTTCCTG TTCGCCCACG ACCAGATCGG CACGAACAAC GGCACCTACT ACCGCAACGA GTCGATGAAC GAACAGATCA TGAACGCCCG TCAGTCCAAC GACGACGAAC AGCGCCGCGA GTGGTACGTC GACGCCATCC AGACGGTGCT CGAAGACCGG GTCCACCTCC CGGCGTACAA CATCAAGAAC AGCTTCGGGG TCCGGAGTCA CGTCTCGGAC TTCCGGGCCC ACCCCGTCGA CCAGTTCAGC ATCGTCTCGG CGTACAACAA CGTCTCCGTC CAGTGA
|
Protein sequence | MKDTRHTGRI AWLSRRQYVR GVAGSALAVA LAGCQGDGGE DDGGDGGAQQ PEETTADSED TATDTDDTEA TTDGSITFAQ AKSPVEFDPV VLNDVPSAEV AMLVFDSLYT YDEGTNLVPQ IAADMPEVER GAQRWIVPIR TDATFQNGDP VTAEDVAYSF RAPVEEETEN AGEFNMIDSV EVVDESTVQF DLQFQFGAFD SYLPWEIVDK SVRESDRDAY NTSSPVGAGP FTFDDWQEGE YVRLSRWDDY WGEPLPNLAE IEFVPVEEPT TRITTLRTGE NDVVKNIPPA NWETVENMGE ASIESVLGTS YFYLAFNCNE GPTADPEVRE AIDYAFSMDD AVGQYVEPTG ERQYAPVPRA ISEDWEFPVE EWQQIPHEPD LDRAKSLLDD NDSVPDDWQP RIIVPPDDKR EQIGVSVSNG LSEAGYDATV QRLDWGAFLE QYVTGSADDY NMFTLGWAGS PDPDTFMYFL FAHDQIGTNN GTYYRNESMN EQIMNARQSN DDEQRREWYV DAIQTVLEDR VHLPAYNIKN SFGVRSHVSD FRAHPVDQFS IVSAYNNVSV Q
|
| |