Gene Information |
Locus tag | Hmuk_1189 |
Symbol | |
ID | 8410709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1129939 |
End bp | 1131804 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645019525 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003177022 |
Protein GI | 257387249 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
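The coordinate and length fields in the record above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon that is not counted in the protein length; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
gene_len, protein_len = feature_lengths(1129939, 1131804)
print(gene_len, protein_len)  # 1866 621
```

The result matches the Gene Length (1866 bp) and Protein Length (621 aa) rows.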
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.105584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACA AGTCCAACGA CTACAAAGAC GTTCTTAGCC GCCGCCGGTT CGTCGCGCTG ACAGGCGCGG CAGGTGCTGC TGCACTTGCC GGCTGTGACG GCTCGGAAGG TACGGATGAC AGTACGCCGG CCGACGAGGG TGGGGACGAC GACACGTCGA CCGAAGACGA CGACATGTCG ACCGAAGACG ACTCGATGGA AGTCGCCGAC GTGCGACACC GCAGCGGGAC GAGCCTCTCG CCCGCGGACA TCCAGTTCAA CCCGTGGGGC CAGAACACCG CACAGATCTC GAACGAGCTC ATCTTCGATC CGTTCGCGGA GTTCAACTAC GCTACTGGCG AGTACGTTCC CGCGATCATC GAGGAGTGGG AGTACACGGG CGATACCTTC GAGATGGTCA TTCAGGAGGG TGCGACCTGG CACGACGGCG AGCCCGTCAC GGCCCAGGAC CTGGCGACCT ATCTCCGACT CGACCGCGAG TCGGGCAGTT CGATCTGGGA CTGGGGCAGC GACGTCGAGG AGATCGACGA CCGGACCGTC GCCATCACTA TCGAGGGCGA CATCAACCCG TCGCTGATCG AGTTCTCCGT GATGGAGAAT CGGCTCACCA CCAAGCACTC CCGCTACGGC GACGTTCTCT CGGAGACGCA GAACGCGGAC GACAACACGC CGCTGACGGA GTTCGTCGAC GACGAGCCGA TCGGCAACGG GATCTTCCAG TACGGCGAGG CCGACGAGCA GGTAATTCTC ACCGAGCGCC ACGCCGATCA CCCCAACGCC GACAACGTCA ACTTCAAAGA GTACGCCTTC CAGTACTTCG ACGGCAACAC GGCGATCCAC CAGGCGCTGC TCTCGGGGAA CATCGAGAGC ATGTTCGCCA TCTACACGCC GGGCAACGTG GTCACCGACC TGCCTGACTC GATGAACGAG TACCGCACGC CACGGAACGG CGGCGTCGGG ATTATCCCCA ATCACAATCA CGACCACCTC GGTCGACGCG AGGTCCGTCA GGCGATCGCC TACGCCATGA ACCGAACGCA GGTCGCGGCG AACTCCGACC CGCGTACCAA GGTTGCCCCG CGCATCCCCA CGGCACTGCC CAACGCCCAG CTCGAGAACT GGCTCGGCGA CTCCATGGAG GACTTCGAGA CGTACGGTCG TGAGTCCAGT GAGGTAGAGA AGGCAGCGAG TGTGCTCAAG GAGGCCGGCT ACAGCCGCAA CGGCGACGAC GTGTGGGAGG ACGAGGACGG CAACACCCTC TCCTTCGAAC TCATCGCGCC CGGTGGCTGG TCCGACTGGG TCACTGCGAT GGAGTCCGTG GCCGATCAGC TCAACGCCGC CGGCATGGAC GTGGAGTTCT CGACGGTGCC GTTCGGTGAC CTCGGCGGAT CCGACGGCCG CTGGGCACAG GGGAACTTCG ACGCGACCGC CGAGTACTGG ACTGCGGCGT TCGCGCGTGC CGCTCACCCG TACCACAACC TGCGCCACCA GATGGTCAAC CCCAAGGCGA CGCTGCGCGA AAACGGCTAC GCCTATCCGG GTGCTGTCGA GGACCGCGGC GGTTCCGAAG CCGACATCAC CGTTCCGGCA CTCGACGGCT CGGGCGAGCT GACGGTCAAC CCGGTCGAGG ACGTCGGTAC GCTCGGTTCG ACCAGTGACA GCGACACTGA GGCCGAGCTC GCGCTCGAAC TCCTCTGGGT CTCCAACCAG GATCTCCCGA TGATCCCGAT CCAGGAGGGG CTGAACCAGA CGTTCATCTC CTCGAAGCGA TTCGATATTC CGGCCGAAGA CGCCGAAGTC 
GCCCAAGTGC AGTACGCGAA CACCTGGCTC CCGCGCCAGG GCGAGATGAC CTACAACGGC AACTAA
|
Protein sequence | MADKSNDYKD VLSRRRFVAL TGAAGAAALA GCDGSEGTDD STPADEGGDD DTSTEDDDMS TEDDSMEVAD VRHRSGTSLS PADIQFNPWG QNTAQISNEL IFDPFAEFNY ATGEYVPAII EEWEYTGDTF EMVIQEGATW HDGEPVTAQD LATYLRLDRE SGSSIWDWGS DVEEIDDRTV AITIEGDINP SLIEFSVMEN RLTTKHSRYG DVLSETQNAD DNTPLTEFVD DEPIGNGIFQ YGEADEQVIL TERHADHPNA DNVNFKEYAF QYFDGNTAIH QALLSGNIES MFAIYTPGNV VTDLPDSMNE YRTPRNGGVG IIPNHNHDHL GRREVRQAIA YAMNRTQVAA NSDPRTKVAP RIPTALPNAQ LENWLGDSME DFETYGRESS EVEKAASVLK EAGYSRNGDD VWEDEDGNTL SFELIAPGGW SDWVTAMESV ADQLNAAGMD VEFSTVPFGD LGGSDGRWAQ GNFDATAEYW TAAFARAAHP YHNLRHQMVN PKATLRENGY AYPGAVEDRG GSEADITVPA LDGSGELTVN PVEDVGTLGS TSDSDTEAEL ALELLWVSNQ DLPMIPIQEG LNQTFISSKR FDIPAEDAEV AQVQYANTWL PRQGEMTYNG N
|
| |
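The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how the GC content and the translation could be recomputed from the gene sequence. It assumes the CDS starts with ATG, as this one does. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so alternative start codons such as GTG or TTG are not handled here. The helper names (`clean`, `gc_fraction`, `translate`) are illustrative, not taken from the source database.

```python
from itertools import product

# Codon -> amino acid map, built with bases in TCAG order.
# These amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two display blocks of the gene sequence above.
print(translate("ATGGCAGACA AGTCCAACGA"))  # MADKSN
```

Passing the full gene sequence to `translate` should give the protein sequence listed above once its spaces are removed, and `gc_fraction` can be compared against the GC content row.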