Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0357 |
Symbol | |
ID | 8409855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 349360 |
End bp | 350490 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645018682 |
Product | phosphate binding protein |
Protein accession | YP_003176201 |
Protein GI | 257386428 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0129882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.511021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGCG AGTCAGCGCG ACTGTCTGAT CTGGTATCAC GGCGGAAATT CATACTGACC TCTGGCGCGG TTGGTGCTGC CGGACTCGCA GGCTGTACCA GCGGAAGTGA ACAGGGTGAC GCCGAGAGCG ACGGCGGTGA CGGTGGCAAC GGTGGCGACG GCGGCGACAG CGGTGACGGT GGCGACAGCG GCGACGGCGG CGACGGTGGT AGCGGCGACG ACTCGAACGT CGACCGGGCG TCGCTCTCGG GAGACGTCCG AATCTCGGGG AGCAGCACGG TGTACCCGGT GGCCGAGGAG GTCTCTCGAC TGTGGAACGA GGAGTACGGC GACGGCGTCG GGTTCAACAT CACGCCGGAC GGCAGCGGCG GGGGCTTCGA GAACGTCTTC ATCCCGGGCG ACAGCGACAT CAACAACGCG AGCCGCCCGA TCAAAGACGA AGAGCTCCAG CGCTGCCGCG ACAACGGCAT CGAACCGGTC GAGTTCTACG TCGCTCAGGA CGCGCTCACC GTGGTCGTCA ACAACGACGC CGACTTCATC GACGAGATCT CACTGGAGGA TCTCAAGACC ATCTGGTCAC CGGACACCGC TCCCGAAATG TGGAGTGACG TCAACTCTGA CTGGCCGGAC GAGCCGCTCG ACCTGTACGG TCCGGCGAGC ACGTCGGGAA CCTACGACTA CTTCATCGAG GCGGTCATCG GCGAGACCGA GTCGGACCAG CCGATCCGCA GCGACTTCGA GGGGACCGAG GAGGACAACC TGATCGCACA GGGCGTCTCC GGCAACGAGT ACGCGTTCGG GTACCTCCCC TTCGCGTACT ACACGAACAA CCCGGACTCG GTCAAGGCGC TCGGCCTCGT CGAGGGCGGG AACGACCCGG TCGAGCCGAG TCTCGAAGGC GCACAGAGCG GGAACTACCC GCTCGCCCGG CCGCTGTTCT TCTACGCCAA CATGAACAAG CTCCAGGAGA AGACCCACCT CCAGGAGTTC ATCCGCTTCT ACGTCAACGA GGCCGACGAG GATTACATCG CCAGCGACGT CGGCTACGTC CCCTCCAGCG ACCAGATGGT CGAGGACAAC CTCGCGAACC TCGAAGCGGG CATCGCTGGC GACTACGAGT TCAGCAGGTG A
|
Protein sequence | MTRESARLSD LVSRRKFILT SGAVGAAGLA GCTSGSEQGD AESDGGDGGN GGDGGDSGDG GDSGDGGDGG SGDDSNVDRA SLSGDVRISG SSTVYPVAEE VSRLWNEEYG DGVGFNITPD GSGGGFENVF IPGDSDINNA SRPIKDEELQ RCRDNGIEPV EFYVAQDALT VVVNNDADFI DEISLEDLKT IWSPDTAPEM WSDVNSDWPD EPLDLYGPAS TSGTYDYFIE AVIGETESDQ PIRSDFEGTE EDNLIAQGVS GNEYAFGYLP FAYYTNNPDS VKALGLVEGG NDPVEPSLEG AQSGNYPLAR PLFFYANMNK LQEKTHLQEF IRFYVNEADE DYIASDVGYV PSSDQMVEDN LANLEAGIAG DYEFSR
|
| |