Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0502 |
Symbol | |
ID | 7400383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 521826 |
End bp | 522815 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707567 |
Product | phosphate binding protein |
Protein accession | YP_002565174 |
Protein GI | 222478937 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.940111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.578937 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCAA GCCACGACGC GGATACCGAG GGGATCACGC GGCGGAAATC GCTTTCGGCG CTCGCCGGGG CGGGCGCGCT GGCGCTCGCA GGATGTACGC AGAGCACGGG AGGCGGCAGT GGTGACGCCC TGTCCGGCTC GATCAACATC GCGGGTTCCT CGACCGTCTT CCCGCTGATG AGCGCGATCG GGGAGGACTT CGCCGCGGAG CACGACCAGG TCTCGGTCGA CATCAGTTCG ACCGGCTCGG GCGGCGGGTT CTCGAACTAC TTCTGCGTCG GCGACACCGA CTTCAACAAC GCGAGCCGGC CGATCCAGCC CGCAGAGGAG GAACTCTGCG AGGAGAACGG CGTCGAGTAC GTCGAGCTCA TCGCGGCGAC GGACGCGCTC ACGATCGTCG TCAACCCCGA GGCCGACTGG ATCGACTCCC TCACCGTCGA GGAGCTCGCA CAGATCTGGG AGGAGGACCC CGCCCAGACC TGGGACGAGG TCCGCGACGA GTTCCCGAAC GAGGAGATCG AGCGGTTCGG CGCCGCCGAC ACCTCCGGGA CGTACGACTA CTTCATCGAG AGCATTCTGG AGGAGCGCGG TCACACTAGC GACTACCAAG CGACCGAGCA GGACAACTCG ATCGCACAGG GCGTCTCGGG CAGCGAGTAC GCGATCGGCT ACTTCGGCTT CGCGTACTAC TTCCAGAACC CCGATCAGCT CAAGGCGCTC GGGATCGACG ACGGCAACGG GCCGGTCGAG CCGAGCCTCG AAACGGCGTC CAGCGGCGAG TACACTCCCC TTTCCCGGCC GCTGTTCACC TACCCGTCGA TCGAGTCACT CGGCAAGGAG CACGTCGCCG AGTTCGCGCG CTACTTCGTC GAGCAGACGA CCAACGAGGA CCTCGTCGCC GGCGACGTGG GATACGTGCC CGCGACTGAA GAGACCCAAG AGGAGCAGAT GGAGATCCTC GAGGACGCTA TCGAGCGGGC GCAGGAGTAA
|
Protein sequence | MASSHDADTE GITRRKSLSA LAGAGALALA GCTQSTGGGS GDALSGSINI AGSSTVFPLM SAIGEDFAAE HDQVSVDISS TGSGGGFSNY FCVGDTDFNN ASRPIQPAEE ELCEENGVEY VELIAATDAL TIVVNPEADW IDSLTVEELA QIWEEDPAQT WDEVRDEFPN EEIERFGAAD TSGTYDYFIE SILEERGHTS DYQATEQDNS IAQGVSGSEY AIGYFGFAYY FQNPDQLKAL GIDDGNGPVE PSLETASSGE YTPLSRPLFT YPSIESLGKE HVAEFARYFV EQTTNEDLVA GDVGYVPATE ETQEEQMEIL EDAIERAQE
|
| |