Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3592 |
Symbol | |
ID | 7402507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | - |
Start bp | 344459 |
End bp | 345814 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643710130 |
Product | sulfatase |
Protein accession | YP_002567696 |
Protein GI | 222481460 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGATA TCGTTTTAGT AACAGTTGAC TCGTTACGCG CCGATCACGT CGGCTGGCAC GGCTACGATC GGAATACAAC GCCAAATCTT GACCAGCGCG CGGCATCAGC CCAGACGTTC ACGTCCGCCT TTTCCCATGC ATGTTCAACA CGACCTTCGT TTCCGTCTAT TATGACTTCG TCGTACGCTC TTGAGTACGG AGGATTCGAA CGACTCTCCT CGAAACGAAC CACAATTGCC GAACTTTTAG AAGAGGCCGG GTACGAGACT GCTGGCTTCC ACTCGAACCT CTATCTCTCT GCTGATTTTG GCTACGATAG AGGATTCAAT CGGTTCTTTG ATTCGAAATC GGACCCAGGG ACACTCGCTA AACTTCGACA GGAGGTCAAA ACACACCTTG ACTCCGATGG CCATCTCTAC GGTTTTCTTC AGCAGGCGTT CAACGCAACG GAGAAACGAG CAGGTATTGA ACTCGGTTCT GCCTACATCG ACGCTGAGGA AATCACCGAT CGTGCGCTCT CTTGGGCGTC TTCAACGAGT AGCAATCCCC GCTTCCTTTG GGTGCACTAC ATGGATGTCC ACCATCCGTA CGTCCCACCA GCGGAGCATC AGCGGCGATT CCGCGATGAA CCGGTCAACG ACCGTGACGC TGTTCAGCTT CGGAGGAAAA TGTTGGAATC ACCGGAGAAG ATAACTGATC AGGAGTTTAA CACGCTCATT GATCTTTATG ACTCCGAAAT ATCCTATGTC GACGCACAGG TTGAACGCCT AATAGAAACA CTTCAGGCAG AATGGGACAA TAATCCCGTA ATCGCATTCA CCGCCGATCA CGGAGAGGAG TTCCTCGATC ACGGTGGGTT CAGTCACAGT GCTACCTTCT ACGACGAAGT AATTCATGTG CCGCTGTTCG TTGACACTGG AGAAGATGAG ACAGTAGAAA ACGACAATCT CGTTGGCTTG ATGGATCTAG CACCCACTCT CGCTGATAAA GCGGATGTCG ATCGACCGGA GACCTATCGG GGTCAACCGC TGAGTCAGGT CGAGGACCAG TGGAACCGGT CAGAAGTCAT CGCCGAATGG GCCGACACCG ACACAGATGA TCGTCGGTTT GCCGTTCGGA CCACGAACTG GAAGTATATC CGCGAGGAAA ACGGAGCTGA GCAACTTTAC GACCTTACCG CTGATCCGGA GGAGATGAAC GATCTTGCTA CTGGGAATCC CAACGTATTA TCGGACCTCC GCGAAACGCT TGAGGATCAT CTGGCGACGT TAGACGAAAG CCGCGAGGAC CTCGGTGATG TCGAGATGGA CGAGGAGGTG CGCCAGCGAC TTCGCGACCT CGGATATCAG GAGTAG
|
Protein sequence | MRDIVLVTVD SLRADHVGWH GYDRNTTPNL DQRAASAQTF TSAFSHACST RPSFPSIMTS SYALEYGGFE RLSSKRTTIA ELLEEAGYET AGFHSNLYLS ADFGYDRGFN RFFDSKSDPG TLAKLRQEVK THLDSDGHLY GFLQQAFNAT EKRAGIELGS AYIDAEEITD RALSWASSTS SNPRFLWVHY MDVHHPYVPP AEHQRRFRDE PVNDRDAVQL RRKMLESPEK ITDQEFNTLI DLYDSEISYV DAQVERLIET LQAEWDNNPV IAFTADHGEE FLDHGGFSHS ATFYDEVIHV PLFVDTGEDE TVENDNLVGL MDLAPTLADK ADVDRPETYR GQPLSQVEDQ WNRSEVIAEW ADTDTDDRRF AVRTTNWKYI REENGAEQLY DLTADPEEMN DLATGNPNVL SDLRETLEDH LATLDESRED LGDVEMDEEV RQRLRDLGYQ E
|
| |