Gene Hlac_3592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3592 
Symbol 
ID7402507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp344459 
End bp345814 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content53% 
IMG OID643710130 
Productsulfatase 
Protein accessionYP_002567696 
Protein GI222481460 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGATA TCGTTTTAGT AACAGTTGAC TCGTTACGCG CCGATCACGT CGGCTGGCAC 
GGCTACGATC GGAATACAAC GCCAAATCTT GACCAGCGCG CGGCATCAGC CCAGACGTTC
ACGTCCGCCT TTTCCCATGC ATGTTCAACA CGACCTTCGT TTCCGTCTAT TATGACTTCG
TCGTACGCTC TTGAGTACGG AGGATTCGAA CGACTCTCCT CGAAACGAAC CACAATTGCC
GAACTTTTAG AAGAGGCCGG GTACGAGACT GCTGGCTTCC ACTCGAACCT CTATCTCTCT
GCTGATTTTG GCTACGATAG AGGATTCAAT CGGTTCTTTG ATTCGAAATC GGACCCAGGG
ACACTCGCTA AACTTCGACA GGAGGTCAAA ACACACCTTG ACTCCGATGG CCATCTCTAC
GGTTTTCTTC AGCAGGCGTT CAACGCAACG GAGAAACGAG CAGGTATTGA ACTCGGTTCT
GCCTACATCG ACGCTGAGGA AATCACCGAT CGTGCGCTCT CTTGGGCGTC TTCAACGAGT
AGCAATCCCC GCTTCCTTTG GGTGCACTAC ATGGATGTCC ACCATCCGTA CGTCCCACCA
GCGGAGCATC AGCGGCGATT CCGCGATGAA CCGGTCAACG ACCGTGACGC TGTTCAGCTT
CGGAGGAAAA TGTTGGAATC ACCGGAGAAG ATAACTGATC AGGAGTTTAA CACGCTCATT
GATCTTTATG ACTCCGAAAT ATCCTATGTC GACGCACAGG TTGAACGCCT AATAGAAACA
CTTCAGGCAG AATGGGACAA TAATCCCGTA ATCGCATTCA CCGCCGATCA CGGAGAGGAG
TTCCTCGATC ACGGTGGGTT CAGTCACAGT GCTACCTTCT ACGACGAAGT AATTCATGTG
CCGCTGTTCG TTGACACTGG AGAAGATGAG ACAGTAGAAA ACGACAATCT CGTTGGCTTG
ATGGATCTAG CACCCACTCT CGCTGATAAA GCGGATGTCG ATCGACCGGA GACCTATCGG
GGTCAACCGC TGAGTCAGGT CGAGGACCAG TGGAACCGGT CAGAAGTCAT CGCCGAATGG
GCCGACACCG ACACAGATGA TCGTCGGTTT GCCGTTCGGA CCACGAACTG GAAGTATATC
CGCGAGGAAA ACGGAGCTGA GCAACTTTAC GACCTTACCG CTGATCCGGA GGAGATGAAC
GATCTTGCTA CTGGGAATCC CAACGTATTA TCGGACCTCC GCGAAACGCT TGAGGATCAT
CTGGCGACGT TAGACGAAAG CCGCGAGGAC CTCGGTGATG TCGAGATGGA CGAGGAGGTG
CGCCAGCGAC TTCGCGACCT CGGATATCAG GAGTAG
 
Protein sequence
MRDIVLVTVD SLRADHVGWH GYDRNTTPNL DQRAASAQTF TSAFSHACST RPSFPSIMTS 
SYALEYGGFE RLSSKRTTIA ELLEEAGYET AGFHSNLYLS ADFGYDRGFN RFFDSKSDPG
TLAKLRQEVK THLDSDGHLY GFLQQAFNAT EKRAGIELGS AYIDAEEITD RALSWASSTS
SNPRFLWVHY MDVHHPYVPP AEHQRRFRDE PVNDRDAVQL RRKMLESPEK ITDQEFNTLI
DLYDSEISYV DAQVERLIET LQAEWDNNPV IAFTADHGEE FLDHGGFSHS ATFYDEVIHV
PLFVDTGEDE TVENDNLVGL MDLAPTLADK ADVDRPETYR GQPLSQVEDQ WNRSEVIAEW
ADTDTDDRRF AVRTTNWKYI REENGAEQLY DLTADPEEMN DLATGNPNVL SDLRETLEDH
LATLDESRED LGDVEMDEEV RQRLRDLGYQ E