Gene Htur_3376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3376 
Symbol 
ID8743996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3485194 
End bp3486723 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content68% 
IMG OID646513958 
Productsulfatase 
Protein accessionYP_003404912 
Protein GI284166633 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGT CAGTCGATTC ACCCGATTCA TCAGACGTCG ACGCAGCGTC GGCTAACGGA 
CGTGATCCGG AGTCACATTC CACTGTGCGG AACGTCGTGC TCGTCGTCCT CGATACCGCA
CGGGCGACGA GTACGGGGCC GAAGACGACG CCGAATCTGA ACCGCCTCGC GGCCGACGGA
ACCCACTTCG ACAACGCCTT CGCGACCGCA CCCTGGACGC TCCCGTCTCA CGCCTCGATG
TTCACCGGGA CCTATCCCTC CGAGCACGGC ACCCACGGCG GACACACCTA TCTCGACGAC
GAGTTGCGGA CCCTCCCAGA GTCCTTCGCC GATTCCGGAT ACGAGACGAT CGGCGTCTCG
AACAACACCT GGATCACCGA GGAGTTCGGC TTCGACCGCG GCTTCGACGA CCTCCGGAAG
GGCTGGCAGT ACATCCAGTC CGACGCGGAC ATGGGCGCCG TCGTCCGCGG CGAGGACCTC
CGGGAAAAGC TCCAGGCGAC CCGGAACCGG CTCTTCGACG GCAACCCGCT GGTCAACGCC
GCGAACATCC TCTACAGCGA GGCCCTCCAG CCCGCGGGCG ACGACGGCGC CGACCGATCG
ACGACCTGGA TCACCAACTG GCTCGACGAT CGCGACGACA GCCGTCCGTT CTTCCTGTTC
TGTAACTTCA TCGAACCCCA CGTCGAGTAC GATCCGCCCC GCGAGTACGC CGAGCGGTTC
CTCCCAGACG GCGCGAGCGT CGACGAGGCA CTCGCCATCC GGCAGGACCC CCGCGCCTAC
GACTGCGAGG ACTACGGCCT CTCCGAACGG GACTTCGCCC TGCTCCGCGG GCTCTACCGG
GCCGAACTCG CCTACGTCGA CGAGCAGCTC GGACGGCTCC GGGCGGCCCT CGAGGACGCC
GGCGAGTGGG AGGACACCCT CTTCGTCGTC TGCGGCGACC ACGGCGAGCA CATCGGCGAC
CACGGCTTCT TCGGCCACCA GTACAACCTC TACGACACCC TGATCAACGT CCCGCTGGTC
TGCCACGGCG GCCCCTTCAC CGACGGCGGC CAGCGCGAGG ACCTCGTCCA GTTGCTCGAC
CTCCCCGCCA CGCTGCTCGA GACCGCGGGG ATCGACGATC CCGAACTGCG CGCGCAGTGG
TCCAGCCGCT CGTTCCACCC CGCGTCGGAC GACGACCCCC GAGACGCCGT CTTCGCGGAG
TACGTCGCCC CCCAGCCCTC GATCGACCGC CTCGAGGCCC GCTTCGACGA ACTTCCCGAC
CGCGTCTACG AGTACGACCG TCGCCTCCGG GCCGTCCGGA CGCGCGAGTA CAAGTACGTC
CGCGGCGACG ACGGGTACGA CCGGCTCCAC GACGTCGAGA CCGACCCGCT CGAGCGCGAC
GACATCGCCG CACGGGAGCC CGAGCAGGTG CGAGCGATGC AGCGGCGCCT CGAGGAGCGG
TTCGACCCGC TCGCCGAGGC CGGCGAGAGC GGCGAGGTCG AGATGCGCGA GGGGACCAAG
GAGCGACTCG CGGATCTGGG GTATCTCTAA
 
Protein sequence
MAESVDSPDS SDVDAASANG RDPESHSTVR NVVLVVLDTA RATSTGPKTT PNLNRLAADG 
THFDNAFATA PWTLPSHASM FTGTYPSEHG THGGHTYLDD ELRTLPESFA DSGYETIGVS
NNTWITEEFG FDRGFDDLRK GWQYIQSDAD MGAVVRGEDL REKLQATRNR LFDGNPLVNA
ANILYSEALQ PAGDDGADRS TTWITNWLDD RDDSRPFFLF CNFIEPHVEY DPPREYAERF
LPDGASVDEA LAIRQDPRAY DCEDYGLSER DFALLRGLYR AELAYVDEQL GRLRAALEDA
GEWEDTLFVV CGDHGEHIGD HGFFGHQYNL YDTLINVPLV CHGGPFTDGG QREDLVQLLD
LPATLLETAG IDDPELRAQW SSRSFHPASD DDPRDAVFAE YVAPQPSIDR LEARFDELPD
RVYEYDRRLR AVRTREYKYV RGDDGYDRLH DVETDPLERD DIAAREPEQV RAMQRRLEER
FDPLAEAGES GEVEMREGTK ERLADLGYL