Gene Htur_3492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3492 
Symbol 
ID8744112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3593257 
End bp3594753 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content67% 
IMG OID646514073 
Productsulfatase 
Protein accessionYP_003405027 
Protein GI284166748 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATA GCCGCCCGAA CGTTCTGTTC GTCCTCACCG ACCAGGAGCG CTACGACTGC 
ACGGCGCCCG AGGGACCGCC CGTCGAGACG CCGGCGATGG ATCGCCTCTC GAGCGAGGGG
ATGCGTTTCT CGCGGGCTTG CACCCCGATC AGCATCTGTA CGAGCGCCCG CGCCTCGCTC
ATGACCGGCC TGTTCCCCCA CGGCCACGGG ATGTTGAACA ACAGCCACGA GGCGGACGCG
ATCCGGCCGA ACCTGCCGCC CGAGTTACCG ACGTTCTCGG AACTGCTGGC CGAGAACGGG
TACGACTGCA GCTACACCGG AAAGTGGCAC GTCGGCCGGG ACCAGACGCC CGAGGACTTC
GGCTTCGCCT ATCTCGGCGG CAGCGACAAA CACCACGACG ACATCGACGA GGCGTTCCGG
GAGTACCGCG AGGAACGCGG CGTCCCGCCG GGCGAGGTCG ACCTCGAGGA GGTGCTCTAC
ACCGGCGACG ACCCGCGCGA TGCGAGCGAG GGAACCTTCG TCGCGGCGAC GACCCCGGTC
GATGTCGAGG AGACCCGCGC GTACTTCCTC GCCGAGCGGA CGATCGACGC CATCGAAGCG
CACGCCGATG GCGACAGCGG AGAGGGCGAC GGAAACGGCA GCGACCCATT CTTCCACCGC
GCGGACTTCT ACGGCCCCCA CCACCCCTAC GTCGTCCCCG AGCCCTACGC CTCGATGTAC
GACCCCAACG AGATCGATCC GCCCGAAAGC TACGCCGAGA CGTACGACGG GAAGCCCCAA
GTTCACGAGA ACTTCCACTA CTACCGCGGC GCCGACGGCC TCGAGTGGGA CCACTGGGCC
GAGGCCACCG CGAAGTACTG GGGGTTCGTC TCGCTGATCG ACGACCAGCT CGAGCGGATC
CTCGAGGCGC TCGAGGAGCA CGGACTGGCG GACGAGACGG CCGTCGTCCA CGCCTCGGAT
CACGGCGACT TCGTCGGCAA CCACCGCCAG TTCAACAAGG GCCCGCTGAT GTACGACGAC
ACCTACCGGA TTCCCCTACA GGTGCGCTGG CCCGGCGTCG CCGAACCCGG AACGACGTGC
GAGGTGCCCG TCCACCTCCA CGATCTGGCC GCGACGTTCC TCGAGATGGG CGGCGTCGAC
GTTCCGGAGT CGTTCGATTC CCGAAGTCTG GTGCCGCTGC TCGAGACCGG CGACGACCCG
GACGCGGTGC CCGACGACTG GCCCGACTCC ACCTTCGCCC AGTATCACGG CGACGAGTTC
GGCCTCTACA CCCAGCGGAT GGTCCGCACT GGGCGCTACA AGTACGTCTA CAACGGTCCC
GACATCGACG AGCTGTACGA CCTCAAGGCC GATCCCGCCG AATTGCAGAA CCTGATCGAC
CACCCGGGAT ACGCGGACGT TCGCGAGGAA ATGCGGGATC GACTCGTCGA CTGGATGCAG
GAGACGGACG ATCCGAACCA GGGGTGGGTG CCAGACGTGC TCAGAGACAC GCCGTAA
 
Protein sequence
MADSRPNVLF VLTDQERYDC TAPEGPPVET PAMDRLSSEG MRFSRACTPI SICTSARASL 
MTGLFPHGHG MLNNSHEADA IRPNLPPELP TFSELLAENG YDCSYTGKWH VGRDQTPEDF
GFAYLGGSDK HHDDIDEAFR EYREERGVPP GEVDLEEVLY TGDDPRDASE GTFVAATTPV
DVEETRAYFL AERTIDAIEA HADGDSGEGD GNGSDPFFHR ADFYGPHHPY VVPEPYASMY
DPNEIDPPES YAETYDGKPQ VHENFHYYRG ADGLEWDHWA EATAKYWGFV SLIDDQLERI
LEALEEHGLA DETAVVHASD HGDFVGNHRQ FNKGPLMYDD TYRIPLQVRW PGVAEPGTTC
EVPVHLHDLA ATFLEMGGVD VPESFDSRSL VPLLETGDDP DAVPDDWPDS TFAQYHGDEF
GLYTQRMVRT GRYKYVYNGP DIDELYDLKA DPAELQNLID HPGYADVREE MRDRLVDWMQ
ETDDPNQGWV PDVLRDTP