Gene Htur_3375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3375 
Symbol 
ID8743995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3483604 
End bp3485028 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content68% 
IMG OID646513957 
Productsulfatase 
Protein accessionYP_003404911 
Protein GI284166632 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCCAC ACATCGTTCT CATCCACTGC CACGATCTGG GGAAGTACGT GGGCTGTTGC 
GGCGCCGGCG TCGAGACGCC ACGGATCGAC GGCCTCGCCG CGGCGGGCGT CCGGTTCGAT
CGCCACTTCG TGACGGCCCC GCAGTGTTCG CCGAGTCGCT CGAGTCTGAT GACCGGCCGT
CACCCCCACC AGAACGGGAT GCTCGGACTC GCCCACGGCA ACTGGGAGGT CGGCCCCCAC
GAGCGATTCC TGCCGGAGTT ACTCGGCGAG GCCGGCTACG AGACGCACCG CTTCGGACTC
CAGCACGTCA CCGAGTACCC CGAACGACTC GGCTACGATC TGACGCACAA CGAGGAGTCC
CTGACGAGCG AAACCCCGAC GTCGGTCCAC GAGGGCGCCC GCGCACGCAC CGTCGCCGCC
GACGTCGCGG GGTGGCTCGA GGCGGGCGAC CGCGACGATC CGTTCTTCGC CTCGGTCGGC
TTCTTCGAAC TCCACCGCAT CGCGGTGGAC GGCGGGTTCA GCTTCGACGG CGAGCGGTAC
GACGCCCCCG ATCCCAACGC GGTCGAGCCC CTCGAGTTCC TCCCCGATCG GCCCGGTATC
CGGTCGGACA TCGCCGGAAT GAACGGGATG GTCCGCGCGA TCGACGACGG CGTCGGAACG
ATCGTCGACG CCCTCGAGAA CGAGGGACTG GCCGAAGACA CCCTCCTGCT CTTCACGACC
GAACACGGGC TGGCGATGCC GCGCGCGAAG GGCACCTGTT TCGACGCCGG CATCGAGGCG
GCTCTGCTGA TGGCCCAACC GGGAACCCTC GCGTCGGGCC GGGTCGTCGA CGACCTCGTG
AGCAACGTCG ACGTCTTCGC GACGCTGCTC GATATCGGGG ACGCGCCGGT CCCCGGCGTC
GACACCGATG GGGACGACAT CGCGGGGCAG AGTTTCGCGC CGCAGTTGTT CGACGGCGGG
AACGGCGGCG GCACGAACGG AGCCGCTGGC AAGGACGCCT ACAAGCCCCG CGACCGGGTC
TTCTCGGGGA TGACCTGGCA CGATCGATAC AACCCGATCC GGGCCATCCG AACCGACCGC
TGGAAGTATA TCCGTAATTT CTGGCACCTA CCCGCAGTCT ACATGACGAC GGACGTCTTC
TGCAGCGCGG CGGGTCGGGA GGTCCACGAG GACTACTACG GCGTGCAGCG ACCCTACGAG
GAACTATACG ACCTCGAGGC CGACCCGCTC GAGCGGGAGA ACCTCGCGGC GGGGGACGAC
CCGGACGATC CGGCTACCGA GACCGTTCGC GACGAGCTTC GAACGGACCT GCTCGAGTGG
ATGGACGCGA CCGGTGATCC GCTGCTTGAG GGCCCCGTGC TGCCGAACAA CTGGGAGACG
GTCCACCCCC GGCTGGAGGA CGACCGCGAC GACATCCGGC GGTAA
 
Protein sequence
MPPHIVLIHC HDLGKYVGCC GAGVETPRID GLAAAGVRFD RHFVTAPQCS PSRSSLMTGR 
HPHQNGMLGL AHGNWEVGPH ERFLPELLGE AGYETHRFGL QHVTEYPERL GYDLTHNEES
LTSETPTSVH EGARARTVAA DVAGWLEAGD RDDPFFASVG FFELHRIAVD GGFSFDGERY
DAPDPNAVEP LEFLPDRPGI RSDIAGMNGM VRAIDDGVGT IVDALENEGL AEDTLLLFTT
EHGLAMPRAK GTCFDAGIEA ALLMAQPGTL ASGRVVDDLV SNVDVFATLL DIGDAPVPGV
DTDGDDIAGQ SFAPQLFDGG NGGGTNGAAG KDAYKPRDRV FSGMTWHDRY NPIRAIRTDR
WKYIRNFWHL PAVYMTTDVF CSAAGREVHE DYYGVQRPYE ELYDLEADPL ERENLAAGDD
PDDPATETVR DELRTDLLEW MDATGDPLLE GPVLPNNWET VHPRLEDDRD DIRR