Gene Htur_4646 details


Gene Information

Locus tag: Htur_4646
Symbol:
ID: 8745406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 225680
End bp: 227065
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 58%
IMG OID: 646515157
Product: sulfatase
Protein accession: YP_003406104
Protein GI: 284172722
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
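
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(225680, 227065)
print(length, protein_length_aa(length))  # 1386 461
```

Both results agree with the Gene Length and Protein Length fields of this record.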


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGATT ATAAGGTGCC GGGCGGGCGA TGGGAGATAA TGACCGACGA CGACCATGAC 
CGGCCGAACG TCATCGCGGT GGTGACCGAT CAGCAGCGCT GGGACACGGT TGGCGTCTAC
GGGTGTCCGC TGGACCTCAC TCCGACGCTC GATACGCTCG CAGCGCAGGG AAGCGTCCTC
ACACAGGCAA TTACACCGCA GCCGCTCTGT GGCCCCTTCC GGGCGGCGTT TCAAAGCGGA
AAGTACGCTA GCGAGGTCGA CGTATGGCGG GACGCGGTGA GAATGCCAAG CGATGAGCTG
CATCTCTCCA GACAGTTCAA AGATGCCGGG TACGACGTCG GATACGTCGG GAACTGGCAT
ATTGCCGGAA CCTTCGATAA TCCCGTTCCT GAACAGTCCC GCGGCGGATA CGAGGACTTC
TGGATCGCTG CGGACGTTCC GGAATTCACT ACACAACCGA CGGAGGGTCA CTTGTTCGAT
GCCGACGGAA ATCCCGTCAA GTTCGAACGG TATCGTGTGG ATGCGTTTAC TGCGTTCGCC
TGCGAAGCTA TCGAGTCGCT GTCTGAGCCG TTTTTCCTTG TCGTCGCGTA CGTCGAACCG
CATAACCAGA ACGATATGTG GTCGTACGTC GCGCCGGACG GGTACGCAGA GCCGTACCAG
AAACGCCCGT ACGTACCAGA GGATTTGCAG GACCGGCCAG GCGACTGGTA CGAAGCGTTA
CCAGACTACT ATGGAATGGT CGAGCGAATC GACGAATGCG TCGATAATCT TCTCGAAGTG
TTGTCTGATC GGGGTATCCG AGACCGGACA ATTATCGCTT ACACGTCCGA TCACGGGTGC
CACTTCCGGA CGCGGCCGGG CGAGTACAAG CGAGACCCCC ATGAGTCCGC CATTCGGGTG
CCCGCAATAC TCGTCGGGCC GGGATTCGAC AAGGGAGTCG ACGTCACTCA GCCAACGAGC
ATGATCAATC TCCCACCGAC CTTGCTTGAT GCCGCCGGCA TCGATGTCCC TAACGAAATG
CACGGTGAGA GCCTCCTCCC GATCATCCGC AGAGATGTAC CTGATGTCAA CGGTGAGGCA
TTCATCCAGA TTAGCGAATC ACAGGTTGGC CGGGCGCTCC GAACCGACCG CTGGAAGTAC
GCCGTCGCCG CTTCGTCGCT AACCGGATGG CGCGGCGGCA GCGCCGAAAA ATCGAGTGAC
GTGTACGTCG AACGTTATCT CTACGATCTT GAACGCGATC CACACGAGCA GGTTAACCTA
GTTGGTCATC CAGACTTTCG ATCTATCGCT GATGATCTTC GCGATCGGAT CCTTGCATAC
ATTCAGGAGA TTGAAGACGA ATCACCCCGG ATCAAGCCCT ACGAGGGCGG TTACACCGGG
TTTTGA
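
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; it is an illustration with hypothetical helper names, not necessarily the method the database used. Only the first displayed row is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence (6 blocks of 10 bases).
first_row = "ATGATGGATT ATAAGGTGCC GGGCGGGCGA TGGGAGATAA TGACCGACGA CGACCATGAC"
seq = clean_sequence(first_row)
print(len(seq))                       # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this row only
```

Passing the full sequence block through `clean_sequence` in the same way gives a value that can be compared with the reported GC content.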
 
Protein sequence
MMDYKVPGGR WEIMTDDDHD RPNVIAVVTD QQRWDTVGVY GCPLDLTPTL DTLAAQGSVL 
TQAITPQPLC GPFRAAFQSG KYASEVDVWR DAVRMPSDEL HLSRQFKDAG YDVGYVGNWH
IAGTFDNPVP EQSRGGYEDF WIAADVPEFT TQPTEGHLFD ADGNPVKFER YRVDAFTAFA
CEAIESLSEP FFLVVAYVEP HNQNDMWSYV APDGYAEPYQ KRPYVPEDLQ DRPGDWYEAL
PDYYGMVERI DECVDNLLEV LSDRGIRDRT IIAYTSDHGC HFRTRPGEYK RDPHESAIRV
PAILVGPGFD KGVDVTQPTS MINLPPTLLD AAGIDVPNEM HGESLLPIIR RDVPDVNGEA
FIQISESQVG RALRTDRWKY AVAASSLTGW RGGSAEKSSD VYVERYLYDL ERDPHEQVNL
VGHPDFRSIA DDLRDRILAY IQEIEDESPR IKPYEGGYTG F
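
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so a standard codon table is enough for a sketch. The names below are hypothetical, and the code does not handle alternative start codons (not needed here, since this gene begins with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, dropping one terminal stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate the spaced display format
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGATGGATT ATAAGGTGCC GGGCGGGCGA"))  # MMDYKVPGGR
```

The last six codons of the gene, `GGT TAC ACC GGG TTT TGA`, translate to `GYTGF` followed by the stop codon, matching the end of the protein sequence.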