Gene Htur_2977 details

Gene Information

Locus tag: Htur_2977
Symbol:
ID: 8743595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 3062364
End bp: 3063473
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 67%
IMG OID: 646513562
Product: sulfatase
Protein accession: YP_003404518
Protein GI: 284166239
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
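
The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3062364, 3063473)
print(length_bp)                  # 1110, as in the Gene Length field
print(protein_length(length_bp))  # 369, as in the Protein Length field
```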


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACA CTACCCTGCT AGTTACGGTC GATTCGCTCA GAACCGATCA CGTCCAGTAC 
ATGCCGGAGA CCCTGGCGTT TCTGGACGAC ACCCACGACG CCGCGTTCGC CACGAGCACC
GCAACGCCCG GCAGCTTCCC GGCGATCATC GGCGGGGAGT ATCCGGCCGG CAACGGCCTC
GAGGAAGCGG CCAGCGTCGC CCACGAGTTC GACGCTCCCA GCGTCGGGAT CACGACGAAC
CACCTGCTCT CTCAGGAGTA CGGCTACGCG GCCGGGTTCG ACTCGTTCAC GTCGCCGAAG
GGCGGCGGCG AGTCGCTGAA GGACAAGGGT GCGATCTTGC TCGAGCGCGG CTCGCTTCCC
TACAAGGTCG CCAGCTGGGG CTACAACCGC TACCAGCAGC TTCGGAGCTA CGTCGAGGAG
ACCGAGAAGT CGTTCAGACC CGCGGACGCT GTCGTAGACC AATTTCTGCG CGAGGTCGAC
GACCGCGAGG AGTGGTTCGG CTGGCTGCAC TTCATGGAGC CCCACCACCC GTACGACCCC
GACGGTGCGA ACATCGACCG CGCGACGGCC CAGCGGGTCA CCCGCCGCGT CCTCTCGGAT
CGGGGCTCCG AGGAGGACGA AGCCCTCGTA CGGGACCTCT ACCGACAGGA GATCGCCGAA
CTCGACGCGG CCCTCGAGGC CCTCTGGAAC GCGATCCCCG ACGAGACGCG GGTCGTCTTC
TGTGGCGACC ACGGCGAGTT ACTCGGCGAG GACGGACTGT GGGGCCACCC CGGCGAGATG
CGCCCCGAAC TGCTGAACGT CCCGTTCGGG ACGCGAAACG CCCCCGACGT CGGCGAGGTC
GTCTCCCTGA TCGACGTGCC GACGATCCTG ACCGGCGCCG AACACCGTCA GGGGACGCTC
GATCGCGACA CCGCGTTCGC GGCCTACGGA GACCGAAAGG CAGCGATGAC CGCCGACCGC
ATCGCGACCG AAGACGGCGT GTATCGGCTC GAGGACGGCG AACCGGTCGA CGACCCCGAT
CTCGAGCGCG AACTCGATCG GTTCGATCCC GCCTACGTCG TCAAGGAAGA GGCGCTGCAG
GAAGACCTGG AGGATCTGGG CTACGCATGA
 
Protein sequence
MTDTTLLVTV DSLRTDHVQY MPETLAFLDD THDAAFATST ATPGSFPAII GGEYPAGNGL 
EEAASVAHEF DAPSVGITTN HLLSQEYGYA AGFDSFTSPK GGGESLKDKG AILLERGSLP
YKVASWGYNR YQQLRSYVEE TEKSFRPADA VVDQFLREVD DREEWFGWLH FMEPHHPYDP
DGANIDRATA QRVTRRVLSD RGSEEDEALV RDLYRQEIAE LDAALEALWN AIPDETRVVF
CGDHGELLGE DGLWGHPGEM RPELLNVPFG TRNAPDVGEV VSLIDVPTIL TGAEHRQGTL
DRDTAFAAYG DRKAAMTADR IATEDGVYRL EDGEPVDDPD LERELDRFDP AYVVKEEALQ
EDLEDLGYA