Gene Htur_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2973 
Symbol 
ID8743590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3057297 
End bp3058739 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content68% 
IMG OID646513557 
Productsulfatase 
Protein accessionYP_003404514 
Protein GI284166235 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGAC ACGCGACCAT GGGAGACGAG CCATCCATCG CCCTCGTCGT GCTCGACACC 
CTGCGGGCGG ATTCCTTCGA CGAGCACTTC GACTGGCTAC CGGGCGTGCA GTTCACGAAC
GCGTGGAGCA CGAGCCACTG GACGGCCCCG GCCCACGCCT CGCTGTTCAC CGGCCGGTAC
GCGAGCGAGG CCGGCGTGAC GATCAAATCC CAGGATTTCG ACCGGGACAC GACTCGCCTC
CCCGAACTCC TCCGGGACCG CGGCTACCGG ACGCGGGCGT TCAGCTGTAA CGTCAACATC
TCGGAGCAGC TAGGCTGGCA CCACGGGTTC GACGAGTTCG ACGGCGGCTG GCGGCTCAGC
GGCCTCGGCG AGGACGTCTT CGACTGGGAC GAGTTCATCG CCGAACACCG GGCCGACGGC
CCCGAACGGT ACCTCCGAGC GCTCTGGCGC TGCGTCGACG GCGACTGCGA TACGACCCAG
TCGCTGAAAC AGGGCGCCCT GATGAAGCTC CGAGACATGG GCCTCAAGGG GCGCCACCCC
GACGACGGCG CGTCGGAGTT TCTGGAGTAC GTCCAGAAGC GTTCGTGGAC CCAGGACAGC
GAGTTCCTCT TCGCGAATCT GATGGAGGCG CACCTGCCGT ACGACCCGCC CGACGAGTAC
AAGACCTATC CGGACGAGGA GTCGCCCCAC TTCGACAGCG TGAAGGCCAC GCTCGGGGAG
CCCTCGGCCG ATCCGGAGCG GATCAGGACC GCGTACGACG ACGCCGTACG GTACCTGTCG
GACATCTACC GCGACATCTA CGGGGAGCTC GCCGCGGAGT TCGACTACGT CGTGACCCTC
GCCGACCACG GCGAAGCGCT CGGCGAGTAC GGCGCGTGGC AACACGGCGG CGGCCTCCAT
CCGCCCGTGA CGAAAGTCCC GCTCGTCGTC TCCGCGCCCG GACCGAACGC CGACGACCGA
ACCCCCAACG CAGGCAGCGC GGAGCGGCCA ACGCGGGACG CCCTCGTGAA CCTGCTGGAC
GTCTACGCGA CGGTGCTCGA CCTCGCCGGT ATCGAGTCGG CCCACCGTCG CGGCGAGTCG
TTCCGCCCGC TGTGCTCGAG CGATCCCACC GACACCGAGC CCCACTCGAG CGACACCGTC
ATCACTGAGC CGCGCTCGAG CGCGCTGCTC GAGTTCCACG GCATCTCGAA GCGACGCGCG
CTCGCGCTCG AGGAGGACGG CTACGATATC GGCCCCGTCG ACCGCGAACG CCACGGCGTC
GCGACGGCCG ACTGTTACTA CTTCGAGGGG CTGTCCGGGA CCGAGCTGAT CGGCGACTGT
GACCCGGCGG CCCTCGAGTC GGAACTCGGT CGACTGGTCG ACGGCCTCGA GCGCCGCGAG
GGACTCTCCG AGGCGGATAT GGACGGTCTG GAATCCCAGC TCGAGGAGCT GGGGTATCTG
TGA
 
Protein sequence
MNGHATMGDE PSIALVVLDT LRADSFDEHF DWLPGVQFTN AWSTSHWTAP AHASLFTGRY 
ASEAGVTIKS QDFDRDTTRL PELLRDRGYR TRAFSCNVNI SEQLGWHHGF DEFDGGWRLS
GLGEDVFDWD EFIAEHRADG PERYLRALWR CVDGDCDTTQ SLKQGALMKL RDMGLKGRHP
DDGASEFLEY VQKRSWTQDS EFLFANLMEA HLPYDPPDEY KTYPDEESPH FDSVKATLGE
PSADPERIRT AYDDAVRYLS DIYRDIYGEL AAEFDYVVTL ADHGEALGEY GAWQHGGGLH
PPVTKVPLVV SAPGPNADDR TPNAGSAERP TRDALVNLLD VYATVLDLAG IESAHRRGES
FRPLCSSDPT DTEPHSSDTV ITEPRSSALL EFHGISKRRA LALEEDGYDI GPVDRERHGV
ATADCYYFEG LSGTELIGDC DPAALESELG RLVDGLERRE GLSEADMDGL ESQLEELGYL