Gene Htur_2978 details


Gene Information

Locus tag: Htur_2978
Symbol:
ID: 8743596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 3063520
End bp: 3064974
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 68%
IMG OID: 646513563
Product: sulfatase
Protein accession: YP_003404519
Protein GI: 284166240
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
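
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence includes its stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3063520, 3064974)
print(length)                  # 1455
print(protein_length(length))  # 484
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.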


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAA ATCGTCGACC GAATATCGTC GTCCTCTGTC TCGACACCGT CAGGAAGGAC 
GTCTACGACC GATTCGCGAC CCGCCTCCGG GAGCGGGCGG CCGTCCGTTT CGAGGGGATG
CGGGCGCTCG GCGGCTGGAG CGTTCCGAGC CACGCCGGGA TGCTGACGGG CACCGTCCCG
TCGGAAACGG GTGTCCACGC CCACCAGCGG CGGTTCGACC CGATCGACCC CGAGGACACC
TGGATCGCGC CCCTCGAGGG GCAGGGGTAC GAGTCGGTCT GCGTCACGTC GAACATCTAC
GCCAGCCCCG TCTTCGGGTT CGACCGCTTC TTCGATCGGA CGGTTCCCAT CTCGCCGAGC
CGCAGACTCC CGGAGGGGAT GGACGTTCAA GAACACATCT CCGATCGGTC GGCCGAGGGC
GTGGAAGCGT ACGCCGATTT CGTGCGCGAG GCCCTCGAGC ACGACCACCC GCTGCGCTCG
CTGGCCAACG GCGTCCTGCT CAAACTGGAC GACGTGAGCC GGAAGCTGCC GATCGAGAAG
CCGACCGACT TCGGGGGGCG AGCGATCGCC CGCACCCTCG AGCGCGAGGT CGCCGAGCCC
GACGGGCCGG TCGTCGCCTT CGCTAACGTT ATGGACGCCC ACGGCCCCCA CACCGCCTTC
CGCGGGCTCG ATGACTCGAT CCACGGCGTC TCCGCGGACT TCCACTCGAG TTCGTTTCGC
GACTCGGACG TCAACGTCGT GGACGGACTC GGTGCGTACG AATCGGACAT CGAGCGCGTC
CGCCGGCTCT ACGCCGCGAC GGTCGACTAC CTCGACCGGG TCGTTACCGA CCTCCTGGAC
GCGTTAGCCC GCGAGGACGA CCGCGAGTCG ATTCTGATCG TGACGGCCGA CCACGGCGAG
AACCTCGGCT ACGAGTCCGA CGGTTACCTC ATGAACCACA TGAGCAGCCT CTCGGAAGGG
CTGTTACACG TCCCGTTCGA CATCGTGGCC ACCGACGACA GCGCCCTCGA GCTCGCGGTC
GACGATACTA CCCGTCCCGT CGACGTCGAC GGGCTCGCCT CCCACGCCGA CCTCGGCGAC
GCGGTCCGGT CGCTCGCGGG CGAGGAGCCG TTCGATCCGT TCGCCCTCGA GCGTGAGCGT
GCTCGCGCCG AGATCGTCGG CTCCGGCTCC GGCATCCCGG AGGGCGGGGA CGAATCGTAC
TGGGACCGCG GCCAGCGGGT CGTCTACGAG GACGACCGGA AGTACTACCG CGATCAGCTC
GGCGACGAGG CCGTCTACGA CGTCTCCGGA CCGCCGTCGA AACAGGTCGA ACTGCCCGAC
GAGACGGTGC CGGACGGGCT CTTCGAGTCC GCCTTCGGCG ACTGGATCAC CGACGAAGAG
CGCGACGGCC GGGACCACGC CGAGGAGGTC GACGCGGCGA GTCGCGCCCG GCTGGAGGAT
CTGGGATACC TATGA
 
Protein sequence
MTENRRPNIV VLCLDTVRKD VYDRFATRLR ERAAVRFEGM RALGGWSVPS HAGMLTGTVP 
SETGVHAHQR RFDPIDPEDT WIAPLEGQGY ESVCVTSNIY ASPVFGFDRF FDRTVPISPS
RRLPEGMDVQ EHISDRSAEG VEAYADFVRE ALEHDHPLRS LANGVLLKLD DVSRKLPIEK
PTDFGGRAIA RTLEREVAEP DGPVVAFANV MDAHGPHTAF RGLDDSIHGV SADFHSSSFR
DSDVNVVDGL GAYESDIERV RRLYAATVDY LDRVVTDLLD ALAREDDRES ILIVTADHGE
NLGYESDGYL MNHMSSLSEG LLHVPFDIVA TDDSALELAV DDTTRPVDVD GLASHADLGD
AVRSLAGEEP FDPFALERER ARAEIVGSGS GIPEGGDESY WDRGQRVVYE DDRKYYRDQL
GDEAVYDVSG PPSKQVELPD ETVPDGLFES AFGDWITDEE RDGRDHAEEV DAASRARLED
LGYL
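
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction helper; the function names are hypothetical, and only the first and last lines of the gene sequence are embedded here, so the snippet does not attempt to reproduce the reported GC content.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence from this record.
first_line = "ATGACCGAAA ATCGTCGACC GAATATCGTC GTCCTCTGTC TCGACACCGT CAGGAAGGAC"
last_line = "CTGGGATACC TATGA"

print(clean_sequence(first_line)[:3])   # ATG (start codon)
print(clean_sequence(last_line)[-3:])   # TGA (stop codon)
```

To check the whole gene, the full text of the gene sequence block would be passed through `clean_sequence` before calling `gc_fraction`.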