Gene Htur_3519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3519 
Symbol 
ID8744139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3623017 
End bp3624654 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content68% 
IMG OID646514100 
Productsulfatase 
Protein accessionYP_003405054 
Protein GI284166775 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACG CTCAGCAATC CAACGTCCTC TTTGTCGTGC TGGACACGGT CCGGAAGGAC 
CGACTGGGTC CGTACGGCTA CGAACGGGAA ACGACGCCCG AACTCTCCGC GTTCGCCGAG
GAGGCGACCG TCTTCGAGTC GGCCGTCGCG CCCGCGCCGT GGACGCTGCC GGTCCACGCC
TCGCTGTTTA CCGGCCGGTA TCCGAGCCAG CACGGGGCCG ATCAGGGGAG TCCGTACCTC
GAAGGCGACG CCACCCTCGC GACGGCCCTC TCGGCGGCCG GCTACGACAC GGCGTGTTAC
TCCTCGAACG CCTGGATCAC CCCCTACACC GGCCTCACCG AGGGGTTCGA CGCGCAGGAC
TCGTTCTTCG AGGTCCTCCC CGGCGACGTC CTCTCGGGGC CGCTGGCCAG CGCCTGGCAG
ACCGTCAACG ACAACGACTA CCTCCGCGAT CTGGCGTCGA AACTCGTCAG ACTCGGCGCG
ATGGCCCACG AGAAACTCGC CAGCGGCGAG GGCGCCGACA CGAAGACGCC GTCGGTCATC
GACCGGACGA AGTCCTTTAT CGACGACAGC GAGAGCGACG AGGGCTGGTT CGCGTTCGTC
AACCTGATGG ACGCCCACCT GCCCTACTAC CCGCCCGAGG AGTATCGCGA GGAGTTCGCT
CCCGGCGTCG ACCCCAGCGA GGTCTGCCAG AACTCGAAGG AGTACAACTC GGGCGCGCGC
GACATCGACG ACGAGGAGTG GGACGACATC CGGAGCCTGT ACGACGCCGA GATCGCCCAC
ATGGACGCCG AACTCGGCCG CTTGTTCGAC TGGCTGCGCG AGACCGGCCA GTGGGAGGAG
ACGACCGTCG TCGTCTGCGC CGATCACGGC GAACTCCACG GCGAACACGA CCTCTACGGC
CACGAGTTCG CCCTCTACGA CCAGTTGATC AACGTCCCGC TGCTGGTCAA ACACCCCGCC
CTCGAGGCCG ACCGGCGCGA CGACCTCGTC GAGTTGCTCG ACTGCTATCA CACGGTCCTC
GAGGCGCTGG ACGTCGATCC CGACGACGCG CTCGCGCCGG CGGACGACGA CATCTCCGTC
ACCGGTCGCG ATCCGACGCG GTCGCTCCTG TCCGACGAGT ACCGCGCCTT CGAGGGGGTC
TCGGAGCCGG ATCCCGGCCA GCAGGCGGTG CTCGACGCCG AGGGTGGCGA GGCGTCCGAC
GACAGCGAGG GACGAAGTCC TTCGAGCGGC CGAACGCAGT CCAATGACGA CTACGCGTTC
GTCGAGTACG CCCAGCCGGT GATCGAACTC CATCACTTAG AAGAGAAGGC CAGCGAGGCA
GGGATCGAAC TGCCCGACGA TCACCGGGCC TACTCCCGCC TGCGCGCGGC CCGCAGCACC
GACGCGAAGT ACGCCCGGGC CGACCGCATC CCCGACGAGG GCTACCGCCT CGACGAGGAT
CCCGCGGAAT CGACGCCGGT CGATCCGGTC GACGACGGGG TCGTCGCCGA CACTGAACGC
GCGCTCGCCC GCTTCGAGCA GGCCGCCGGC GGCGCGTGGA TCGATCCCAG CGAGACGGAC
GCCGAGGACG CCGACGCGTT AGCCGAGGCC GACGAGGAGA CTCGCGACCG CCTGCGCGAA
CTCGGCTACC TCGAGTAA
 
Protein sequence
MDDAQQSNVL FVVLDTVRKD RLGPYGYERE TTPELSAFAE EATVFESAVA PAPWTLPVHA 
SLFTGRYPSQ HGADQGSPYL EGDATLATAL SAAGYDTACY SSNAWITPYT GLTEGFDAQD
SFFEVLPGDV LSGPLASAWQ TVNDNDYLRD LASKLVRLGA MAHEKLASGE GADTKTPSVI
DRTKSFIDDS ESDEGWFAFV NLMDAHLPYY PPEEYREEFA PGVDPSEVCQ NSKEYNSGAR
DIDDEEWDDI RSLYDAEIAH MDAELGRLFD WLRETGQWEE TTVVVCADHG ELHGEHDLYG
HEFALYDQLI NVPLLVKHPA LEADRRDDLV ELLDCYHTVL EALDVDPDDA LAPADDDISV
TGRDPTRSLL SDEYRAFEGV SEPDPGQQAV LDAEGGEASD DSEGRSPSSG RTQSNDDYAF
VEYAQPVIEL HHLEEKASEA GIELPDDHRA YSRLRAARST DAKYARADRI PDEGYRLDED
PAESTPVDPV DDGVVADTER ALARFEQAAG GAWIDPSETD AEDADALAEA DEETRDRLRE
LGYLE