Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2977 |
Symbol | |
ID | 8743595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3062364 |
End bp | 3063473 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646513562 |
Product | sulfatase |
Protein accession | YP_003404518 |
Protein GI | 284166239 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACA CTACCCTGCT AGTTACGGTC GATTCGCTCA GAACCGATCA CGTCCAGTAC ATGCCGGAGA CCCTGGCGTT TCTGGACGAC ACCCACGACG CCGCGTTCGC CACGAGCACC GCAACGCCCG GCAGCTTCCC GGCGATCATC GGCGGGGAGT ATCCGGCCGG CAACGGCCTC GAGGAAGCGG CCAGCGTCGC CCACGAGTTC GACGCTCCCA GCGTCGGGAT CACGACGAAC CACCTGCTCT CTCAGGAGTA CGGCTACGCG GCCGGGTTCG ACTCGTTCAC GTCGCCGAAG GGCGGCGGCG AGTCGCTGAA GGACAAGGGT GCGATCTTGC TCGAGCGCGG CTCGCTTCCC TACAAGGTCG CCAGCTGGGG CTACAACCGC TACCAGCAGC TTCGGAGCTA CGTCGAGGAG ACCGAGAAGT CGTTCAGACC CGCGGACGCT GTCGTAGACC AATTTCTGCG CGAGGTCGAC GACCGCGAGG AGTGGTTCGG CTGGCTGCAC TTCATGGAGC CCCACCACCC GTACGACCCC GACGGTGCGA ACATCGACCG CGCGACGGCC CAGCGGGTCA CCCGCCGCGT CCTCTCGGAT CGGGGCTCCG AGGAGGACGA AGCCCTCGTA CGGGACCTCT ACCGACAGGA GATCGCCGAA CTCGACGCGG CCCTCGAGGC CCTCTGGAAC GCGATCCCCG ACGAGACGCG GGTCGTCTTC TGTGGCGACC ACGGCGAGTT ACTCGGCGAG GACGGACTGT GGGGCCACCC CGGCGAGATG CGCCCCGAAC TGCTGAACGT CCCGTTCGGG ACGCGAAACG CCCCCGACGT CGGCGAGGTC GTCTCCCTGA TCGACGTGCC GACGATCCTG ACCGGCGCCG AACACCGTCA GGGGACGCTC GATCGCGACA CCGCGTTCGC GGCCTACGGA GACCGAAAGG CAGCGATGAC CGCCGACCGC ATCGCGACCG AAGACGGCGT GTATCGGCTC GAGGACGGCG AACCGGTCGA CGACCCCGAT CTCGAGCGCG AACTCGATCG GTTCGATCCC GCCTACGTCG TCAAGGAAGA GGCGCTGCAG GAAGACCTGG AGGATCTGGG CTACGCATGA
|
Protein sequence | MTDTTLLVTV DSLRTDHVQY MPETLAFLDD THDAAFATST ATPGSFPAII GGEYPAGNGL EEAASVAHEF DAPSVGITTN HLLSQEYGYA AGFDSFTSPK GGGESLKDKG AILLERGSLP YKVASWGYNR YQQLRSYVEE TEKSFRPADA VVDQFLREVD DREEWFGWLH FMEPHHPYDP DGANIDRATA QRVTRRVLSD RGSEEDEALV RDLYRQEIAE LDAALEALWN AIPDETRVVF CGDHGELLGE DGLWGHPGEM RPELLNVPFG TRNAPDVGEV VSLIDVPTIL TGAEHRQGTL DRDTAFAAYG DRKAAMTADR IATEDGVYRL EDGEPVDDPD LERELDRFDP AYVVKEEALQ EDLEDLGYA
|
| |