Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3492 |
Symbol | |
ID | 8744112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3593257 |
End bp | 3594753 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514073 |
Product | sulfatase |
Protein accession | YP_003405027 |
Protein GI | 284166748 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATA GCCGCCCGAA CGTTCTGTTC GTCCTCACCG ACCAGGAGCG CTACGACTGC ACGGCGCCCG AGGGACCGCC CGTCGAGACG CCGGCGATGG ATCGCCTCTC GAGCGAGGGG ATGCGTTTCT CGCGGGCTTG CACCCCGATC AGCATCTGTA CGAGCGCCCG CGCCTCGCTC ATGACCGGCC TGTTCCCCCA CGGCCACGGG ATGTTGAACA ACAGCCACGA GGCGGACGCG ATCCGGCCGA ACCTGCCGCC CGAGTTACCG ACGTTCTCGG AACTGCTGGC CGAGAACGGG TACGACTGCA GCTACACCGG AAAGTGGCAC GTCGGCCGGG ACCAGACGCC CGAGGACTTC GGCTTCGCCT ATCTCGGCGG CAGCGACAAA CACCACGACG ACATCGACGA GGCGTTCCGG GAGTACCGCG AGGAACGCGG CGTCCCGCCG GGCGAGGTCG ACCTCGAGGA GGTGCTCTAC ACCGGCGACG ACCCGCGCGA TGCGAGCGAG GGAACCTTCG TCGCGGCGAC GACCCCGGTC GATGTCGAGG AGACCCGCGC GTACTTCCTC GCCGAGCGGA CGATCGACGC CATCGAAGCG CACGCCGATG GCGACAGCGG AGAGGGCGAC GGAAACGGCA GCGACCCATT CTTCCACCGC GCGGACTTCT ACGGCCCCCA CCACCCCTAC GTCGTCCCCG AGCCCTACGC CTCGATGTAC GACCCCAACG AGATCGATCC GCCCGAAAGC TACGCCGAGA CGTACGACGG GAAGCCCCAA GTTCACGAGA ACTTCCACTA CTACCGCGGC GCCGACGGCC TCGAGTGGGA CCACTGGGCC GAGGCCACCG CGAAGTACTG GGGGTTCGTC TCGCTGATCG ACGACCAGCT CGAGCGGATC CTCGAGGCGC TCGAGGAGCA CGGACTGGCG GACGAGACGG CCGTCGTCCA CGCCTCGGAT CACGGCGACT TCGTCGGCAA CCACCGCCAG TTCAACAAGG GCCCGCTGAT GTACGACGAC ACCTACCGGA TTCCCCTACA GGTGCGCTGG CCCGGCGTCG CCGAACCCGG AACGACGTGC GAGGTGCCCG TCCACCTCCA CGATCTGGCC GCGACGTTCC TCGAGATGGG CGGCGTCGAC GTTCCGGAGT CGTTCGATTC CCGAAGTCTG GTGCCGCTGC TCGAGACCGG CGACGACCCG GACGCGGTGC CCGACGACTG GCCCGACTCC ACCTTCGCCC AGTATCACGG CGACGAGTTC GGCCTCTACA CCCAGCGGAT GGTCCGCACT GGGCGCTACA AGTACGTCTA CAACGGTCCC GACATCGACG AGCTGTACGA CCTCAAGGCC GATCCCGCCG AATTGCAGAA CCTGATCGAC CACCCGGGAT ACGCGGACGT TCGCGAGGAA ATGCGGGATC GACTCGTCGA CTGGATGCAG GAGACGGACG ATCCGAACCA GGGGTGGGTG CCAGACGTGC TCAGAGACAC GCCGTAA
|
Protein sequence | MADSRPNVLF VLTDQERYDC TAPEGPPVET PAMDRLSSEG MRFSRACTPI SICTSARASL MTGLFPHGHG MLNNSHEADA IRPNLPPELP TFSELLAENG YDCSYTGKWH VGRDQTPEDF GFAYLGGSDK HHDDIDEAFR EYREERGVPP GEVDLEEVLY TGDDPRDASE GTFVAATTPV DVEETRAYFL AERTIDAIEA HADGDSGEGD GNGSDPFFHR ADFYGPHHPY VVPEPYASMY DPNEIDPPES YAETYDGKPQ VHENFHYYRG ADGLEWDHWA EATAKYWGFV SLIDDQLERI LEALEEHGLA DETAVVHASD HGDFVGNHRQ FNKGPLMYDD TYRIPLQVRW PGVAEPGTTC EVPVHLHDLA ATFLEMGGVD VPESFDSRSL VPLLETGDDP DAVPDDWPDS TFAQYHGDEF GLYTQRMVRT GRYKYVYNGP DIDELYDLKA DPAELQNLID HPGYADVREE MRDRLVDWMQ ETDDPNQGWV PDVLRDTP
|
| |