Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0637 |
Symbol | |
ID | 8410144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 597064 |
End bp | 598437 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645018966 |
Product | sulfatase |
Protein accession | YP_003176476 |
Protein GI | 257386703 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATCGC CGAACGTCTT GCTCGTCGTG CTCGATGCGA CCAGGAAGGA CCACCTCTCG TGTTACGGTT ACGACCGGCC GACGACGCCG GAGCTGGACG CGGTGGCCCA GGCCGGGACC CGCTACGAGC AGGCGATCGC GCCGGCACCG TGGACGCCGC CCTCCCACGC GTCGATGTTC ACGGGTCGAT ATCCGTCCGG GCACGGGAGT TTCGGCACGC AGCCGCTGGG AGAGTACGAC GGGGCGACCG TCGCCGAGTT GCTCTCGGCG GCGGGGTACG CGACGTTCGG GTTCAGCAAC TCCCACCACA CCAGCATCGA GCAGGAGTTC GACCGCGGAT TCGACTACTA CCACGACATC CTCGCGTTGC CGCGGTTCAT GGACACGATG TACGAGCCCA GCCTGGATTT CCTCCGGTTT CTCCCGCGGT ACTTCCGGGA CGGCTACGAC ATCTCGGACT TCCAGCGGCG CAAGCTCGAA ACGCAGGTTC GGCGAGCCGA CGGCCCCTTC TTCGGGTTCA TCAACTTCAA CGCGACACAC GCGCCCTACA GGCCACCCGA GGAGTTCAGA GCGCCGTTCG AGGAGCGCTT CGACGACTGG GACGCGGTCG ACGAGACGGC GGCCCGGAAG GTCGGCGACG ACGAGGGATA CGAGTACATC CTGGGCGACG TGACGATGTC GCCGACCGAG TGGGAGCTCG TGGAGTGCTG GTACGACGCC GAGATCAGGT ACGTGGACGC GCTGCTGGGC GAGCTGTTCG ACTGTCTGCG ACGGCAGGGG GTCTACGACG ACACGCTCGT CGTCGTCACC GCGGACCACG GCGAGCACTT CGGCGAGCAC GGGCTGGCCT ACCACCAGTT TTCGCTGTTC GAGGAGCTGC TCAACGTCCC GCTGCTGGTC AAGTGGCCCG AGGGCGACCG GCCGTCGCCG GCTCCCGGGA CGGTGTCGGA CCGGCTCGTG TCGCTGGTCG ATCTCGTCCC GACGATCTGT GAGTGGGCCG GCGTCGCGGT GCCAGACGAG GTCGACGGCC GGGCACTGAC CGGCGACGAC GACCGCGACG CGGTCTTCGC GGAGTACGAC CGGCCGTATC CGCCGCTGCG CGAGCGGCTC CAGCAGTACG ACAGCTTCGA AGCCTACGAC AGGGGCTTGC AGGCGGTCCG GACCGAGACG CACAAGCTGA TCCGACCGAC AGTCGGCGAA GCGACGCTGT ATCGCCTGAC CGGCGACGGC GAGGTCGAGG TCTCCGACGA CGAGCGCGAG GCGGCACTGG CCCAGCGTCT GGACGAGACG CTCGAACCGC TCCCCGATAC GAGTCGAACC ACGGAGCTGG ACGATCACGT CAGCGATCAC CTGGAGAAGA TGGGGTACCT GTGA
|
Protein sequence | MGSPNVLLVV LDATRKDHLS CYGYDRPTTP ELDAVAQAGT RYEQAIAPAP WTPPSHASMF TGRYPSGHGS FGTQPLGEYD GATVAELLSA AGYATFGFSN SHHTSIEQEF DRGFDYYHDI LALPRFMDTM YEPSLDFLRF LPRYFRDGYD ISDFQRRKLE TQVRRADGPF FGFINFNATH APYRPPEEFR APFEERFDDW DAVDETAARK VGDDEGYEYI LGDVTMSPTE WELVECWYDA EIRYVDALLG ELFDCLRRQG VYDDTLVVVT ADHGEHFGEH GLAYHQFSLF EELLNVPLLV KWPEGDRPSP APGTVSDRLV SLVDLVPTIC EWAGVAVPDE VDGRALTGDD DRDAVFAEYD RPYPPLRERL QQYDSFEAYD RGLQAVRTET HKLIRPTVGE ATLYRLTGDG EVEVSDDERE AALAQRLDET LEPLPDTSRT TELDDHVSDH LEKMGYL
|
| |