Gene Information |
Locus tag | Hmuk_1210 |
Symbol | |
ID | 8410730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1153328 |
End bp | 1154764 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 645019545 |
Product | sulfatase |
Protein accession | YP_003177042 |
Protein GI | 257387269 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.200453 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACA TATTGTTAAT TGTGATGGAC AGTGTACGAG CGAAGAATAC CTCATTACAC GGATATAAGA GAGAGACAAC TCCTTATCTT AAACAATTCG CTAATGGAGC AACTACCTAT CAAGAGGCTA AAGCTCCTGG AGTGGATAGT GCGACATCTC ATACATCCAT ATTTACTGGT TACGATGTCC CACAACATCA AGTTGTATCG GGTGAAGATA CACTCAAAAC GGGCCATTCC GCGTGGGAAT ATCTTACGAA GGATGGCTTT GAGACTGCGC TTTTTACGCC AAACCCATTT TTTTCCGATA GAGATATCGG TCTTGGAGAA GGGTTCGATA AGAAGTTCTT CGGAAAAGAC TACGACTTGC CGTTTAGGCG GGGACTTAAT CCAAGGAAAT ACACAAATAA AGGTGACGAG AGATGGATAG ATTTCCTATA TGATTCTATA AGAGAGATTC GGCCACTATC TTCGATAGGT AACGGATTAG TGATGAAAGC AGATGAAATT TCCCCACTTC TTATTGAACC GCACTATAAT GATCGGCCAG CGAGTATCTT TGCATCTGAA TTTACTAAAT GGGTTAAGGA CCAATCGGAA CCGTGGGCCG CTTGCATAAA CTTCATGGAT GCACATACAC CGTTTTTCCC AAAAAGAAAA TTTGATAATT GGTCTGAAAG ATACCACTGG AAGATTCAAG ACAAGATAAA TCCGCCATGG GATTTTATTT CTAAGGAAGA ATCCTCCTGG AAATTTGAAG CACTAACTCC CCTCTATGAT GCCAGCATAA GTCAGATAGA CAGCTCAATT GGAAGAATAA TAAATCATCT TAAAAAGGAG GGAAAGTATG ATGATACCCT TATTATCGTG ACTTCCGATC ATGGGGAGGC ATTCAACGAT CAAAGCCGAG TCAAACCCGA GCTTAACCTA GTTCACCATA GTATCGGTAT TCATGAGAGT TTACTTCACG TTCCACTTGT TATTAAATAC CCACACCAAA ATGAGGAAAG ACACGTACAC AAGCCAGTAT CATTGACTGA CTTATATAAA TTAATTAAAT ATCCCGGAGA CGATATTGGG ATGGTTGATA GGCTACTAGA TGAATCACCA ATAATTTCAG CGATGCCTAG CTATCAGACT CGGTCTGATG GTATGATTGA CAGGCTAGAA ACACACGCAC CAAACGCATC AACTTATAAA GGTTGGGCAT ACGCAATCTA TGAGCAAAAT GGTGATGGAA TTAGAAAACA TATCCAATGG GAAAACAGAT CTGCTCTTGT AAAAATAGAT GGCACGTATT CTTTTCGTGA TGGACATCCA GACAAACAGA TTCTCGAACA TGCTACCAAA CTTCAAGATG CTGATGTGGC GAGAAAACAA ACTAACGAAG TTACTGCGGA CATCGAATCA CGTTTAACAA ACCTTGGATA CAGATAA
Protein sequence | MTNILLIVMD SVRAKNTSLH GYKRETTPYL KQFANGATTY QEAKAPGVDS ATSHTSIFTG YDVPQHQVVS GEDTLKTGHS AWEYLTKDGF ETALFTPNPF FSDRDIGLGE GFDKKFFGKD YDLPFRRGLN PRKYTNKGDE RWIDFLYDSI REIRPLSSIG NGLVMKADEI SPLLIEPHYN DRPASIFASE FTKWVKDQSE PWAACINFMD AHTPFFPKRK FDNWSERYHW KIQDKINPPW DFISKEESSW KFEALTPLYD ASISQIDSSI GRIINHLKKE GKYDDTLIIV TSDHGEAFND QSRVKPELNL VHHSIGIHES LLHVPLVIKY PHQNEERHVH KPVSLTDLYK LIKYPGDDIG MVDRLLDESP IISAMPSYQT RSDGMIDRLE THAPNASTYK GWAYAIYEQN GDGIRKHIQW ENRSALVKID GTYSFRDGHP DKQILEHATK LQDADVARKQ TNEVTADIES RLTNLGYR
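
The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is only a minimal sketch: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The helper names (`gene_length`, `protein_length`, `strip_blocks`) are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1153328
END_BP = 1154764


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


def strip_blocks(seq: str) -> str:
    """Join a sequence printed in space-separated 10-character blocks."""
    return "".join(seq.split())


length = gene_length(START_BP, END_BP)
residues = protein_length(length)
print(length, residues)  # 1437 478
```

Both values agree with the Gene Length (1437 bp) and Protein Length (478 aa) fields. `strip_blocks` can be applied to the Gene sequence or Protein sequence field before passing it to other tools.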