Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1453 |
Symbol | |
ID | 8410974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1377273 |
End bp | 1378625 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645019783 |
Product | sulfatase |
Protein accession | YP_003177279 |
Protein GI | 257387506 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCA ATTCTAACAT CATACTCATA ACCGTGGATT CACTGCGGTA TGATACTATA TTTTCGGACG AAGCGGTATC AGCTCCGACA ATTGAACAAT TAATTGAAGA AGGGCTAACG TTCAAGCGAG CGTATGCGAA CGGTACCTCA ACGGGAATGT CTTTTCCATC TATTTTTTCA GGGATGTATC CCTGGGACTA CTATGGGTCT TATTATTCCC CCGATCGTCC ACATCTCATT GAAGAGTTCA AAAACACGAA TTATGGAACA GCAGCGTTCC ACTCAAACCC CCATCTAAGT GCCTCATTCG GATATGACCG TGGTTTCGAC ATTTTCTCCG AGGGTTCTGA TTCACCGTCT ACAGTCTCGA AATTACGTAA GAAGGCTGGG AATAACATAC CGAAAGATAG CATTCTCTAT GATGTTTTGC GGAAAGCTTG GAAGAATCTA GAAAAAGCTA CTGGAACTGG TGTCGGGACT CCTTACGTGG ATGGAACCGA ACTCAATGAG TACGTATTCG ACTGGTTAGA ATCCGCTTCA GCTCCAGTAT TTGGGTGGGT TCACTATATG GACGTACACC ATCCGTATCT GCCCCACAAA AATACGGTTA GTCACGATAT TGGAGAAAAA GAAGCGATTC GACTTCGGCA AAAATTCATC CATTCTCCAG ATGACTTGGT GAATAGCGAG ATTGAAATCC TGCGTCAATT GTATCGGGGA GAAGTGGAAT ACTTTGATCG ATGCCTCAAT TCGCTCCTTT CCAAGGTGCA GTCTGAGCTC GGCTTCGACA ATACGGTCCT TATCTTGGTG TCCGATCACG GTGAAGCATT TGGGGAAAAT GATAATTACG GTCACGGGGG CGATGCACTC GGTGACGAAG TCACCCGGGT CCCGCTTATT ATCCGCGGAC CGAACATAGA GCCTAGCAAA GTAGAGTCAC CCGTGTCGTG CGTTGATATT TATCCGACAA TTACAGATAT CAGGGGTGAG TCTGCACCGA GTAAATGTCG AGGGAAGTCG TTACTCGCAG TAGATGAAAA TGAACGACAG GAAAAGGACG AGAAACAATA TGTATTCGCA CACGCCAACG CGCCGTCAGA TGGACAGGCT ATGATCTGTG ATGGAAGGAG AAAGCTCATT CACGATCGAG GTGAGAATAC AGATACCTTG TATGACTTAT GTGACGGACA AGAAAAAGAA CGGATCAACT ATCGGGCGGA CCATGTGAAG GATCTAGTTG ACGAAATCGA AACGCACATA TCCAGTATAA AAAGCAGCGA TGACTCTGCA ATTCAACCTG ATATAGACGA TGAAATGAAA AATAAGTTAC GGGATCTAGG GTATATAGAT TAA
|
Protein sequence | MTGNSNIILI TVDSLRYDTI FSDEAVSAPT IEQLIEEGLT FKRAYANGTS TGMSFPSIFS GMYPWDYYGS YYSPDRPHLI EEFKNTNYGT AAFHSNPHLS ASFGYDRGFD IFSEGSDSPS TVSKLRKKAG NNIPKDSILY DVLRKAWKNL EKATGTGVGT PYVDGTELNE YVFDWLESAS APVFGWVHYM DVHHPYLPHK NTVSHDIGEK EAIRLRQKFI HSPDDLVNSE IEILRQLYRG EVEYFDRCLN SLLSKVQSEL GFDNTVLILV SDHGEAFGEN DNYGHGGDAL GDEVTRVPLI IRGPNIEPSK VESPVSCVDI YPTITDIRGE SAPSKCRGKS LLAVDENERQ EKDEKQYVFA HANAPSDGQA MICDGRRKLI HDRGENTDTL YDLCDGQEKE RINYRADHVK DLVDEIETHI SSIKSSDDSA IQPDIDDEMK NKLRDLGYID
|
| |