Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1626 |
Symbol | |
ID | 8411149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1546042 |
End bp | 1547472 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645019953 |
Product | sulfatase |
Protein accession | YP_003177447 |
Protein GI | 257387674 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.773199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.470838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCCG AAACCGGCGT CTCGAACGTC GTCCTCGTCA CGGTCGACTC GCTCCGTGCG GACGCCATCG GTCCGTACGA CAGCGACCGC CACACACCCG TCATGGACGA ACTCGCCCAG CGGGGGACGG TCTTCGAGCG CGCCTTCGCC AACGGAAACT GGACTCCCTT CTCGTTTCCC TCGATGCTGG CCTCCTCGCC GGTGTTCGCC GACGACGGCC AGATCGGCGT CACGGGCGTC GAGACGCTCG CGGAGACGCT CGCGGACGCG GGCTTCGACA CCGGCGGTTT CAACGCCGCC AACGGATTCC TGACCGAACA CTGGGGATAC GACGACGGCT TCGACGAGTT CGACTCCTTC GTCGCCAGCG TGGGCTCCAG CATCTACAGT CGCTATCTCG CGACCCATCC GACCGTGGAG GCCTGGCTCC AGCTGGCGAC CTCGCCGTTT CGCCGCGCCC GCTCGTGGCT CTCGTCGGCG GACAGCCACC GTCCGTTCCT CGACACCTCG CGGATGTTCG ACGTGGAACA GGGGGCGACC TCGTTCCTCG AATCGGCCTC GGAGCCGTTC TTCCTGTGGA TCCACTACAT GGACGCCCAC ACGCCGTACG TTCCGGCCCC GCGGTACATC CGGGAGGTGT CTGCCGACCG CTTCGGGACC CACCGGATGG TCCTGGCGCA CCTCCACGCC GGGCTGGGCT GGGAGGTCGG CGACCGGACG CTTGGCAACC TCCGGACGCT CTATCAGAGC GCCGTCCGGC AGGTCGACGA CAGCCTCGGA CGAGTGCTCG ACGCGCTGTC GGAACACGGC CACGACGACG ACACCGCCGT CGTACTGGCC GGCGACCACG GCGAGGAGTT CCAGGACCAC GGCCACCTGG CACACTACCC GAAGCTGTAC GACGAGCTGA TCCGCGTGCC GCTGATCGTC GACGTACCGG GCGCAGAGAG TCGGCGCGTC GAGAGACAGG TCGGTCTCGA CTCGCTCCCA CCGACGGTGG CCGACCTCGC GGGCGTCTCT CCGCCAGCGG AGTGGCGCGG GGACTCGCTC GCCCCCGCCG TCCTCGACGG CGAGGAACCG GCCGACGAGC CGGTCGTCTC GGTCACGGTC AGGGGCGAGG ACGTCACCGA CCAGCCCATC CCGCGGTCGC TCGACGACGG CGACCTCCTC GTCAGCGTCC GCGACCGCGA CTGGAGTTAC ATCGAGAACG TCGACACCGG CGACCGGGAG CTGTACCACC GGCCCTCGGA CCCCGGCCAA CAGGACGACC GCTCCGACGG TCCCGACGCC GAGGCCAGGG CCGTGATCGA GGCGTTCGAG CCCCTCGTCG ACGCCCACGC AGACCTGCTG CACGCGGCAG AGCAAAGCGA GATCGACGAC GAGATGGACG AGGATCTGGA CGCCAGACTG GAGGCGCTCG GCTACAAGTA G
|
Protein sequence | MHSETGVSNV VLVTVDSLRA DAIGPYDSDR HTPVMDELAQ RGTVFERAFA NGNWTPFSFP SMLASSPVFA DDGQIGVTGV ETLAETLADA GFDTGGFNAA NGFLTEHWGY DDGFDEFDSF VASVGSSIYS RYLATHPTVE AWLQLATSPF RRARSWLSSA DSHRPFLDTS RMFDVEQGAT SFLESASEPF FLWIHYMDAH TPYVPAPRYI REVSADRFGT HRMVLAHLHA GLGWEVGDRT LGNLRTLYQS AVRQVDDSLG RVLDALSEHG HDDDTAVVLA GDHGEEFQDH GHLAHYPKLY DELIRVPLIV DVPGAESRRV ERQVGLDSLP PTVADLAGVS PPAEWRGDSL APAVLDGEEP ADEPVVSVTV RGEDVTDQPI PRSLDDGDLL VSVRDRDWSY IENVDTGDRE LYHRPSDPGQ QDDRSDGPDA EARAVIEAFE PLVDAHADLL HAAEQSEIDD EMDEDLDARL EALGYK
|
| |