Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0538 |
Symbol | |
ID | 8382805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 543656 |
End bp | 545011 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644971600 |
Product | sulfatase |
Protein accession | YP_003129458 |
Protein GI | 257051625 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGG TCCTCCTCGT CACTATCGAT TCTCTCAGGG CCGATCACGT CGGCTATCAC GGGTACGAAC GGGATACGAC ACCAGTACTC GACGGGTATG CCGCCGACGG TAGTCGGTTC ATGAACGCAT TCGCACACGT GGGTGGCACT CGATTTTCCT TTCCGTCGAT CCTGACCGGC GTGACGCCGC TGATGTACGG TGGCCACCAC AGCGTTTCAG AGGAGCAAAC ACTGGTCTCG GAAGTGTTCG ACGATGCCGG GTTTCGAACT GGCGGGTATC ACTCCAATCT CTATATCTCG GCCGAATTCG GATACGATCG CGGCTGGGAC GAGTTCTTCG ATTCCGCGCC GGACGACTCG ACGACAGCTT CGTTCCGCCG CTGGGCCAAG ACCAATCTCC AGAACACTCC GATTTATGGT CTTCTACAGC AGGCGTACGA CTTCATCGAG TCCTCGGCCG GTGTCAACGT CGGTTCCTAT CACGCCCCCG CGGAGGATAT CACCGATAAG GGAATCGAGT TCGTAGACTC GGTAGGATCA GACGAGCCCG CCTTTCTGTG GGTTCACTAT ATGGATGTCC ACCATCCGTT TCTCCCTCCA GCGGAGTATC AACAACAATT CCGCGACGAT GTCGTCAGTG ACCGACAGTC CATCAAGTTA CGCCGGAAGT TCATCGAGGA ACCAGACGCT GTCACGGACG AGGAACTCCA GACGTTCATC GATCTCTACG ACGCGGAGAT TCGGTACAAC GACGCCGAGA TCGGTCGGCT CCTCGAGCAC GTCGAGTCGG AGTGGGGCGA AGACTACCTA CTGGCGTTTA CGGCCGACCA CGGCGATCAC TTCCTCGAAC ACGGATATTT CGGCGGCGCA CGAGCGCTGG ACGTCAAAAC ACACGTTCCG CTGTTCGTCA ATGGATGGGA TGACGACGCC GAATACGACG ACATGGTCGG GCTCGTCGAT GTCCCATCTA CACTCGTCGA CGCTGCTGGA CTCGACATTC CCGATACCTT CCACGGACAT AGCCTCCGGT CGCTCGTTTT CGACGATGAA TGGCCCCGTG AGGACGTAAT CGGTGGCTGG TTCGACGGCG ATGGAAACCA CCTCTGTGTC CGTGAACGCG ACTGGAAACT CATCGAACGA CCCGGAGACA ATGCCGACGA GTTGTACGAT CTGGTTTCTG ACCCCGGCGA ACAACGGAAC GTATTTGGCG ACCATCCCGA CCTGACCGAG CGTCTTCGGG AAAAACTCGA CCGACACAGA CAGCTCGTTC GCTCGACGGA AGACGAAAGC GTCGAGCGCC CGGATATGAA CGAAGACGTC AAAGAACGCC TTCGACGCCT TGGTTACAAG GAATAA
|
Protein sequence | MDKVLLVTID SLRADHVGYH GYERDTTPVL DGYAADGSRF MNAFAHVGGT RFSFPSILTG VTPLMYGGHH SVSEEQTLVS EVFDDAGFRT GGYHSNLYIS AEFGYDRGWD EFFDSAPDDS TTASFRRWAK TNLQNTPIYG LLQQAYDFIE SSAGVNVGSY HAPAEDITDK GIEFVDSVGS DEPAFLWVHY MDVHHPFLPP AEYQQQFRDD VVSDRQSIKL RRKFIEEPDA VTDEELQTFI DLYDAEIRYN DAEIGRLLEH VESEWGEDYL LAFTADHGDH FLEHGYFGGA RALDVKTHVP LFVNGWDDDA EYDDMVGLVD VPSTLVDAAG LDIPDTFHGH SLRSLVFDDE WPREDVIGGW FDGDGNHLCV RERDWKLIER PGDNADELYD LVSDPGEQRN VFGDHPDLTE RLREKLDRHR QLVRSTEDES VERPDMNEDV KERLRRLGYK E
|
| |