Gene Information |
Locus tag | Huta_0533 |
Symbol | |
ID | 8382800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 536676 |
End bp | 538136 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644971595 |
Product | sulfatase |
Protein accession | YP_003129453 |
Protein GI | 257051620 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
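
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the three-base stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 536676
end_bp = 538136

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one 3-base stop codon,
# which does not contribute an amino acid.
stop_codon_bp = 3
protein_length_aa = (gene_length_bp - stop_codon_bp) // 3

print(gene_length_bp, "bp")     # compare with the "Gene Length" field
print(protein_length_aa, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed gene length of 1461 bp and protein length of 486 aa.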
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.270273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGATC CCAACGTTCT GCTGGTTATT CTAGATAGCG TTCGGGCACG GAACTGCAGC GTTTACGGCC ATTACAACGA GACGACTCCG TTCCTTGCGG AGTTTGCCCG TGAAGCGACA CGCTACGAGC AAGCACGCTC GCCCGGTGCC CGTAGCGTGA CGAGCCATAC GAGTATTTTC TCCGGCCTGC ACGTCGAGGA GCACAACGTG ACGGCTGCGA AGTACCGACT GGACCCGTCG CGTTCGGTCT TCGACCGATT GCGGCGGGAG GGCTATTCGA CGGGTGTGTT CTCGGCGAAC AACTGGATAA CGGACGTCGA TGTCGGACTG GCCGACGGCT TCGAGCACGT CGTCGGTGCA CGGAACGCGA TGTTTCCCGA TGCATTGAAT CCGGAATCGT TCGTCTCACA CCACGGGAAA GGCCAATACG GTGCGTTTCT ACGGGCCGCA CTCACCAGTG GGATGCCGTT CCGATCACTT GCAAACGGCC TCAGTACGAA GCTTTCGACG GATTATCCGA CACTCGTCCC GGAGTTCATG AAGGCGTCGA CCCCGGCGCG GACCTATGTC GATGCGTTTC TGGAGTGGCA CCAGGAGCAG TCCGAACCGT GGGCCGCCTG CGTCAATCTA ATGGACGCCC ATATCCCATA CGAACCAGAC GAAGAACATG ATCTGTGGGG TGGTGAGCGG TTGCAGACGC TACACGATTC GTTCGACGAT CACAAGTGGG ACTTCTCGGC CGGTCGACGT CCCTGGTGGC AGAAACGTGC TGTCGAAGCC CTATACGACG GGGCGATCAG AGAGATGGAC GCCGAGCTAC GGCGACTGAT CGGCAAACTT CGGGACCGAG GTGCGCTGGA CGACACACTC CTGGTAATCA CCAGTGATCA CGGCGAGGGG TTCGGCGAGC CGAGCGATCT CCGGCCGGGT ATTCGGATTG CCGAACATGG GGTCGGGATC CACGACGTCC TGTTACACGT CCCGCTACTC GTTTCCTTCC CTGGACAATC CGATGGGGTG TCGGTCGACT CACCCGCGTC TCTCACGGAG TTCCCGCGTG TGGTCGAGCA GGTTCGCGAC AGCGAGGGGA GTCCCGACGC GTTCTGCCCC GATGGACCGG TGATCGCCTC CGCTGTCGGA CTCGACCAAC CGCTTCAGGA ACGGGCAAGT GAGTATGTAG ATGATCTTTC GCCGTGGCTG TCTACGTCAC GAGCGGCCTT CGAGGAGATA AGCGACCAGT GCGCGGTCAG AAAGGCATGC GTCGATCGCG ATCGATCCGC GACGATTCAC GTTCGAAACG CACAAACGTC GTTTCCTGTC GGGAACGATG GCGACGGCAG AGACGGTGTC GATGCCGCGT TCGAGGGGAT CTCCGACGCC GGTGTGCGAG ACGAGGGCGG GGGCGTTGAC GATATCGGCG ACGGTACCTA TCAGCGCCTC GAGGACCTGG GCTACGTGTA G
|
Protein sequence | MADPNVLLVI LDSVRARNCS VYGHYNETTP FLAEFAREAT RYEQARSPGA RSVTSHTSIF SGLHVEEHNV TAAKYRLDPS RSVFDRLRRE GYSTGVFSAN NWITDVDVGL ADGFEHVVGA RNAMFPDALN PESFVSHHGK GQYGAFLRAA LTSGMPFRSL ANGLSTKLST DYPTLVPEFM KASTPARTYV DAFLEWHQEQ SEPWAACVNL MDAHIPYEPD EEHDLWGGER LQTLHDSFDD HKWDFSAGRR PWWQKRAVEA LYDGAIREMD AELRRLIGKL RDRGALDDTL LVITSDHGEG FGEPSDLRPG IRIAEHGVGI HDVLLHVPLL VSFPGQSDGV SVDSPASLTE FPRVVEQVRD SEGSPDAFCP DGPVIASAVG LDQPLQERAS EYVDDLSPWL STSRAAFEEI SDQCAVRKAC VDRDRSATIH VRNAQTSFPV GNDGDGRDGV DAAFEGISDA GVRDEGGGVD DIGDGTYQRL EDLGYV
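
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 bases of the gene. This is a rough sketch rather than the pipeline that produced the record: the `translate` helper and the codon table construction are illustrative. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which start codons are permitted), and it assumes the listed gene sequence is already given in coding orientation, since it begins with ATG and ends with TAG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
gene_prefix = "ATGGCAGATC CCAACGTTCT GCTGGTTATT"
print(translate(gene_prefix))
```

The result can be compared with the first block of the protein sequence field (MADPNVLLVI). Passing the full gene sequence to the same helper should reproduce the whole protein if the assumptions above hold.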