Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0536 |
Symbol | |
ID | 8382803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 540724 |
End bp | 542145 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644971598 |
Product | sulfatase |
Protein accession | YP_003129456 |
Protein GI | 257051623 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAGA CGGTACGTGA GAATTCCGAC AATACCCCCA TGGCCGCGCC AAACGTCATC TGGATTACGC TCGATAGCGT CCGATCCGAT CACACAACGA TTGGCGGCTA CGAGCGCGAC ACGACGCCTT ACCTGGCGGA TCTCGCATCG GCCGGCTTCG GCTTCGATAC GTGCATAAGC CATGGGAATT CCACACATCC CTCGAGTGGT ACGATCCTGA CCGGTCTCTC CCCCTCTCGT AATACCGTGG GGATCACCGG CGACGTTCTG CCTGATAGTG TCACGACAGT GTCCGAGCGT TTTTCGGAGG CAGGATACCA CACGGCATGT CTTTCGCGTA ATTCATACGT CAGCTCCGCG ACAGGGCTCG ACCGTGGATT CGATCGATTC CAGTGGCTCG CCTCGGAGAC GATAACGGAG CTTCCGCTAT CCACGTTGAT CAAGTATGTC CGGAATATTC GAACGCACTC TGCTGGCCTG ACACTGGACA CGGCTAAACA CGCTTCACCG TACCTCATGA ACGAGACGGC CAAGCGTTGG CTTGCGGACA TGCGTACCGA GCAGCCGTTT TTCTTCTATC TGCACTACAA TGAGCCCCAT CGTCCGTACT ATCCACCGCT ATCCTATATC GACAAGTACA CCGACGACAT CGAAATGAGT CCACGTGAAG CTGCCGAGTT CGCGCTCGAT GTTCACTATA ATATGGAGAA AATTGTTGCT AACGGGTGCG AGCTCGATGA TCGGGAGTGG GCGGCACTCC GGGCAATGTA CGACGCAGAG ATCGCGTACA CCGACGAGAT GATCGAGCGG CTCGTCGATC ACGTGCGATC GCTCCCGCTC GAGGAAACGG TAATCGTCGT GACAGCGGAC CACGGTGAAC TCTTCGGCGA GTACGGACTC CTTTCACACA GTTATGTCTT GACGGACGCC GTGACGCGGG TTCCACTCGT CCTCGCTGGC CTGAACGAGG AACTGACCGT CGGTGAAGAC GATCTACTCC AGCACACAGA TGTCATGCGA ACGCTGCTGG AGGTGGCCGG TGCCGATACC GATGGCATGC TAGGAGTCGA TCTCCGATCA GAACAACGGG AGTATGCTGT TTCACAGCGC GGTCCGGTCG AGTTCGATGA GTTCTACGAG TACAACGAGG ACTTCGACGC CTCCCGATTC CATCTACCGG CTCTGACCGC GCTTCGGACC GATACGTACC GCTATCAGGA GAGTGAATCG GGATCGGATC TGTTCGCGCT TCCGGATGAA GAATCTGACG TGACGGAGGA TCGCCCGGAG ATTGCCGCCG AGTTATCCGG GAGCCTCGAG AAGTGGCTCG ATGAACACGG CCAGCCAGTC GGTGCCGGAG AGCGCGGCGA GTTCTCCGGA GCCGTCCAAC GACAGTTGCG AGATCTCGGG TACGTGGACT GA
|
Protein sequence | MQKTVRENSD NTPMAAPNVI WITLDSVRSD HTTIGGYERD TTPYLADLAS AGFGFDTCIS HGNSTHPSSG TILTGLSPSR NTVGITGDVL PDSVTTVSER FSEAGYHTAC LSRNSYVSSA TGLDRGFDRF QWLASETITE LPLSTLIKYV RNIRTHSAGL TLDTAKHASP YLMNETAKRW LADMRTEQPF FFYLHYNEPH RPYYPPLSYI DKYTDDIEMS PREAAEFALD VHYNMEKIVA NGCELDDREW AALRAMYDAE IAYTDEMIER LVDHVRSLPL EETVIVVTAD HGELFGEYGL LSHSYVLTDA VTRVPLVLAG LNEELTVGED DLLQHTDVMR TLLEVAGADT DGMLGVDLRS EQREYAVSQR GPVEFDEFYE YNEDFDASRF HLPALTALRT DTYRYQESES GSDLFALPDE ESDVTEDRPE IAAELSGSLE KWLDEHGQPV GAGERGEFSG AVQRQLRDLG YVD
|
| |