Gene Huta_0536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0536 
Symbol 
ID8382803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp540724 
End bp542145 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content57% 
IMG OID644971598 
Productsulfatase 
Protein accessionYP_003129456 
Protein GI257051623 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGA CGGTACGTGA GAATTCCGAC AATACCCCCA TGGCCGCGCC AAACGTCATC 
TGGATTACGC TCGATAGCGT CCGATCCGAT CACACAACGA TTGGCGGCTA CGAGCGCGAC
ACGACGCCTT ACCTGGCGGA TCTCGCATCG GCCGGCTTCG GCTTCGATAC GTGCATAAGC
CATGGGAATT CCACACATCC CTCGAGTGGT ACGATCCTGA CCGGTCTCTC CCCCTCTCGT
AATACCGTGG GGATCACCGG CGACGTTCTG CCTGATAGTG TCACGACAGT GTCCGAGCGT
TTTTCGGAGG CAGGATACCA CACGGCATGT CTTTCGCGTA ATTCATACGT CAGCTCCGCG
ACAGGGCTCG ACCGTGGATT CGATCGATTC CAGTGGCTCG CCTCGGAGAC GATAACGGAG
CTTCCGCTAT CCACGTTGAT CAAGTATGTC CGGAATATTC GAACGCACTC TGCTGGCCTG
ACACTGGACA CGGCTAAACA CGCTTCACCG TACCTCATGA ACGAGACGGC CAAGCGTTGG
CTTGCGGACA TGCGTACCGA GCAGCCGTTT TTCTTCTATC TGCACTACAA TGAGCCCCAT
CGTCCGTACT ATCCACCGCT ATCCTATATC GACAAGTACA CCGACGACAT CGAAATGAGT
CCACGTGAAG CTGCCGAGTT CGCGCTCGAT GTTCACTATA ATATGGAGAA AATTGTTGCT
AACGGGTGCG AGCTCGATGA TCGGGAGTGG GCGGCACTCC GGGCAATGTA CGACGCAGAG
ATCGCGTACA CCGACGAGAT GATCGAGCGG CTCGTCGATC ACGTGCGATC GCTCCCGCTC
GAGGAAACGG TAATCGTCGT GACAGCGGAC CACGGTGAAC TCTTCGGCGA GTACGGACTC
CTTTCACACA GTTATGTCTT GACGGACGCC GTGACGCGGG TTCCACTCGT CCTCGCTGGC
CTGAACGAGG AACTGACCGT CGGTGAAGAC GATCTACTCC AGCACACAGA TGTCATGCGA
ACGCTGCTGG AGGTGGCCGG TGCCGATACC GATGGCATGC TAGGAGTCGA TCTCCGATCA
GAACAACGGG AGTATGCTGT TTCACAGCGC GGTCCGGTCG AGTTCGATGA GTTCTACGAG
TACAACGAGG ACTTCGACGC CTCCCGATTC CATCTACCGG CTCTGACCGC GCTTCGGACC
GATACGTACC GCTATCAGGA GAGTGAATCG GGATCGGATC TGTTCGCGCT TCCGGATGAA
GAATCTGACG TGACGGAGGA TCGCCCGGAG ATTGCCGCCG AGTTATCCGG GAGCCTCGAG
AAGTGGCTCG ATGAACACGG CCAGCCAGTC GGTGCCGGAG AGCGCGGCGA GTTCTCCGGA
GCCGTCCAAC GACAGTTGCG AGATCTCGGG TACGTGGACT GA
 
Protein sequence
MQKTVRENSD NTPMAAPNVI WITLDSVRSD HTTIGGYERD TTPYLADLAS AGFGFDTCIS 
HGNSTHPSSG TILTGLSPSR NTVGITGDVL PDSVTTVSER FSEAGYHTAC LSRNSYVSSA
TGLDRGFDRF QWLASETITE LPLSTLIKYV RNIRTHSAGL TLDTAKHASP YLMNETAKRW
LADMRTEQPF FFYLHYNEPH RPYYPPLSYI DKYTDDIEMS PREAAEFALD VHYNMEKIVA
NGCELDDREW AALRAMYDAE IAYTDEMIER LVDHVRSLPL EETVIVVTAD HGELFGEYGL
LSHSYVLTDA VTRVPLVLAG LNEELTVGED DLLQHTDVMR TLLEVAGADT DGMLGVDLRS
EQREYAVSQR GPVEFDEFYE YNEDFDASRF HLPALTALRT DTYRYQESES GSDLFALPDE
ESDVTEDRPE IAAELSGSLE KWLDEHGQPV GAGERGEFSG AVQRQLRDLG YVD