Gene Huta_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0533 
Symbol 
ID8382800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp536676 
End bp538136 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID644971595 
Productsulfatase 
Protein accessionYP_003129453 
Protein GI257051620 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.270273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATC CCAACGTTCT GCTGGTTATT CTAGATAGCG TTCGGGCACG GAACTGCAGC 
GTTTACGGCC ATTACAACGA GACGACTCCG TTCCTTGCGG AGTTTGCCCG TGAAGCGACA
CGCTACGAGC AAGCACGCTC GCCCGGTGCC CGTAGCGTGA CGAGCCATAC GAGTATTTTC
TCCGGCCTGC ACGTCGAGGA GCACAACGTG ACGGCTGCGA AGTACCGACT GGACCCGTCG
CGTTCGGTCT TCGACCGATT GCGGCGGGAG GGCTATTCGA CGGGTGTGTT CTCGGCGAAC
AACTGGATAA CGGACGTCGA TGTCGGACTG GCCGACGGCT TCGAGCACGT CGTCGGTGCA
CGGAACGCGA TGTTTCCCGA TGCATTGAAT CCGGAATCGT TCGTCTCACA CCACGGGAAA
GGCCAATACG GTGCGTTTCT ACGGGCCGCA CTCACCAGTG GGATGCCGTT CCGATCACTT
GCAAACGGCC TCAGTACGAA GCTTTCGACG GATTATCCGA CACTCGTCCC GGAGTTCATG
AAGGCGTCGA CCCCGGCGCG GACCTATGTC GATGCGTTTC TGGAGTGGCA CCAGGAGCAG
TCCGAACCGT GGGCCGCCTG CGTCAATCTA ATGGACGCCC ATATCCCATA CGAACCAGAC
GAAGAACATG ATCTGTGGGG TGGTGAGCGG TTGCAGACGC TACACGATTC GTTCGACGAT
CACAAGTGGG ACTTCTCGGC CGGTCGACGT CCCTGGTGGC AGAAACGTGC TGTCGAAGCC
CTATACGACG GGGCGATCAG AGAGATGGAC GCCGAGCTAC GGCGACTGAT CGGCAAACTT
CGGGACCGAG GTGCGCTGGA CGACACACTC CTGGTAATCA CCAGTGATCA CGGCGAGGGG
TTCGGCGAGC CGAGCGATCT CCGGCCGGGT ATTCGGATTG CCGAACATGG GGTCGGGATC
CACGACGTCC TGTTACACGT CCCGCTACTC GTTTCCTTCC CTGGACAATC CGATGGGGTG
TCGGTCGACT CACCCGCGTC TCTCACGGAG TTCCCGCGTG TGGTCGAGCA GGTTCGCGAC
AGCGAGGGGA GTCCCGACGC GTTCTGCCCC GATGGACCGG TGATCGCCTC CGCTGTCGGA
CTCGACCAAC CGCTTCAGGA ACGGGCAAGT GAGTATGTAG ATGATCTTTC GCCGTGGCTG
TCTACGTCAC GAGCGGCCTT CGAGGAGATA AGCGACCAGT GCGCGGTCAG AAAGGCATGC
GTCGATCGCG ATCGATCCGC GACGATTCAC GTTCGAAACG CACAAACGTC GTTTCCTGTC
GGGAACGATG GCGACGGCAG AGACGGTGTC GATGCCGCGT TCGAGGGGAT CTCCGACGCC
GGTGTGCGAG ACGAGGGCGG GGGCGTTGAC GATATCGGCG ACGGTACCTA TCAGCGCCTC
GAGGACCTGG GCTACGTGTA G
 
Protein sequence
MADPNVLLVI LDSVRARNCS VYGHYNETTP FLAEFAREAT RYEQARSPGA RSVTSHTSIF 
SGLHVEEHNV TAAKYRLDPS RSVFDRLRRE GYSTGVFSAN NWITDVDVGL ADGFEHVVGA
RNAMFPDALN PESFVSHHGK GQYGAFLRAA LTSGMPFRSL ANGLSTKLST DYPTLVPEFM
KASTPARTYV DAFLEWHQEQ SEPWAACVNL MDAHIPYEPD EEHDLWGGER LQTLHDSFDD
HKWDFSAGRR PWWQKRAVEA LYDGAIREMD AELRRLIGKL RDRGALDDTL LVITSDHGEG
FGEPSDLRPG IRIAEHGVGI HDVLLHVPLL VSFPGQSDGV SVDSPASLTE FPRVVEQVRD
SEGSPDAFCP DGPVIASAVG LDQPLQERAS EYVDDLSPWL STSRAAFEEI SDQCAVRKAC
VDRDRSATIH VRNAQTSFPV GNDGDGRDGV DAAFEGISDA GVRDEGGGVD DIGDGTYQRL
EDLGYV