Gene Huta_0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0475 
Symbol 
ID8382742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp472034 
End bp473338 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content61% 
IMG OID644971537 
Productsulfatase 
Protein accessionYP_003129395 
Protein GI257051562 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.501128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA CCCGCCCAAA CCTCGTCCTC GTCTCGATCG ACAGTCTCAG GGCCGATCAT 
TGCGGATTCC TGGGTGACGA CCGCGGACTC ACGCCGACGA TGGACGAACT CGCCGACGAT
GGCGTCACGT TCGAGACAGC CATCGCACCC GGACCCCAAA CCTTCTCGTC GATGCCGGCC
GTGTTTACCA GCCATCATCG GCCGACAGGC GATCTGGAAA CGTATCCTGG CGAGACAAAC
TGGAAGCGTC GCCTTGCCGC GATCGACGGG CACCTCCGTC GGCATCCATC GCTTCCCGAG
CGGTTGACGG AACTGGGCTA CTCGACGGCC GGTTTCAGCC CCAACCTCTG GGCGTCGGCA
GCCTCGGGGT TCGATCGGGG GTTCGATTAC TTTGCCGATC TCGCCGGTGA GCCAGCCGAC
AGTAGACTCC ATACGTTGCT GAGCCGGACG CCAGGAGTCG ACGAGACGAG CAAGCCCGTC
GAGCTAACGC TTGACATGCT TTCCGGCAAC TCGTTTTTCA CCCGGTGGCA GCAGTTCTAC
GACGAACTCG ACGCGGTTCG CCGTCGCCTC TCGGAGCCGT ACTTTCTGTG GGTCTTCCTG
CTGGACACCC ACTTTCCGTT CCTGCCCACG CGCACATACC GTCAGGAACA GTCACTGCTT
CGAATGTATC TCAACACGGC CCGGACCGAG AAAGTCATGC GGGGACACGC CGAAACAGTG
TCGACTCGCG CCCGTGAGTC GATGCAGCGA AGCTATCGGG ACACAGTTCG ATCGGTCGAC
GGCTTCCTCC AACGAATCCG CTCGGATCTA GCGTCGGACG ATCCGGTGAT GATAGTCCAC
TCGGATCACG GCGAATCGTT CAACGAGCAC GACAACTACG GTCACCATCA TCGGGAACTG
TACGAGGAGA ACATACACGT TCCCTACGTC GTCTCGAACG CCGGCACGAC GGGGACGATC
ACCGAACCGA CCTCGCTCGC GACCATTCCG GAGGTCGCAC TCACCATCGC CCGCGAAGGG
GCTTTCACTC CCGAAACCGT CGCCGATACC GGCGTGGTTT CCCGGTGTGA GTACGGCACA
CACCAGGCAG TTCGATATCC ACGGTTCAAG TATGTCGAAC ACGAGGACAA GCAGTCGTTG
TTCGACCTCG AGAACGATCA CCGAGAGACA GTCGACGTCT CCGATGAGTA TCCGGGTCGG
ATCGCCGATA GCAGGGCGCG GCTCGCCCGG TTGGAGCGTC ACGATCGAGA GACCCACGAG
CTCTCGCGTG CCGCCCGGAA TCTCGCCGTG GAGTGCAACC TGTAG
 
Protein sequence
MSETRPNLVL VSIDSLRADH CGFLGDDRGL TPTMDELADD GVTFETAIAP GPQTFSSMPA 
VFTSHHRPTG DLETYPGETN WKRRLAAIDG HLRRHPSLPE RLTELGYSTA GFSPNLWASA
ASGFDRGFDY FADLAGEPAD SRLHTLLSRT PGVDETSKPV ELTLDMLSGN SFFTRWQQFY
DELDAVRRRL SEPYFLWVFL LDTHFPFLPT RTYRQEQSLL RMYLNTARTE KVMRGHAETV
STRARESMQR SYRDTVRSVD GFLQRIRSDL ASDDPVMIVH SDHGESFNEH DNYGHHHREL
YEENIHVPYV VSNAGTTGTI TEPTSLATIP EVALTIAREG AFTPETVADT GVVSRCEYGT
HQAVRYPRFK YVEHEDKQSL FDLENDHRET VDVSDEYPGR IADSRARLAR LERHDRETHE
LSRAARNLAV ECNL