Gene Huta_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0538 
Symbol 
ID8382805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp543656 
End bp545011 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content57% 
IMG OID644971600 
Productsulfatase 
Protein accessionYP_003129458 
Protein GI257051625 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAGG TCCTCCTCGT CACTATCGAT TCTCTCAGGG CCGATCACGT CGGCTATCAC 
GGGTACGAAC GGGATACGAC ACCAGTACTC GACGGGTATG CCGCCGACGG TAGTCGGTTC
ATGAACGCAT TCGCACACGT GGGTGGCACT CGATTTTCCT TTCCGTCGAT CCTGACCGGC
GTGACGCCGC TGATGTACGG TGGCCACCAC AGCGTTTCAG AGGAGCAAAC ACTGGTCTCG
GAAGTGTTCG ACGATGCCGG GTTTCGAACT GGCGGGTATC ACTCCAATCT CTATATCTCG
GCCGAATTCG GATACGATCG CGGCTGGGAC GAGTTCTTCG ATTCCGCGCC GGACGACTCG
ACGACAGCTT CGTTCCGCCG CTGGGCCAAG ACCAATCTCC AGAACACTCC GATTTATGGT
CTTCTACAGC AGGCGTACGA CTTCATCGAG TCCTCGGCCG GTGTCAACGT CGGTTCCTAT
CACGCCCCCG CGGAGGATAT CACCGATAAG GGAATCGAGT TCGTAGACTC GGTAGGATCA
GACGAGCCCG CCTTTCTGTG GGTTCACTAT ATGGATGTCC ACCATCCGTT TCTCCCTCCA
GCGGAGTATC AACAACAATT CCGCGACGAT GTCGTCAGTG ACCGACAGTC CATCAAGTTA
CGCCGGAAGT TCATCGAGGA ACCAGACGCT GTCACGGACG AGGAACTCCA GACGTTCATC
GATCTCTACG ACGCGGAGAT TCGGTACAAC GACGCCGAGA TCGGTCGGCT CCTCGAGCAC
GTCGAGTCGG AGTGGGGCGA AGACTACCTA CTGGCGTTTA CGGCCGACCA CGGCGATCAC
TTCCTCGAAC ACGGATATTT CGGCGGCGCA CGAGCGCTGG ACGTCAAAAC ACACGTTCCG
CTGTTCGTCA ATGGATGGGA TGACGACGCC GAATACGACG ACATGGTCGG GCTCGTCGAT
GTCCCATCTA CACTCGTCGA CGCTGCTGGA CTCGACATTC CCGATACCTT CCACGGACAT
AGCCTCCGGT CGCTCGTTTT CGACGATGAA TGGCCCCGTG AGGACGTAAT CGGTGGCTGG
TTCGACGGCG ATGGAAACCA CCTCTGTGTC CGTGAACGCG ACTGGAAACT CATCGAACGA
CCCGGAGACA ATGCCGACGA GTTGTACGAT CTGGTTTCTG ACCCCGGCGA ACAACGGAAC
GTATTTGGCG ACCATCCCGA CCTGACCGAG CGTCTTCGGG AAAAACTCGA CCGACACAGA
CAGCTCGTTC GCTCGACGGA AGACGAAAGC GTCGAGCGCC CGGATATGAA CGAAGACGTC
AAAGAACGCC TTCGACGCCT TGGTTACAAG GAATAA
 
Protein sequence
MDKVLLVTID SLRADHVGYH GYERDTTPVL DGYAADGSRF MNAFAHVGGT RFSFPSILTG 
VTPLMYGGHH SVSEEQTLVS EVFDDAGFRT GGYHSNLYIS AEFGYDRGWD EFFDSAPDDS
TTASFRRWAK TNLQNTPIYG LLQQAYDFIE SSAGVNVGSY HAPAEDITDK GIEFVDSVGS
DEPAFLWVHY MDVHHPFLPP AEYQQQFRDD VVSDRQSIKL RRKFIEEPDA VTDEELQTFI
DLYDAEIRYN DAEIGRLLEH VESEWGEDYL LAFTADHGDH FLEHGYFGGA RALDVKTHVP
LFVNGWDDDA EYDDMVGLVD VPSTLVDAAG LDIPDTFHGH SLRSLVFDDE WPREDVIGGW
FDGDGNHLCV RERDWKLIER PGDNADELYD LVSDPGEQRN VFGDHPDLTE RLREKLDRHR
QLVRSTEDES VERPDMNEDV KERLRRLGYK E