Gene EcDH1_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0533 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp562133 
End bp563776 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content43% 
IMG OID 
Productsulfatase 
Protein accessionACX38221 
Protein GI260447799 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACTTGG ATTTCTGGAT GACAGTATTC AACAAATTTG CTAGAACTTT TAAATCTCAT 
TGGTTGTTGT ATCTTTGTGT TATTGTTTTT GGTATTACGA ACTTAGTCGC TTCTTCCGGA
GCGCATATGG TTCAGCGCTT GCTGTTCTTC GTTCTGACCA TCCTGGTTGT AAAACGTATA
TCATCCCTTC CGCTTCGCCT GCTTGTTGCC GCACCATTTG TGTTACTGAC TGCGGCAGAC
ATGAGTATTA GCCTCTATTC ATGGTGTACC TTTGGTACAA CTTTCAATGA TGGATTTGCG
ATTAGTGTGC TCCAGAGTGA TCCGGATGAA GTTGTCAAAA TGCTGGGGAT GTATATCCCT
TATCTATGTG CCTTTGCTTT TTTATCCCTT CTTTTTTTGG CAGTAATAAT AAAATATGAT
GTTTCCTTGC CGACAAAAAA AGTGACAGGA ATATTATTGC TGATTGTCAT TTCGGGCAGT
TTATTTTCCG CTTGTCAATT TGCTTATAAA GATGCAAAAA ATAAAAAAGC GTTCAGTCCA
TATATACTAG CGTCGCGATT TGCTACCTAT ACGCCGTTTT TCAATCTCAA CTATTTTGCT
TTAGCAGCGA AAGAGCATCA AAGATTACTC TCAATTGCAA ACACGGTGCC GTATTTTCAA
TTATCAGTCA GGGATACAGG TATTGATACC TACGTGTTGA TTGTGGGGGA GTCTGTACGT
GTCGACAATA TGTCTTTGTA TGGATATACA CGCTCTACGA CACCGCAAGT TGAAGCACAA
AGAAAACAGA TCAAACTGTT TAATCAAGCA ATAAGCGGCG CACCTTACAC TGCGCTGTCG
GTTCCCCTTT CTTTAACTGC TGATTCTGTT TTGAGTCATG ACATTCATAA TTACCCCGAC
AACATTATTA ATATGGCTAA TCAAGCAGGA TTTCAGACTT TCTGGCTAAG CTCGCAATCC
GCTTTTCGGC AGAATGGTAC AGCAGTTACC AGTATCGCCA TGCGCGCCAT GGAAACAGTT
TATGTCAGAG GATTTGATGA ATTGTTGTTG CCGCATTTAT CGCAAGCATT ACAGCAAAAT
ACGCAGCAAA AGAAACTGAT TGTTCTTCAT TTAAATGGAA GCCATGAACC GGCTTGTAGC
GCCTATCCGC AATCCAGCGC CGTGTTTCAA CCGCAGGACG ATCAGGATGC CTGCTATGAC
AACTCCATTC ATTACACAGA TAGTTTGCTA GGTCAGGTTT TTGAATTATT AAAAGATCGC
CGCGCCTCGG TCATGTATTT TGCCGACCAC GGCCTGGAAC GTGACCCTAC GAAGAAGAAC
GTCTATTTTC ATGGAGGCAG GGAGGCTAGC CAGCAGGCAT ATCATGTCCC GATGTTTATC
TGGTATAGCC CCGTTCTTGG GGATGGCGTG GATCGCACAA CGGAAAACAA CATCTTTTCG
ACAGCTTACA ATAATTACCT TATTAATGCG TGGATGGGGG TAACAAAGCC GGAACAGCCG
CAAACGCTTG AGGAAGTGAT TGCACACTAT AAAGGAGACT CACGGGTTGT GGATGCAAAC
CATGATGTTT TCGATTATGT GATGCTCAGA AAGGAGTTTA CAGAGGATAA GCAAGGTAAC
CCCACCCCTG AAGGGCAGGG TTGA
 
Protein sequence
MNLDFWMTVF NKFARTFKSH WLLYLCVIVF GITNLVASSG AHMVQRLLFF VLTILVVKRI 
SSLPLRLLVA APFVLLTAAD MSISLYSWCT FGTTFNDGFA ISVLQSDPDE VVKMLGMYIP
YLCAFAFLSL LFLAVIIKYD VSLPTKKVTG ILLLIVISGS LFSACQFAYK DAKNKKAFSP
YILASRFATY TPFFNLNYFA LAAKEHQRLL SIANTVPYFQ LSVRDTGIDT YVLIVGESVR
VDNMSLYGYT RSTTPQVEAQ RKQIKLFNQA ISGAPYTALS VPLSLTADSV LSHDIHNYPD
NIINMANQAG FQTFWLSSQS AFRQNGTAVT SIAMRAMETV YVRGFDELLL PHLSQALQQN
TQQKKLIVLH LNGSHEPACS AYPQSSAVFQ PQDDQDACYD NSIHYTDSLL GQVFELLKDR
RASVMYFADH GLERDPTKKN VYFHGGREAS QQAYHVPMFI WYSPVLGDGV DRTTENNIFS
TAYNNYLINA WMGVTKPEQP QTLEEVIAHY KGDSRVVDAN HDVFDYVMLR KEFTEDKQGN
PTPEGQG