Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_0533 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 562133 |
End bp | 563776 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | sulfatase |
Protein accession | ACX38221 |
Protein GI | 260447799 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACTTGG ATTTCTGGAT GACAGTATTC AACAAATTTG CTAGAACTTT TAAATCTCAT TGGTTGTTGT ATCTTTGTGT TATTGTTTTT GGTATTACGA ACTTAGTCGC TTCTTCCGGA GCGCATATGG TTCAGCGCTT GCTGTTCTTC GTTCTGACCA TCCTGGTTGT AAAACGTATA TCATCCCTTC CGCTTCGCCT GCTTGTTGCC GCACCATTTG TGTTACTGAC TGCGGCAGAC ATGAGTATTA GCCTCTATTC ATGGTGTACC TTTGGTACAA CTTTCAATGA TGGATTTGCG ATTAGTGTGC TCCAGAGTGA TCCGGATGAA GTTGTCAAAA TGCTGGGGAT GTATATCCCT TATCTATGTG CCTTTGCTTT TTTATCCCTT CTTTTTTTGG CAGTAATAAT AAAATATGAT GTTTCCTTGC CGACAAAAAA AGTGACAGGA ATATTATTGC TGATTGTCAT TTCGGGCAGT TTATTTTCCG CTTGTCAATT TGCTTATAAA GATGCAAAAA ATAAAAAAGC GTTCAGTCCA TATATACTAG CGTCGCGATT TGCTACCTAT ACGCCGTTTT TCAATCTCAA CTATTTTGCT TTAGCAGCGA AAGAGCATCA AAGATTACTC TCAATTGCAA ACACGGTGCC GTATTTTCAA TTATCAGTCA GGGATACAGG TATTGATACC TACGTGTTGA TTGTGGGGGA GTCTGTACGT GTCGACAATA TGTCTTTGTA TGGATATACA CGCTCTACGA CACCGCAAGT TGAAGCACAA AGAAAACAGA TCAAACTGTT TAATCAAGCA ATAAGCGGCG CACCTTACAC TGCGCTGTCG GTTCCCCTTT CTTTAACTGC TGATTCTGTT TTGAGTCATG ACATTCATAA TTACCCCGAC AACATTATTA ATATGGCTAA TCAAGCAGGA TTTCAGACTT TCTGGCTAAG CTCGCAATCC GCTTTTCGGC AGAATGGTAC AGCAGTTACC AGTATCGCCA TGCGCGCCAT GGAAACAGTT TATGTCAGAG GATTTGATGA ATTGTTGTTG CCGCATTTAT CGCAAGCATT ACAGCAAAAT ACGCAGCAAA AGAAACTGAT TGTTCTTCAT TTAAATGGAA GCCATGAACC GGCTTGTAGC GCCTATCCGC AATCCAGCGC CGTGTTTCAA CCGCAGGACG ATCAGGATGC CTGCTATGAC AACTCCATTC ATTACACAGA TAGTTTGCTA GGTCAGGTTT TTGAATTATT AAAAGATCGC CGCGCCTCGG TCATGTATTT TGCCGACCAC GGCCTGGAAC GTGACCCTAC GAAGAAGAAC GTCTATTTTC ATGGAGGCAG GGAGGCTAGC CAGCAGGCAT ATCATGTCCC GATGTTTATC TGGTATAGCC CCGTTCTTGG GGATGGCGTG GATCGCACAA CGGAAAACAA CATCTTTTCG ACAGCTTACA ATAATTACCT TATTAATGCG TGGATGGGGG TAACAAAGCC GGAACAGCCG CAAACGCTTG AGGAAGTGAT TGCACACTAT AAAGGAGACT CACGGGTTGT GGATGCAAAC CATGATGTTT TCGATTATGT GATGCTCAGA AAGGAGTTTA CAGAGGATAA GCAAGGTAAC CCCACCCCTG AAGGGCAGGG TTGA
|
Protein sequence | MNLDFWMTVF NKFARTFKSH WLLYLCVIVF GITNLVASSG AHMVQRLLFF VLTILVVKRI SSLPLRLLVA APFVLLTAAD MSISLYSWCT FGTTFNDGFA ISVLQSDPDE VVKMLGMYIP YLCAFAFLSL LFLAVIIKYD VSLPTKKVTG ILLLIVISGS LFSACQFAYK DAKNKKAFSP YILASRFATY TPFFNLNYFA LAAKEHQRLL SIANTVPYFQ LSVRDTGIDT YVLIVGESVR VDNMSLYGYT RSTTPQVEAQ RKQIKLFNQA ISGAPYTALS VPLSLTADSV LSHDIHNYPD NIINMANQAG FQTFWLSSQS AFRQNGTAVT SIAMRAMETV YVRGFDELLL PHLSQALQQN TQQKKLIVLH LNGSHEPACS AYPQSSAVFQ PQDDQDACYD NSIHYTDSLL GQVFELLKDR RASVMYFADH GLERDPTKKN VYFHGGREAS QQAYHVPMFI WYSPVLGDGV DRTTENNIFS TAYNNYLINA WMGVTKPEQP QTLEEVIAHY KGDSRVVDAN HDVFDYVMLR KEFTEDKQGN PTPEGQG
|
| |