Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2148 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2295954 |
End bp | 2297636 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | sulfatase |
Protein accession | ACX39801 |
Protein GI | 260449379 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.541761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCTG CATTAAAGAA AAGTGTCGTA AGTACCTCGA TATCTTTGAT ACTGGCATCT GGTATGGCTG CATTTGCTGC TCATGCGGCA GATGATGTAA AGCTGAAAGC AACCAAAACA AACGTTGCTT TCTCAGACTT TACGCCGACA GAATACAGTA CCAAAGGAAA GCCAAATATT ATCGTACTGA CCATGGATGA TCTTGGTTAT GGACAACTTC CTTTTGATAA GGGATCTTTT GACCCAAAAA CAATGGAAAA TCGTGAAGTT GTCGATACCT ACAAAATAGG GATAGATAAA GCCATTGAAG CTGCACAAAA ATCAACGCCG ACGCTCCTTT CATTAATGGA TGAAGGCGTA CGTTTTACTA ACGGCTATGT GGCACACGGT GTTTCCGGCC CCTCCCGCGC CGCAATAATG ACCGGTCGAG CTCCCGCCCG CTTTGGTGTC TATTCCAATA CCGATGCTCA GGATGGTATT CCGCTAACAG AAACTTTCTT GCCTGAATTA TTCCAGAATC ATGGTTATTA CACTGCAGCA GTAGGTAAAT GGCACTTGTC AAAAATCAGT AATGTGCCGG TACCGGAAGA TAAACAAACG CGTGACTATC ATGACAACTT CACCACATTT TCTGCGGAAG AATGGCAACC TCAAAACCGT GGCTTTGATT ACTTTATGGG ATTCCACGCT GCAGGAACGG CATATTACAA CTCCCCTTCA CTGTTCAAAA ATCGTGAACG TGTCCCCGCA AAAGGTTATA TCAGCGATCA GTTAACCGAT GAGGCAATTG GCGTTGTTGA TCGTGCCAAA ACACTTGACC AGCCTTTTAT GCTTTACCTG GCTTATAATG CTCCGCACCT GCCAAATGAT AATCCTGCAC CGGATCAATA TCAGAAGCAA TTTAATACCG GTAGTCAAAC AGCAGATAAC TACTACGCTT CCGTTTATTC TGTTGATCAG GGTGTAAAAC GCATTCTCGA ACAACTGAAG AAAAACGGAC AGTATGACAA TACAATTATT CTCTTTACCT CCGATAATGG TGCGGTTATC GATGGTCCTC TGCCGCTGAA CGGGGCGCAA AAAGGCTATA AGAGTCAGAC CTATCCTGGC GGTACTCACA CCCCAATGTT TATGTGGTGG AAAGGAAAAC TTCAACCCGG TAATTATGAC AAGCTGATTT CCGCAATGGA TTTCTACCCG ACAGCTCTTG ATGCAGCCGA TATCAGCATT CCAAAAGACC TTAAGCTGGA TGGCGTTTCC TTGCTGCCCT GGTTGCAAGA TAAGAAACAA GGCGAGCCAC ATAAAAATCT GACCTGGATA ACCTCTTATT CTCACTGGTT TGACGAGGAA AATATTCCAT TCTGGGATAA TTACCACAAA TTTGTTCGCC ATCAGTCAGA CGATTACCCG CATAACCCCA ACACTGAGGA CTTAAGCCAA TTCTCTTATA CGGTGAGAAA TAACGATTAT TCGCTTGTCT ATACAGTAGA AAACAATCAG TTAGGTCTCT ACAAACTGAC GGATCTACAG CAAAAAGATA ACCTTGCCGC CGCCAATCCG CAGGTCGTTA AAGAGATGCA AGGCGTGGTA AGAGAGTTTA TCGACAGCAG CCAGCCACCG CTTAGCGAGG TAAATCAGGA GAAGTTTAAC AATATCAAGA AAGCACTAAG CGAAGCGAAA TAA
|
Protein sequence | MKSALKKSVV STSISLILAS GMAAFAAHAA DDVKLKATKT NVAFSDFTPT EYSTKGKPNI IVLTMDDLGY GQLPFDKGSF DPKTMENREV VDTYKIGIDK AIEAAQKSTP TLLSLMDEGV RFTNGYVAHG VSGPSRAAIM TGRAPARFGV YSNTDAQDGI PLTETFLPEL FQNHGYYTAA VGKWHLSKIS NVPVPEDKQT RDYHDNFTTF SAEEWQPQNR GFDYFMGFHA AGTAYYNSPS LFKNRERVPA KGYISDQLTD EAIGVVDRAK TLDQPFMLYL AYNAPHLPND NPAPDQYQKQ FNTGSQTADN YYASVYSVDQ GVKRILEQLK KNGQYDNTII LFTSDNGAVI DGPLPLNGAQ KGYKSQTYPG GTHTPMFMWW KGKLQPGNYD KLISAMDFYP TALDAADISI PKDLKLDGVS LLPWLQDKKQ GEPHKNLTWI TSYSHWFDEE NIPFWDNYHK FVRHQSDDYP HNPNTEDLSQ FSYTVRNNDY SLVYTVENNQ LGLYKLTDLQ QKDNLAAANP QVVKEMQGVV REFIDSSQPP LSEVNQEKFN NIKKALSEAK
|
| |