Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2827 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3028522 |
End bp | 3030105 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | |
Product | sulfatase |
Protein accession | ACX40460 |
Protein GI | 260450038 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.274935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTAA CCCTCAAAGA ATCGCTTGTT ACCCGTAGCC GGGTATTTAG CCCGTGGACT GCGTTCTACT TTTTACAGTC GCTATTAATT AACCTCGGCT TAGGTTACCC CTTCAGTTTG CTCTACACCG CTGCGTTTAC GGCTATTTTG CTTTTGCTAT GGCGAACATT GCCTCGCGTA CAAAAAGTTC TGGTCGGTGT CAGTTCGCTG GTGGCGGCTT GTTATTTCCC TTTTGCTCAG GCCTACGGCG CGCCTAATTT CAATACATTG CTGGCATTGC ACTCCACCAA TATGGAAGAG TCGACCGAAA TCCTGACGAT TTTTCCGTGG TACAGCTACC TGGTCGGCTT ATTTATTTTT GCGCTCGGCG TAATAGCAAT CAGGCGAAAA AAAGAGAATG AAAAAGCGCG CTGGAATACC TTCGACAGCC TGTGTCTGGT ATTCAGTGTG GCGACATTTT TTGTTGCTCC CGTGCAAAAC CTGGCCTGGG GTGGCGTATT TAAACTGAAA GATACTGGCT ATCCGGTATT TCGTTTTGCT AAGGATGTCA TCGTCAATAA TAACGAGGTG ATTGAAGAGC AAGAACGGAT GGCAAAACTT TCCGGAATGA AAGATACCTG GACGGTCACT GCCGTTAAGC CGAAGTATCA GACCTATGTG GTGGTGATCG GTGAAAGCGC GCGTCGCGAT GCCCTCGGTG CCTTTGGCGG TCACTGGGAC AATACCCCGT TTGCCAGCAG CGTTAACGGT TTGATATTTG CTGACTACAT TGCCGCCAGT GGCTCCACGC AGAAATCGCT TGGCTTAACG CTCAATCGCG TTGTCGATGG CAAACCACAG TTTCAGGATA ACTTTGTCAC CCTGGCAAAT CGCGCGGGCT TCCAGACCTG GTGGTTTTCC AACCAGGGTC AAATCGGCGA ATACGATACC GCTATCGCCA GCATCGCCAA ACGAGCAGAT GAAGTGTACT TCCTGAAAGA AGGTAATTTT GAAGCAGATA AAAACACCAA AGACGAAGCG TTACTGGATA TGACCGCTCA AGTGCTGGCG CAAGAGCACT CGCAACCGCA GCTGATTGTT CTACATCTGA TGGGCTCACA TCCGCAGGCC TGCGACAGGA CACAAGGAAA ATACGAAACC TTTGTGCAAT CGAAAGAAAC GTCGTGCTAT CTCTATACCA TGACGCAAAC GGACGATTTA CTGCGCAAGC TGTACGATCA GTTACGCAAC AGCGGCAGCA GCTTCTCGCT GGTTTACTTT TCTGACCACG GTCTGGCCTT TAAAGAGCGC GGTAAAGACG TGCAATACCT TGCCCATGAT GATAAATATC AGCAAAATTT CCAGGTGCCT TTTATGGTCA TTTCCAGCGA CGATAAAGCG CATCGTGTGA TTAAAGCCCG CCGCTCAGCC AATGACTTCT TAGGCTTTTT CTCCCAGTGG ACGGGGATTA AAGCGAAGGA AATTAACATC AAATACCCGT TTATATCTGA GAAGAAAGCC GGGCCGATAT ACATCACCAA CTTCCAGTTA CAGAAGGTGG ATTACAACCA TCTCGGAACC GATATTTTCG ACCCGAAACC TTAA
|
Protein sequence | MNLTLKESLV TRSRVFSPWT AFYFLQSLLI NLGLGYPFSL LYTAAFTAIL LLLWRTLPRV QKVLVGVSSL VAACYFPFAQ AYGAPNFNTL LALHSTNMEE STEILTIFPW YSYLVGLFIF ALGVIAIRRK KENEKARWNT FDSLCLVFSV ATFFVAPVQN LAWGGVFKLK DTGYPVFRFA KDVIVNNNEV IEEQERMAKL SGMKDTWTVT AVKPKYQTYV VVIGESARRD ALGAFGGHWD NTPFASSVNG LIFADYIAAS GSTQKSLGLT LNRVVDGKPQ FQDNFVTLAN RAGFQTWWFS NQGQIGEYDT AIASIAKRAD EVYFLKEGNF EADKNTKDEA LLDMTAQVLA QEHSQPQLIV LHLMGSHPQA CDRTQGKYET FVQSKETSCY LYTMTQTDDL LRKLYDQLRN SGSSFSLVYF SDHGLAFKER GKDVQYLAHD DKYQQNFQVP FMVISSDDKA HRVIKARRSA NDFLGFFSQW TGIKAKEINI KYPFISEKKA GPIYITNFQL QKVDYNHLGT DIFDPKP
|
| |