Gene Information |
Locus tag | EcDH1_4031 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4364205 |
End bp | 4365938 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | sulfatase |
Protein accession | ACX41631 |
Protein GI | 260451209 |
COG category | |
COG ID | |
TIGRFAM ID | |
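
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption, though it reproduces the listed gene length) and a CDS that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(4364205, 4365938)
print(length)                     # 1734
print(protein_length_aa(length))  # 577
```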
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTCCA CAGAAGTCCA GGCTAAACCT CTTTTTAGCT GGAAAGCCCT GGGTTGGGCA CTGCTCTACT TTTGGTTTTT CTCTACTCTG CTACAGGCCA TTATTTACAT CAGTGGTTAT AGTGGCACTA ACGGCATTCG CGACTCGCTG TTATTCAGTT CGCTGTGGTT GATCCCGGTA TTCCTCTTTC CGAAGCGGAT TAAAATTATT GCCGCAGTAA TCGGCGTGGT GCTATGGGCG GCCTCTCTGG CGGCGCTGTG CTACTACGTC ATCTACGGTC AGGAGTTCTC GCAGAGCGTT CTGTTTGTGA TGTTCGAAAC CAACACCAAC GAAGCCAGCG AGTATTTAAG CCAGTATTTC AGCCTGAAAA TTGTGCTTAT CGCGCTGGCC TATACGGCGG TGGCAGTTCT GCTGTGGACA CGCCTGCGCC CGGTCTATAT TCCAAAGCCG TGGCGTTATG TTGTCTCTTT TGCCCTGCTT TATGGCTTGA TTCTGCATCC GATCGCCATG AATACGTTTA TCAAAAACAA GCCGTTTGAG AAAACGTTGG ATAACCTGGC CTCGCGTATG GAGCCTGCCG CACCGTGGCA ATTCCTGACC GGCTATTATC AGTATCGTCA GCAACTAAAC TCGCTAACAA AGTTACTGAA TGAAAATAAT GCCTTGCCGC CACTGGCTAA TTTCAAAGAT GAATCGGGTA ACGAACCGCG CACTTTAGTG CTGGTGATTG GCGAGTCGAC CCAGCGCGGA CGCATGAGTC TGTACGGTTA TCCGCGTGAA ACCACGCCGG AGCTGGATGC GCTGCATAAA ACCGATCCGA ATCTGACCGT GTTTAATAAC GTAGTTACGT CTCGTCCGTA CACCATTGAA ATCCTGCAAC AGGCGCTGAC CTTTGCCAAT GAAAAGAACC CGGATCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAACAGGCG GGTTATAAAA CCTTCTGGAT CACCAACCAG CAGACGATGA CCGCCCGCAA TACCATGCTG ACGGTATTTT CGCGCCAGAC CGACAAGCAG TACTACATGA ACCAGCAACG TACGCAGAGT GCGCGTGAAT ACGACACCAA CGTGCTGAAG CCGTTCCAGG AAGTGCTGAA TGACCCTGCG CCGAAGAAAC TGATCATTGT TCATCTGCTG GGTACGCATA TCAAATACAA ATACCGCTAC CCGGAAAATC AGGGCAAGTT TGATGGCAAT ACCGATCATG TTCCGCCGGG ATTAAACGCG GAAGAGCTGG AGTCATATAA CGATTATGAC AACGCTAACC TGTATAACGA TCATGTGGTT GCCAGCCTGA TTAAAGACTT TAAAGCAGCA AACCCGAACG GTTTCCTGGT TTATTTCTCT GACCACGGTG AAGAGGTTTA CGACACGCCG CCGCACAAAA CTCAGGGGCG TAATGAGGAC AACCCGACGC GTCATATGTA CACCATTCCG TTCCTGCTGT GGACGTCAGA AAAATGGCAA GCGACTCATC CCCGTGATTT CTCGCAGGAT GTTGATCGTA AATACAGCCT GGCGGAACTG ATCCACACCT GGTCAGATTT GGCGGGCTTA TCTTACGACG GTTACGATCC AACCCGTTCA GTGGTGAATC CGCAGTTCAA AGAAACTACC CGCTGGATTG GTAACCCGTA TAAGAAAAAC GCACTGATCG ATTACGACAC ACTGCCCTAT GGCGATCAGG TGGGTAATCA GTAA
|
Protein sequence | MHSTEVQAKP LFSWKALGWA LLYFWFFSTL LQAIIYISGY SGTNGIRDSL LFSSLWLIPV FLFPKRIKII AAVIGVVLWA ASLAALCYYV IYGQEFSQSV LFVMFETNTN EASEYLSQYF SLKIVLIALA YTAVAVLLWT RLRPVYIPKP WRYVVSFALL YGLILHPIAM NTFIKNKPFE KTLDNLASRM EPAAPWQFLT GYYQYRQQLN SLTKLLNENN ALPPLANFKD ESGNEPRTLV LVIGESTQRG RMSLYGYPRE TTPELDALHK TDPNLTVFNN VVTSRPYTIE ILQQALTFAN EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTMTARNTML TVFSRQTDKQ YYMNQQRTQS AREYDTNVLK PFQEVLNDPA PKKLIIVHLL GTHIKYKYRY PENQGKFDGN TDHVPPGLNA EELESYNDYD NANLYNDHVV ASLIKDFKAA NPNGFLVYFS DHGEEVYDTP PHKTQGRNED NPTRHMYTIP FLLWTSEKWQ ATHPRDFSQD VDRKYSLAEL IHTWSDLAGL SYDGYDPTRS VVNPQFKETT RWIGNPYKKN ALIDYDTLPY GDQVGNQ
|
| |
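
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing, translates the DNA codon by codon, and computes a GC percentage. It uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons; it does not handle alternative start codons such as GTG or TTG, which is fine here because this gene starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon-table order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence above (the trailing partial codon is ignored).
print(translate("ATGCATTCCA CAGAAGTCCA"))  # MHSTEV
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence once its spacing is removed, and `gc_percent` can be compared against the GC content field.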