Gene Information |
Locus tag | EcDH1_4184 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4536942 |
End bp | 4538192 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | polysaccharide biosynthesis protein |
Protein accession | ACX41784 |
Protein GI | 260451362 |
COG category | |
COG ID | |
TIGRFAM ID | |
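The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4536942, 4538192)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```

Both values agree with the Gene Length and Protein Length fields of this record.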
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTGG CAAAAGCGTC CTTGTGGACG GCGGCCAGTA CACTGGTCAA GATTGGTGCC GGGTTACTGG TCGGTAAGTT GCTGGCGGTG TCATTTGGTC CGGCGGGGCT TGGGCTGGCG GCAAATTTCC GCCAGTTGAT TACCGTGCTC GGCGTGCTTG CCGGGGCTGG CATCTTTAAC GGTGTAACCA AATACGTTGC CCAGTACCAT GATAATCCGC AACAGCTGCG CCGCGTGGTC GGCACTTCAT CAGCGATGGT ACTTGGTTTC TCTACGCTGA TGGCGCTGGT TTTTGTGCTG GCAGCTGCGC CAATCAGCCA GGGATTGTTT GGTAATACCG ACTATCAGGG GCTGGTGCGT TTAGTGGCGC TGGTGCAAAT GGGGATCGCC TGGGGCAACC TGTTACTGGC GCTGATGAAA GGCTTTCGCG ATGCCGCAGG TAATGCGTTA TCGCTGATTG TCGGCAGCTT GATTGGCGTT CTCGCGTACT ACGTCAGTTA CCGTTTGGGC GGTTATGAAG GGGCGTTGCT GGGTCTGGCG CTGATTCCCG CGCTGGTGGT AATTCCTGCC GCCATCATGT TGATCAAACG TGGTGTCATC CCGTTAAGCT ATCTGAAACC CAGCTGGGAT AACGGTCTGG CAGGGCAGTT GAGCAAATTT ACGCTCATGG CGTTGATTAC GTCGGTGACC TTGCCTGTTG CTTACATCAT GATGCGTAAA CTGCTGGCGG CGCAGTATAG CTGGGATGAG GTGGGGATCT GGCAAGGGGT GAGCAGTATT TCCGATGCCT ACCTGCAATT TATTACGGCA TCGTTCAGCG TATATTTGCT GCCCACGTTG TCGCGGCTAA CGGAAAAGCG CGATATCACC CGGGAAGTGG TTAAATCGCT GAAATTCGTC TTACCGGCAG TGGCGGCGGC GAGTTTTACC GTCTGGCTGC TGCGTGATTT TGCTATCTGG CTGCTGTTGT CGAATAAATT TACCGCTATG CGCGATCTCT TTGCCTGGCA GTTAGTGGGT GATGTGTTAA AAGTGGGCGC TTATGTCTTT GGTTATCTGG TGATCGCCAA AGCGTCACTG CGGTTTTATA TTCTGGCGGA AGTCAGCCAG TTCACTTTAT TGATGGTATT TGCCCACTGG CTAATCCCTG CGCATGGTGC ACTGGGCGCG GCGCAGGCAT ATATGGCAAC TTATATCGTC TATTTTTCTC TTTGTTGTGG CGTGTTTTTA CTCTGGCGTA GGCGGGCATG A
|
Protein sequence | MSLAKASLWT AASTLVKIGA GLLVGKLLAV SFGPAGLGLA ANFRQLITVL GVLAGAGIFN GVTKYVAQYH DNPQQLRRVV GTSSAMVLGF STLMALVFVL AAAPISQGLF GNTDYQGLVR LVALVQMGIA WGNLLLALMK GFRDAAGNAL SLIVGSLIGV LAYYVSYRLG GYEGALLGLA LIPALVVIPA AIMLIKRGVI PLSYLKPSWD NGLAGQLSKF TLMALITSVT LPVAYIMMRK LLAAQYSWDE VGIWQGVSSI SDAYLQFITA SFSVYLLPTL SRLTEKRDIT REVVKSLKFV LPAVAAASFT VWLLRDFAIW LLLSNKFTAM RDLFAWQLVG DVLKVGAYVF GYLVIAKASL RFYILAEVSQ FTLLMVFAHW LIPAHGALGA AQAYMATYIV YFSLCCGVFL LWRRRA
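The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene. It assumes the gene sequence is listed in coding orientation (it begins with ATG although the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for non-start codons. The helper names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs only in which codons may act as starts.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in an ungrouped DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = ungroup("ATGTCGTTGG CAAAAGCGTC CTTGTGGACG")
print(translate(prefix))  # MSLAKASLWT
```

The translated prefix matches the first block of the protein sequence listed in the record. Passing the full gene sequence through the same helpers is a way to check the reported GC content and protein length.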
|
| |