Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1600 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 1745986 |
End bp | 1747203 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | colanic acid biosynthesis glycosyl transferase WcaC |
Protein accession | ACX39265 |
Protein GI | 260448843 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TGCAATTTAA TGTGCGACTG GCGGAAGGCG GGGCAGCAGG TGTGGCGTTA GATCTCCACC AGCGTGCGCT GCAACAGGGG CTGGCGTCAC ATTTTGTCTA CGGTTACGGC AAAGGCGGCA AAGAGAGCGT CAGCCATCAG AACTATCCGC AGGTCATCAA ACATACGCCG CGGATGACCG CGATGGCGAA TATTGCTCTG TTTCGTCTGT TTAATCGCGA TCTGTTTGGC AATTTCAATG AGTTATATCG CACCATTACT CGCACAGCGG GTCCGGTGGT CCTGCATTTT CATGTGCTGC ACAGCTACTG GCTAAATCTT AAGAGCGTGG TGCGCTTTTG CGAAAAAGTG AAAAACCATA AACCGGACGT CACTCTGGTC TGGACGCTGC ACGACCACTG GAGTGTTACC GGACGCTGCG CCTTTACCGA CGGTTGCGAA GGCTGGAAAA CAGGCTGCCA GAAATGCCCG ACCTTAAATA ACTATCCGCC GGTGAAGATT GATCGCGCAC ACCAACTGGT GGCGGGCAAA CGCCAGTTAT TCCGTGAGAT GCTGGCGCTG GGCTGTCAGT TTATTTCCCC CAGCCAGCAT GTGGCTGACG CTTTCAATAG CCTGTACGGT CCAGGGCGTT GCCGGATTAT CAATAATGGC ATTGATATGG CAACTGAAGC GATTCTGGCG GACTTGCCTC CGGTGCGCGA AACCCAGGGC AAGCCGAAAA TCGCGGTGGT GGCGCATGAT CTGCGTTACG ACGGCAAAAC TAACCAGCAA CTGGTACGCG AGATGATGGC GCTGGGCGAC AAAATTGAAC TGCATACCTT TGGTAAGTTC TCGCCGTTCA CCGCTGGCAA CGTGGTTAAT CACGGCTTTG AAACCGACAA ACGTAAGCTG ATGAGCGCGC TCAATCAGAT GGATGCGCTG GTATTCAGTT CTCGCGTCGA TAACTACCCG CTGATTTTGT GTGAGGCGCT ATCGATTGGC GTGCCGGTGA TTGCCACCCA TAGCGATGCG GCGCGGGAAG TGTTGCAAAA ATCCGGCGGT AAAACCGTCA GCGAAGAAGA GGTGCTGCAA CTGGTGCAGT TAAGCAAACC GGAAATCGCG CAGGCGATAT TTGGTACCAC GCTGGCTGAG TTCAGCCAAC GCAGCCGCGC CGCCTACAGT GGACAACAGA TGCTGGAGGA GTATGTCAAC TTCTATCAGA ATCTGTAG
|
Protein sequence | MNILQFNVRL AEGGAAGVAL DLHQRALQQG LASHFVYGYG KGGKESVSHQ NYPQVIKHTP RMTAMANIAL FRLFNRDLFG NFNELYRTIT RTAGPVVLHF HVLHSYWLNL KSVVRFCEKV KNHKPDVTLV WTLHDHWSVT GRCAFTDGCE GWKTGCQKCP TLNNYPPVKI DRAHQLVAGK RQLFREMLAL GCQFISPSQH VADAFNSLYG PGRCRIINNG IDMATEAILA DLPPVRETQG KPKIAVVAHD LRYDGKTNQQ LVREMMALGD KIELHTFGKF SPFTAGNVVN HGFETDKRKL MSALNQMDAL VFSSRVDNYP LILCEALSIG VPVIATHSDA AREVLQKSGG KTVSEEEVLQ LVQLSKPEIA QAIFGTTLAE FSQRSRAAYS GQQMLEEYVN FYQNL
|
| |