Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0186 |
Symbol | |
ID | 6068233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 204393 |
End bp | 205499 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641599587 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_001723194 |
Protein GI | 170018240 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTGT TGCGTAGTGG ACTCGTGACG ATGCTGCTGC TGGCTGCCTT TAGTGTTCAG GCAGCCTGTA CCTGGCCTGC CTGGGAGCAG TTTAAAAAGG ATTACATCAG TCAGGAAGGG CGCGTCATCG ACCCCAGCGA CGCGCGCAAA ATCACCACCT CCGAAGGGCA AAGTTACGGT ATGTTCTTTG CCCTGGCGGC TAACGACCGT GCAGCTTTCG ATAATATTCT CGACTGGACG CAGAACAATC TCGCTCAGGG TTCTTTAAAA GAAGGTTTGC CCGCCTGGCT GTGGGGCAAG AAAGAGAACA GTAAGTGGGA AGTGCTGGAC AGCAATTCGG CCTCCGATGG TGATGTCTGG ATGGCCTGGT CATTGCTGGA GGCGGGGCGT TTGTGGAAAG AGCAGCGTTA TACCGACATC GGCAGCGCGT TGCTAAAACG TATCGCGCGG GAGGAAGTGG TGACGGTGCC TGGGCTGGGT TCCATGTTGT TACCGGGCAA AGTGGGTTTT GCTGAGGATA ACAGCTGGCG TTTTAACCCC AGCTACCTGC CGCCGACGCT GGCGCAGTAT TTCACCCGCT TTGGCGCGCC GTGGACTACG CTGCGCGAAA CCAATCAACG TTTATTGCTG GAAACCGCCC CGAAAGGCTT TTCGCCAGAC TGGGTGCGCT ATGAGAAAGA CAAAGGCTGG CAGCTAAAAG CCGAAAAAAC ATTGATCAGC AGCTACGACG CTATCCGCGT TTACATGTGG GTAGGCATGA TGCCTGACAG CGATCCGCAA AAAGCGCGGA TGCTCAACCG GTTTAAACCG ATGGCGACAT TCACTGAGAA AAACGGTTAT CCGCCGGAAA AAGTGGATGT GGCTACGGGG AAAGCGCAGG GTAAAGGACC GGTCGGTTTT TCTGCCGCCA TGCTGCCTTT TTTACAAAAC CGCGATGCGC AGGCCGTTCA GCGCCAGCGC GTGGCCGATA ACTTTCCCGG CAGCGATGCC TATTACAACT ATGTGCTGAC CCTGTTTGGA CAAGGCTGGG ATCAACACCG TTTCCGCTTC TCGACAAAAG GTGAGTTATT ACCTGACTGG GGCCAGGAAT GCGCAAATTC ACACTAA
|
Protein sequence | MNVLRSGLVT MLLLAAFSVQ AACTWPAWEQ FKKDYISQEG RVIDPSDARK ITTSEGQSYG MFFALAANDR AAFDNILDWT QNNLAQGSLK EGLPAWLWGK KENSKWEVLD SNSASDGDVW MAWSLLEAGR LWKEQRYTDI GSALLKRIAR EEVVTVPGLG SMLLPGKVGF AEDNSWRFNP SYLPPTLAQY FTRFGAPWTT LRETNQRLLL ETAPKGFSPD WVRYEKDKGW QLKAEKTLIS SYDAIRVYMW VGMMPDSDPQ KARMLNRFKP MATFTEKNGY PPEKVDVATG KAQGKGPVGF SAAMLPFLQN RDAQAVQRQR VADNFPGSDA YYNYVLTLFG QGWDQHRFRF STKGELLPDW GQECANSH
|
| |