Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1637 |
Symbol | celA |
ID | 6144754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1627429 |
End bp | 1628868 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641616513 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_001743691 |
Protein GI | 170683087 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGAT TTAAAAAAGG TTTTTTATGG GGTGGCGCGG TAGCCGCGCA TCAGTTGGAA GGTGGCTGGA ATGAAGGAGG AAAAGGCATC AGTATCGCTG ATGTGATGAC TGCTGGCGCT CACGGGGTGC CGCGTGAAGT GACAGAAGGC ATTATCGACG GGCTTAATTA TCCCAATCAT GAAGCAATTG ATTTTTATCA TCGCTATAAA ACAGATATTC AGTTATTTGC CGAGATGGGA TTCAAATGCT TTCGAACTTC CATTGCCTGG ACACGAATCT TTCCGCAAGG TGACGAACAG GAGCCGAATG AAGAGGGTTT ACAATTTTAT GATGATCTGT TCGATGAATG CCTGAAGCAG GGAATGGAGC CTGTGGTGAC GCTTTCGCAT TTTGAGATGC CTTATCATCT GGTGACAAAA TATGGTGGCT GGCGACACCG TAAACTGATC GACTTTTTCA TCCGCTTCGC ATCAACGGTC TTCACGCGCT ATAAAGAAAA AGTAAAGTAC TGGATGACGT TTAACGAAAT CAATAATCAG GTGAATTTCA GCGAAAGCCT GTGTCCATTT ACTAATTCCG GTATCTTGTA TTCGCCAGAG GAAGATATCA ATGAGCGCGA ACAAATAATG TACCAGGCGG TACATTACGA GTTAGTTGCC AGTGCCCTGG CGGTACAGAC TGGAAAATCG ATCAATCCTG AATTTAATAT CGGCTGTATG ATCGCCATGT GCCCCATCTA TCCTCTGACG TGTGCACCCA ACGATATGAT GATGGCCACG AAAGCGATGC ATCGTCGTTA CTGGTTTACT GATGTTCATG CGCGTGGATA TTATCCGCAA CATATGCTGA ATTACTTTGC CAGGAAAGGA TTCAACCTCG ATATCACACC AGAAGATAAC ACGATTCTTG CCAGAGGTTG TGTCGACTTT ATCGGTTTTA GCTACTACAT GTCTTTTACT ACGCAATTTT CGCCAGATAA CCCGCAACTG AATTATGTTG AACCACGAGA TTTGGTCAGC AACCCTTATA TCGATACATC CGAATGGGGA TGGCAAATTG ATCCGGCAGG GCTACGTTAT TCACTCAACT GGTTCTGGGA TCATTTCCAG TTGCCGCTGT TTATTGTCGA AAATGGATTG GGTGCGGTTG ACCAGAGACA AGCTGACGGC ACGGTGAACG ATCACTATCG CATTGAGTAC TTTGCTTCCC ATATTCGGGA AATGAAAAAA GCCGTTGTTG AAGATGGTGT TGACTTAATT GGCTACACAC CGTGGGGCTG CATTGACCTG GTTTCTGCCG GAACAGGGGA AATGAAAAAA CGCTACGGAA TGATTTATGT CGATAAAGAC AACGAAGGGA AGGGAACGCT GGAAAGGATA CGTAAAGCGT CGTTTTACTG GTATCGGGAT CTCATCGCCA ACAATGGCGA AAATATTTGA
|
Protein sequence | MSGFKKGFLW GGAVAAHQLE GGWNEGGKGI SIADVMTAGA HGVPREVTEG IIDGLNYPNH EAIDFYHRYK TDIQLFAEMG FKCFRTSIAW TRIFPQGDEQ EPNEEGLQFY DDLFDECLKQ GMEPVVTLSH FEMPYHLVTK YGGWRHRKLI DFFIRFASTV FTRYKEKVKY WMTFNEINNQ VNFSESLCPF TNSGILYSPE EDINEREQIM YQAVHYELVA SALAVQTGKS INPEFNIGCM IAMCPIYPLT CAPNDMMMAT KAMHRRYWFT DVHARGYYPQ HMLNYFARKG FNLDITPEDN TILARGCVDF IGFSYYMSFT TQFSPDNPQL NYVEPRDLVS NPYIDTSEWG WQIDPAGLRY SLNWFWDHFQ LPLFIVENGL GAVDQRQADG TVNDHYRIEY FASHIREMKK AVVEDGVDLI GYTPWGCIDL VSAGTGEMKK RYGMIYVDKD NEGKGTLERI RKASFYWYRD LIANNGENI
|
| |