Gene Information |
Locus tag | EcSMS35_2596 |
Symbol | eutB |
ID | 6147405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2648456 |
End bp | 2649817 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617467 |
Product | ethanolamine ammonia-lyase, large subunit |
Protein accession | YP_001744632 |
Protein GI | 170683706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4303] Ethanolamine ammonia-lyase, large subunit |
TIGRFAM ID | |
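The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2648456
END_BP = 2649817


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1362, matching "Gene Length"
    print(protein_length(length))  # 453, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1362 bp) and Protein Length (453 aa) fields.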
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG CTGGCTAAAG CCAACGAACT GCGTTCGGGG GATGTGCTGG CGGGCGTTGC TGCGGCAAGC TCACAGGAGC GCGTGGCGGC AAAGCAGGTG TTGTCGGAAA TGACCGTAGC GGACATCCGC AATAATCCGG TGATTGCCTA TGAAGATGAC TGCGTGACGC GGCTGATTCA GGACGATGTT AACGAAACGG CCTACAACCA GATTAAAAAC TGGAGCATCA GCGAACTGCG TGAGTATGTG CTGAGCGATG AAACCAGCGT GGACGACATT GCCTTTACCC GCAAAGGGCT GACCTCGGAA GTGGTAGCGG CGGTAGCGAA GATTTGCTCC AACGCGGACC TGATCTACGG CGCGAAGAAA ATGCCGGTGA TCAAAAAGGC CAATACCACC ATCGGTATTC CGGGCACCTT TAGCGCCCGC TTGCAGCCAA ACGATACCCG TGACGATGTG CAAAGTATCG CCGCGCAAAT CTACGAAGGG CTTTCCTTCG GGGTGGGCGA TGCGGTGATC GGCGTTAACC CGGTGACTGA CGACGTGGAA AACTTAAGCC GCGTGTTGGA TACCATCTAT GGCGTGATCG ACAAATTCAA CATCCCAACT CAGGGCTGCG TACTGGCGCA CGTCACCACC CAGATCGAAG CGATTCGTCG CGGCGCACCG GGCGGGCTGA TTTTCCAGAG TATTTGCGGC AGCGAAAAAG GGCTGAAAGA GTTTGGCGTG GAGCTGGCGA TGCTCGACGA AGCGCGCGCA GTGGGTGCAG AGTTTAACCG TATCGCCGGG GAAAACTGCC TCTACTTCGA AACCGGACAA GGTTCGGCGC TGTCCGCTGG CGCTAACTTC GGCGCTGACC AGGTGACGAT GGAAGCGCGT AACTATGGGC TGGCGCGTCA TTACGATCCG TTTATCGTCA ACACCGTGGT CGGTTTTATT GGGCCGGAGT ATCTCTACAA CGACCGCCAG ATTATCCGCG CGGGCTTAGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCTATGGGC TGTGACTGCT GCTACACCAA CCACGCTGAC GCTGACCAGA ACCTCAACGA AAACCTGATG ATCCTGCTCG CCACCGCAGG CTGTAACTAC ATCATGGGGA TGCCGCTGGG TGATGACATC ATGCTCAACT ACCAGACTAC GGCCTTCCAC GACACCGCCA CTGTGCGTCA GTTACTCAAC CTGCGCCCGT CACCGGAGTT TGAACGCTGG CTGGAAAGCA TGGGCATTAT GGCAAACGGT CGCCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
|
Protein sequence | MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR NNPVIAYEDD CVTRLIQDDV NETAYNQIKN WSISELREYV LSDETSVDDI AFTRKGLTSE VVAAVAKICS NADLIYGAKK MPVIKKANTT IGIPGTFSAR LQPNDTRDDV QSIAAQIYEG LSFGVGDAVI GVNPVTDDVE NLSRVLDTIY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF GADQVTMEAR NYGLARHYDP FIVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN LRPSPEFERW LESMGIMANG RLTKRAGDPS LFF
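The relationship between the gene sequence and the protein sequence above can be sketched with a small translation routine. This is a hedged illustration, not the pipeline that produced the record. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the Strand field is "-"), and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares. Table 11's alternative start codons are not handled because this CDS starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of the standard code; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace (10-mer grouping) is ignored.

    A single trailing stop codon, if present, is dropped. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


if __name__ == "__main__":
    # First three 10-mers of the gene sequence in the record.
    print(translate_cds("ATGAAACTAA AGACCACATT GTTCGGCAAT"))  # MKLKTTLFGN
```

Passing the full Gene sequence field to `translate_cds` and comparing the result with the Protein sequence field (after removing its spaces) is one way to check the two fields for consistency.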