Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2578 |
Symbol | eutB |
ID | 5592573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2586015 |
End bp | 2587376 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640921699 |
Product | ethanolamine ammonia-lyase, large subunit |
Protein accession | YP_001459226 |
Protein GI | 157161908 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4303] Ethanolamine ammonia-lyase, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 73 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG CTGGCTAAAG CCAACGAACT GCGTTCGGGG GATGTGCTGG CGGGCGTTGC AGCGGCAAGC TCACAGGAGC GCGTGGCGGC AAAGCAGGTG TTGTCGGAAA TGACCGTAGC GGACATCCGC AATAATCCGG TGATTGCCTA TGAAGATGAC TGCGTGACGC GGCTGATTCA GGACGACGTT AACGAAACGG CCTACAACCA GATTAAAAAC TGGAGCATCA GCGAACTGCG TGAGTATGTG CTGAGCGATG AAACCAGCGT GGACGACATT GCCTTTACCC GCAAAGGGCT GACCTCGGAA GTGGTCGCGG CGGTAGCGAA GATTTGCTCC AACGCGGACC TGATCTACGG CGCGAAGAAA ATGCCGGTAA TCAAAAAGGC CAATACCACC ATCGGTATTC CGGGCACCTT TAGCGCCCGT TTGCAGCCGA ACGATACCCG TGACGACGTG CAAAGTATCG CTGCGCAAAT CTATGAAGGG CTTTCCTTTG GGGTGGGCGA TGCGGTGATC GGCGTTAACC CGGTAACTGA CGACGTGGAA AACTTAAGCC GCGTGCTGGA TACCATTTAT GGCGTGATCG ACAAATTCAA CATCCCAACT CAGGGCTGCG TACTGGCGCA CGTCACCACC CAGATCGAAG CGATTCGTCG CGGCGCGCCT GGCGGACTGA TTTTCCAGAG TATCTGTGGC AGCGAAAAAG GGCTGAAAGA GTTTGGCGTG GAACTGGCGA TGCTCGACGA AGCGCGCGCA GTGGGCGCAG AGTTCAATCG TATCGCCGGG GAAAACTGCC TCTACTTCGA AACCGGACAA GGCTCTGCGC TATCCGCTGG CGCTAACTTC GGCGCTGACC AGGTGACGAT GGAAGCACGT AACTATGGGC TGGCGCGTCA TTACGATCCG TTTATCGTCA ACACCGTGGT CGGCTTTATT GGGCCGGAGT ATCTCTACAA CGACCGCCAG ATTATCCGTG CTGGCTTAGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCTATGGGC TGTGACTGCT GCTACACCAA CCACGCTGAC GCTGACCAGA ACCTCAACGA AAACCTGATG ATCCTGCTCG CCACCGCAGG CTGCAACTAC ATCATGGGGA TGCCGCTGGG TGATGACATC ATGCTCAACT ACCAGACCAC CGCATTCCAC GATACCGCCA CTGTGCGTCA GTTACTCAAC CTGCGTCCGT CACCGGAGTT TGAACGCTGG CTGGAAAGCA TGGGCATTAT GGCAAACGGT CGCCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
|
Protein sequence | MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR NNPVIAYEDD CVTRLIQDDV NETAYNQIKN WSISELREYV LSDETSVDDI AFTRKGLTSE VVAAVAKICS NADLIYGAKK MPVIKKANTT IGIPGTFSAR LQPNDTRDDV QSIAAQIYEG LSFGVGDAVI GVNPVTDDVE NLSRVLDTIY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF GADQVTMEAR NYGLARHYDP FIVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN LRPSPEFERW LESMGIMANG RLTKRAGDPS LFF
|
| |