Gene EcSMS35_2596 details


Gene Information

Locus tag: EcSMS35_2596
Symbol: eutB
ID: 6147405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2648456
End bp: 2649817
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 56%
IMG OID: 641617467
Product: ethanolamine ammonia-lyase, large subunit
Protein accession: YP_001744632
Protein GI: 170683706
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4303] Ethanolamine ammonia-lyase, large subunit
TIGRFAM ID:
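
The coordinates and lengths in this record agree with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2648456
end_bp = 2649817

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which codes for no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1362
print(protein_length)  # 453
```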


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG 
CTGGCTAAAG CCAACGAACT GCGTTCGGGG GATGTGCTGG CGGGCGTTGC TGCGGCAAGC
TCACAGGAGC GCGTGGCGGC AAAGCAGGTG TTGTCGGAAA TGACCGTAGC GGACATCCGC
AATAATCCGG TGATTGCCTA TGAAGATGAC TGCGTGACGC GGCTGATTCA GGACGATGTT
AACGAAACGG CCTACAACCA GATTAAAAAC TGGAGCATCA GCGAACTGCG TGAGTATGTG
CTGAGCGATG AAACCAGCGT GGACGACATT GCCTTTACCC GCAAAGGGCT GACCTCGGAA
GTGGTAGCGG CGGTAGCGAA GATTTGCTCC AACGCGGACC TGATCTACGG CGCGAAGAAA
ATGCCGGTGA TCAAAAAGGC CAATACCACC ATCGGTATTC CGGGCACCTT TAGCGCCCGC
TTGCAGCCAA ACGATACCCG TGACGATGTG CAAAGTATCG CCGCGCAAAT CTACGAAGGG
CTTTCCTTCG GGGTGGGCGA TGCGGTGATC GGCGTTAACC CGGTGACTGA CGACGTGGAA
AACTTAAGCC GCGTGTTGGA TACCATCTAT GGCGTGATCG ACAAATTCAA CATCCCAACT
CAGGGCTGCG TACTGGCGCA CGTCACCACC CAGATCGAAG CGATTCGTCG CGGCGCACCG
GGCGGGCTGA TTTTCCAGAG TATTTGCGGC AGCGAAAAAG GGCTGAAAGA GTTTGGCGTG
GAGCTGGCGA TGCTCGACGA AGCGCGCGCA GTGGGTGCAG AGTTTAACCG TATCGCCGGG
GAAAACTGCC TCTACTTCGA AACCGGACAA GGTTCGGCGC TGTCCGCTGG CGCTAACTTC
GGCGCTGACC AGGTGACGAT GGAAGCGCGT AACTATGGGC TGGCGCGTCA TTACGATCCG
TTTATCGTCA ACACCGTGGT CGGTTTTATT GGGCCGGAGT ATCTCTACAA CGACCGCCAG
ATTATCCGCG CGGGCTTAGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCTATGGGC
TGTGACTGCT GCTACACCAA CCACGCTGAC GCTGACCAGA ACCTCAACGA AAACCTGATG
ATCCTGCTCG CCACCGCAGG CTGTAACTAC ATCATGGGGA TGCCGCTGGG TGATGACATC
ATGCTCAACT ACCAGACTAC GGCCTTCCAC GACACCGCCA CTGTGCGTCA GTTACTCAAC
CTGCGCCCGT CACCGGAGTT TGAACGCTGG CTGGAAAGCA TGGGCATTAT GGCAAACGGT
CGCCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
 
Protein sequence
MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR 
NNPVIAYEDD CVTRLIQDDV NETAYNQIKN WSISELREYV LSDETSVDDI AFTRKGLTSE
VVAAVAKICS NADLIYGAKK MPVIKKANTT IGIPGTFSAR LQPNDTRDDV QSIAAQIYEG
LSFGVGDAVI GVNPVTDDVE NLSRVLDTIY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP
GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF
GADQVTMEAR NYGLARHYDP FIVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG
CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN
LRPSPEFERW LESMGIMANG RLTKRAGDPS LFF
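
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may initiate translation, and this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_line = "ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG"
print(translate(first_line))  # MKLKTTLFGNVYQFKDVKEV
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the listed protein sequence and GC content.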