Gene ECD_02341 details


Gene Information

Locus tag: ECD_02341
Symbol: eutB
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2428498
End bp: 2429859
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: ethanolamine ammonia-lyase, large subunit, heavy chain
Protein accession: ACT44162
Protein GI: 253978492
COG category:
COG ID:
TIGRFAM ID:
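
The length fields above follow from the coordinates by simple arithmetic. The sketch below reproduces them, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2428498
END_BP = 2429859


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                   # 1362 bp
print(protein_length(length_bp), "aa")   # 453 aa
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption.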


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG 
CTGGCTAAAG CCAACGAACT GCGTTCGGGG GATGTGCTGG CGGGCGTCGC AGCGGCAAGC
TCACAGGAGC GCGTGGCGGC AAAGCAGGTG TTGTCGGAAA TGACCGTAGC GGACATCCGC
AATAATCCGG TGATTGCCTA TGAAGATGAC TGCGTGACGC GGCTGATTCA GGACGATGTT
AACGAAACGG CCTACAACCA GATTAAAAAC TGGAGCATCA GCGAACTGCG TGAGTATGTG
CTGAGCGATG AAACCAGCGT GGACGACATT GCCTTTACCC GCAAAGGGCT GACCTCGGAA
GTGGTCGCGG CGGTAGCGAA GATTTGCTCC AACGCGGACC TGATCTACGG CGCGAAGAAA
ATGCCGGTAA TCAAAAAGGC CAATACCACC ATCGGTATTC CGGGCACCTT TAGCGCCCGT
TTGCAGCCAA ATGACACCCG TGACGACGTG CAAAGTATCG CCGCGCAAAT CTACGAAGGG
CTTTCCTTCG GGGTGGGCGA TGCGGTGATC GGCGTTAACC CGGTGACTGA CGACGTGGAA
AACTTAAGCC GCGTGTTGGA TACCATCTAT GGCGTGATCG ACAAATTCAA CATCCCAACT
CAGGGCTGTG TACTGGCGCA CGTCACCACC CAGATCGAAG CGATCCGTCG TGGCGCACCG
GGCGGGCTGA TTTTCCAGAG TATCTGTGGC AGCGAAAAAG GGCTGAAAGA GTTTGGCGTG
GAGCTGGCGA TGCTCGACGA AGCGCGCGCA GTGGGCGCGG AGTTCAACCG TATCGCCGGG
GAAAACTGCC TCTACTTCGA AACCGGACAA GGCTCTGCGC TATCCGCTGG CGCTAACTTC
GGCGCAGACC AGGTAACGAT GGAAGCACGT AACTACGGGC TGGCGCGTCA TTACGATCCG
TTTATCGTCA ACACCGTGGT CGGCTTTATT GGGCCGGAGT ATCTCTACAA CGACCGCCAG
ATTATCCGTG CTGGCTTAGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCTATGGGC
TGTGACTGCT GTTATACCAA CCACGCTGAC GCTGACCAGA ACCTCAACGA AAACCTGATG
ATCCTGCTCG CCACCGCAGG CTGCAACTAC ATCATGGGGA TGCCGCTGGG TGATGACATC
ATGCTCAACT ACCAGACCAC CGCATTCCAC GATACCGCCA CTGTGCGTCA GTTACTCAAC
CTGCGCCCGT CACCGGAGTT TGAACGCTGG CTGGAAAGCA TGGGCATTAT GGCAAACGGT
CGCCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
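
The GC content field can be recomputed from the blocked sequence once the spaces and line breaks are removed. The following is a minimal sketch of that calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the gene sequence is used here as sample input, so the printed percentage is not the whole-gene value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and newlines used to print the sequence in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence to
# compare against the GC content field of the record.
first_line = "ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG"
print(len(clean_sequence(first_line)), "bases")
print(round(gc_percent(first_line), 1), "% GC")
```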
 
Protein sequence
MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR 
NNPVIAYEDD CVTRLIQDDV NETAYNQIKN WSISELREYV LSDETSVDDI AFTRKGLTSE
VVAAVAKICS NADLIYGAKK MPVIKKANTT IGIPGTFSAR LQPNDTRDDV QSIAAQIYEG
LSFGVGDAVI GVNPVTDDVE NLSRVLDTIY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP
GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF
GADQVTMEAR NYGLARHYDP FIVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG
CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN
LRPSPEFERW LESMGIMANG RLTKRAGDPS LFF
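
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below builds the codon table from the compact amino-acid string used for NCBI genetic code 11 (its codon assignments match the standard code; the tables differ in permitted start codons) and translates up to the first stop codon. It is an illustration rather than the database's own method, and it assumes an unambiguous sequence: any codon containing a base other than A, C, G or T raises `KeyError`.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# first position varying slowest and third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a blocked nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAAACTAA AGACCACATT GTTCGGCAAT"))  # MKLKTTLFGN
print(translate("CT GTTCTTCT GA"))                    # LFF
```

The two fragments give the first ten and last three residues of the protein sequence listed above.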