Gene EcolC_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1238 
Symbol 
ID6067354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1357697 
End bp1359058 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content55% 
IMG OID641600653 
Productethanolamine ammonia lyase large subunit 
Protein accessionYP_001724231 
Protein GI170019277 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4303] Ethanolamine ammonia-lyase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.357696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAA AGACCACATT GTTCGGCAAT GTATATCAGT TTAAGGATGT AAAAGAGGTG 
CTGGCTAAAG CCAACGAACT GCGTTCGGGG GATGTGCTGG CGGGCGTTGC AGCGGCAAGC
TCACAGGAGC GCGTGGCGGC AAAGCAGGTG TTGTCGGAAA TGACCGTAGC GGACATCCGC
AATAATCCGG TGATTGCCTA TGAAGATGAC TGCGTGACGC GGCTGATTCA GGACGACGTT
AACGAAACGG CCTACAACCA GATTAAAAAC TGGAGCATCA GCGAACTGCG TGAGTATGTG
CTGAGCGATG AAACCAGCGT GGACGACATT GCCTTTACCC GCAAAGGGCT GACCTCGGAA
GTGGTCGCGG CGGTAGCGAA GATTTGCTCC AACGCGGACC TGATCTACGG CGCGAAGAAA
ATGCCGGTAA TCAAAAAGGC CAATACCACC ATCGGTATTC CGGGCACCTT TAGCGCCCGT
TTGCAGCCGA ACGATACCCG TGACGACGTG CAAAGTATCG CTGCGCAAAT CTATGAAGGG
CTTTCCTTTG GGGTGGGCGA TGCGGTGATC GGCGTTAACC CGGTAACTGA CGACGTGGAA
AACTTAAGCC GCGTGCTGGA TACCATTTAT GGCGTGATCG ACAAATTCAA CATCCCAACT
CAGGGCTGCG TACTGGCGCA CGTCACCACC CAGATCGAAG CGATTCGTCG CGGCGCGCCT
GGCGGACTGA TTTTCCAGAG TATCTGTGGC AGCGAAAAAG GGCTGAAAGA GTTTGGCGTG
GAACTGGCGA TGCTCGACGA AGCGCGCGCA GTGGGCGCAG AGTTCAATCG TATCGCCGGG
GAAAACTGCC TCTACTTCGA AACCGGACAA GGCTCTGCGC TATCCGCTGG CGCTAACTTC
GGCGCTGACC AGGTGACGAT GGAAGCACGT AACTATGGGC TGGCGCGTCA TTACGATCCG
TTTATCGTCA ACACCGTGGT CGGCTTTATT GGGCCGGAGT ATCTCTACAA CGACCGCCAG
ATTATCCGTG CTGGCTTAGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCTATGGGC
TGTGACTGCT GCTACACCAA CCACGCTGAC GCTGACCAGA ACCTCAACGA AAACCTGATG
ATCCTGCTCG CCACCGCAGG CTGCAACTAC ATCATGGGGA TGCCGCTGGG TGATGACATC
ATGCTCAACT ACCAGACCAC CGCATTCCAC GATACCGCCA CTGTGCGTCA GTTACTCAAC
CTGCGTCCGT CACCGGAGTT TGAACGCTGG CTGGAAAGCA TGGGCATTAT GGCAAACGGT
CGCCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
 
Protein sequence
MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR 
NNPVIAYEDD CVTRLIQDDV NETAYNQIKN WSISELREYV LSDETSVDDI AFTRKGLTSE
VVAAVAKICS NADLIYGAKK MPVIKKANTT IGIPGTFSAR LQPNDTRDDV QSIAAQIYEG
LSFGVGDAVI GVNPVTDDVE NLSRVLDTIY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP
GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF
GADQVTMEAR NYGLARHYDP FIVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG
CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN
LRPSPEFERW LESMGIMANG RLTKRAGDPS LFF