Gene SeD_A2825 details

Gene Information

Locus tag: SeD_A2825
Symbol: eutB
ID: 6871044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2693495
End bp: 2694856
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 57%
IMG OID: 642785878
Product: ethanolamine ammonia-lyase large subunit
Protein accession: YP_002216528
Protein GI: 198245031
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4303] Ethanolamine ammonia-lyase, large subunit
TIGRFAM ID:
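
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2693495, 2694856)
print(length_bp, protein_length(length_bp))  # 1362 453
```

Under these assumptions the computed values match the reported Gene Length (1362 bp) and Protein Length (453 aa).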


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAA AGACCACATT GTTCGGCAAT GTTTATCAGT TTAAGGATGT AAAAGAGGTA 
CTGGCTAAAG CCAACGAACT GCGTTCGGGG GACGTGCTGG CCGGGGTTGC CGCGGCAAGT
TCGCAGGAGC GCGTAGCGGC AAAACAGGTA CTGTCGGAAA TGACGGTGGC GGATATCCGC
AACAATCCGG TGATTGCCTA TGAAGAGGAC TGCGTGACGC GCCTGATTCA GGACGACGTC
AACGAAACGG CCTATAACCG GATTAAAAAC TGGAGCATCA GCGAACTGCG CGAATACGTA
CTGAGCGATG AAACCTCCGT GGACGACATC GCGTTTATCC GCAAAGGGCT GACCTCCGAA
GTGGTGGCGG CAGTAGCGAA AATCTGCTCC AACGCTGACC TGATCTACGG CGGCAAGAAA
ATGCCGGTGA TCAAAAAAGC CAATACCACT ATCGGTATTC CGGGCACCTT TAGCTGCCGT
TTGCAGCCGA ACGATACCCG TGACGATGTA CAGAGTATCG CCGCGCAAAT CTACGAAGGG
CTTTCTTTCG GCGCAGGCGA TGCGGTGATC GGCGTTAACC CGGTGACCGA TGACGTGGAG
AACCTGACCC GCGTGCTCGA CACCGTTTAC GGCGTTATCG ATAAATTCAA TATTCCGACC
CAGGGCTGCG TGCTGGCGCA CGTCACCACC CAGATCGAAG CGATTCGTCG CGGCGCGCCG
GGCGGACTGA TTTTCCAGAG CATTTGCGGC AGCGAGAAGG GCTTAAAAGA GTTCGGCGTC
GAGCTGGCCA TGCTCGACGA AGCGCGGGCT GTGGGGGCGG AGTTCAACCG CATCGCCGGG
GAAAACTGCC TGTACTTTGA AACCGGGCAA GGGTCTGCGC TCTCCGCAGG CGCGAACTTT
GGTGCCGACC AGGTGACGAT GGAAGCGCGT AACTACGGGC TGGCGCGCCA CTACGATCCG
TTCCTGGTGA ACACCGTGGT GGGCTTTATC GGGCCGGAGT ATCTCTACAA CGACAGGCAG
ATTATCCGCG CCGGTCTCGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCGATGGGC
TGCGACTGCT GCTATACCAA CCATGCCGAC GCCGACCAGA ACCTTAACGA AAACCTGATG
ATTCTGCTCG CCACTGCCGG CTGTAACTAC ATCATGGGGA TGCCGCTCGG CGACGACATC
ATGCTCAACT ACCAGACCAC CGCTTTCCAC GATACCGCCA CCGTCCGTCA GTTGCTGAAT
TTACGGCCAT CGCCGGAGTT TGAACGCTGG CTGGAAACGA TGGGCATTAT GGCAAACGGT
CGTCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
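
The record reports a GC content of 57% for this gene. As a rough sketch of how such a figure could be derived from the sequence text, the helpers below strip the block spacing and count G and C bases. This assumes the reported value is the plain fraction of G and C over the gene length, which the record does not state; `normalize` and `gc_percent` are illustrative names.

```python
def normalize(seq_text: str) -> str:
    """Drop whitespace and upper-case a sequence laid out in spaced blocks."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first line of the gene sequence is used here; paste the full
# sequence text to compute the value for the whole gene.
fragment = normalize(
    "ATGAAACTAA AGACCACATT GTTCGGCAAT GTTTATCAGT TTAAGGATGT AAAAGAGGTA"
)
print(len(fragment), round(gc_percent(fragment), 1))
```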
 
Protein sequence
MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR 
NNPVIAYEED CVTRLIQDDV NETAYNRIKN WSISELREYV LSDETSVDDI AFIRKGLTSE
VVAAVAKICS NADLIYGGKK MPVIKKANTT IGIPGTFSCR LQPNDTRDDV QSIAAQIYEG
LSFGAGDAVI GVNPVTDDVE NLTRVLDTVY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP
GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF
GADQVTMEAR NYGLARHYDP FLVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG
CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN
LRPSPEFERW LETMGIMANG RLTKRAGDPS LFF
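
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons; since this gene starts with ATG, alternative starts are ignored here. `translate` and the table names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout). Translation
# table 11 uses the same codon-to-amino-acid assignments as the standard
# code; the difference lies in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds_text: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGAAACTAA AGACCACATT GTTCGGCAAT GTTTATCAGT TTAAGGATGT AAAAGAGGTA"
print(translate(first_line))  # MKLKTTLFGNVYQFKDVKEV
```

The output for the first line of the gene sequence matches the first 20 residues of the protein sequence; passing the full gene sequence text to `translate` extends the same check to the whole protein.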