Gene SNSL254_A2651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2651 
SymboleutB 
ID6486807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2565303 
End bp2566664 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content57% 
IMG OID642737984 
Productethanolamine ammonia-lyase large subunit 
Protein accessionYP_002041718 
Protein GI194443898 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4303] Ethanolamine ammonia-lyase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAA AGACCACATT GTTCGGCAAT GTTTATCAGT TTAAGGATGT AAAAGAGGTA 
CTGGCTAAAG CCAACGAACT GCGTTCGGGG GATGTGCTGG CCGGGGTTGC CGCGGCAAGT
TCGCAGGAGC GCGTAGCGGC AAAACAGGTA CTGTCGGAAA TGACGGTGGC GGATATCCGC
AACAATCCGG TGATTGCCTA TGAAGAGGAC TGCGTGACGC GCCTGATTCA GGATGACGTC
AACGAAACGG CCTATAACCG GATTAAAAAC TGGAGCATCA GCGAACTGCG CGAATACGTA
CTGAGCGATG AAACCTCCGT GGACGACATC GCGTTTACCC GCAAAGGGCT GACCTCCGAA
GTGGTGGCGG CAGTAGCGAA AATCTGCTCC AACGCTGACC TGATCTACGG CGGCAAGAAA
ATGCCGGTGA TCAAAAAAGC CAATACCACC ATCGGTATTC CGGGCACCTT TAGCTGCCGT
TTGCAGCCGA ACGATACCCG TGACGATGTA CAGAGTATCG CCGCGCAAAT CTACGAAGGG
CTTTCTTTCG GCGCAGGCGA TGCGGTGATC GGCGTTAACC CGGTGACCGA TGACGTGGAG
AACCTGACCC GCGTGCTCGA CACCGTTTAC GGCGTTATCG ATAAATTCAA TATTCCGACC
CAGGGCTGCG TGCTGGCGCA TGTCACCACC CAGATTGAAG CGATTCGCCG CGGCGCGCCG
GGCGGACTGA TTTTCCAGAG CATTTGCGGC AGCGAGAAGG GCTTAAAAGA GTTCGGCGTC
GAGCTGGCTA TGCTCGATGA AGCGCGGGCC GTGGGGGCGG AGTTCAACCG CATCGCCGGG
GAAAACTGCC TGTACTTTGA AACCGGACAG GGCTCGGCGC TGTCGGCTGG CGCGAACTTT
GGTGCCGACC AGGTGACAAT GGAAGCGCGT AACTACGGGC TGGCGCGCCA CTACGATCCG
TTCCTGGTGA ACACCGTGGT GGGCTTTATC GGGCCGGAAT ATCTCTACAA CGACAGGCAG
ATTATCCGCG CCGGTCTCGA AGATCACTTT ATGGGCAAGC TGAGCGGCAT CTCGATGGGC
TGCGACTGCT GCTATACCAA CCATGCCGAC GCCGACCAGA ACCTTAACGA AAACCTGATG
ATTCTGCTCG CCACTGCCGG CTGTAACTAC ATCATGGGGA TGCCGCTCGG CGACGACATC
ATGCTCAACT ACCAGACCAC CGCTTTCCAC GATACCGCCA CCGTCCGTCA GTTGCTGAAT
TTACGGCCGT CGCCGGAGTT TGAACGCTGG CTGGAAACGA TGGGCATTAT GGCAAACGGT
CGTCTGACCA AACGGGCGGG CGATCCGTCA CTGTTCTTCT GA
 
Protein sequence
MKLKTTLFGN VYQFKDVKEV LAKANELRSG DVLAGVAAAS SQERVAAKQV LSEMTVADIR 
NNPVIAYEED CVTRLIQDDV NETAYNRIKN WSISELREYV LSDETSVDDI AFTRKGLTSE
VVAAVAKICS NADLIYGGKK MPVIKKANTT IGIPGTFSCR LQPNDTRDDV QSIAAQIYEG
LSFGAGDAVI GVNPVTDDVE NLTRVLDTVY GVIDKFNIPT QGCVLAHVTT QIEAIRRGAP
GGLIFQSICG SEKGLKEFGV ELAMLDEARA VGAEFNRIAG ENCLYFETGQ GSALSAGANF
GADQVTMEAR NYGLARHYDP FLVNTVVGFI GPEYLYNDRQ IIRAGLEDHF MGKLSGISMG
CDCCYTNHAD ADQNLNENLM ILLATAGCNY IMGMPLGDDI MLNYQTTAFH DTATVRQLLN
LRPSPEFERW LETMGIMANG RLTKRAGDPS LFF