Gene ECH74115_5655 details


Gene Information

Locus tag: ECH74115_5655
Symbol: aspA
ID: 6968656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5296542
End bp: 5297978
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 49%
IMG OID: 643389289
Product: aspartate ammonia-lyase
Protein accession: YP_002273685
Protein GI: 209399309
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1027] Aspartate ammonia-lyase
TIGRFAM ID: [TIGR00839] aspartate ammonia-lyase
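
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5296542
end_bp = 5297978

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1437 478
```

Both results match the Gene Length and Protein Length fields listed in the record.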


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0582808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.320226
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT 
GCCTACTATG GTGTTCACAC TCTGAGAGCG ATTGAAAACT TCTATATCAG CAACAACAAA
ATCAGTGATA TTCCTGAATT TGTTCGCGGT ATGGTAATGG TTAAAAAAGC CGCAGCTATG
GCAAACAAAG AGCTGCAAAC CATTCCTAAA AGTGTAGCGA ATGCCATCAT TGCCGCATGT
GATGAAGTCC TGAACAACGG AAAATGCATG GATCAGTTCC CGGTAGACGT CTACCAGGGC
GGCGCAGGTA CTTCCGTAAA CATGAACACC AACGAAGTGC TGGCCAATAT CGGTCTGGAA
CTGATGGGTC ACCAGAAAGG TGAATATCAG TACCTGAACC CGAACGACCA TGTTAACAAA
TGTCAGTCCA CTAACGACGC CTACCCGACC GGTTTCCGTA TCGCAGTTTA CTCTTCCCTG
ATTAAGCTGG TAGATGCGAT TAACCAATTG CGTGAAGGCT TTGAACGTAA AGCGGTCGAA
TTCCAGGACA TCCTGAAAAT GGGTCGTACC CAGCTGCAGG ACGCAGTACC GATGACCCTC
GGTCAGGAAT TCCGCGCTTT CAGCATCCTG CTGAAAGAAG AAGTGAAAAA CATCCAACGT
ACCGCTGAAC TGCTGCTGGA AGTTAACCTT GGCGCAACAG CAATCGGTAC TGGTCTGAAC
ACGCCGAAAG AGTACTCTCC GCTGGCAGTG AAAAAACTGG CTGAAGTTAC TGGCTTCCCA
TGCGTACCGG CTGAAGACCT GATCGAAGCG ACCTCTGACT GCGGCGCTTA TGTTATGGTT
CACGGAGCGC TGAAACGCCT GGCTGTGAAG ATGTCCAAAA TCTGTAACGA CCTGCGCTTG
CTCTCTTCTG GTCCACGTGC CGGCCTGAAC GAGATCAACC TGCCGGAACT GCAGGCGGGC
TCTTCCATCA TGCCAGCTAA AGTAAACCCG GTTGTTCCGG AAGTGGTTAA CCAGGTATGC
TTCAAAGTCA TCGGTAACGA CACCACTGTT ACCATGGCAG CAGAAGCAGG TCAGCTGCAG
TTGAACGTTA TGGAGCCGGT CATTGGCCAG GCTATGTTCG AATCCGTTCA CATTCTGACC
AACGCTTGCT ACAACCTGCT GGAAAAATGC ATTAACGGCA TCACTGCTAA CAAAGAAGTG
TGCGAAGGTT ACGTTTACAA CTCTATCGGT ATCGTTACTT ACCTGAACCC GTTCATCGGT
CACCACAACG GTGACATCGT GGGTAAAATC TGTGCCGAAA CCGGTAAGAG TGTACGTGAA
GTCGTTCTGG AACGCGGTCT GTTGACTGAA GCGGAACTTG ACGATATTTT CTCCGTACAG
AATCTGATGC ACCCGGCTTA CAAAGCAAAA CGCTATACTG ATGAAAGCGA ACAGTAA
 
Protein sequence
MSNNIRIEED LLGTREVPAD AYYGVHTLRA IENFYISNNK ISDIPEFVRG MVMVKKAAAM 
ANKELQTIPK SVANAIIAAC DEVLNNGKCM DQFPVDVYQG GAGTSVNMNT NEVLANIGLE
LMGHQKGEYQ YLNPNDHVNK CQSTNDAYPT GFRIAVYSSL IKLVDAINQL REGFERKAVE
FQDILKMGRT QLQDAVPMTL GQEFRAFSIL LKEEVKNIQR TAELLLEVNL GATAIGTGLN
TPKEYSPLAV KKLAEVTGFP CVPAEDLIEA TSDCGAYVMV HGALKRLAVK MSKICNDLRL
LSSGPRAGLN EINLPELQAG SSIMPAKVNP VVPEVVNQVC FKVIGNDTTV TMAAEAGQLQ
LNVMEPVIGQ AMFESVHILT NACYNLLEKC INGITANKEV CEGYVYNSIG IVTYLNPFIG
HHNGDIVGKI CAETGKSVRE VVLERGLLTE AELDDIFSVQ NLMHPAYKAK RYTDESEQ
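
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of how both could be recomputed from the 10-base grouped text shown here. The function names (`clean`, `gc_percent`, `translate`) are hypothetical helpers, not part of any database tool. The amino-acid assignments are those of the standard code, which table 11 shares; the sketch does not handle table 11's alternative start codons, which is not an issue for this gene because it starts with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the standard
# amino-acid assignments (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTCAAACA ACATTCGTAT CGAAGAAGAT"))  # MSNNIRIEED
```

The printed residues agree with the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.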