Gene EcSMS35_4608 details

Gene Information

Locus tag: EcSMS35_4608
Symbol: aspA
ID: 6145593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4711858
End bp: 4713294
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 49%
IMG OID: 641619424
Product: aspartate ammonia-lyase
Protein accession: YP_001746535
Protein GI: 170680757
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1027] Aspartate ammonia-lyase
TIGRFAM ID: [TIGR00839] aspartate ammonia-lyase
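
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state this, but the numbers agree with it) and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; function names are illustrative only.
START_BP = 4711858
END_BP = 4713294
GENE_LENGTH_BP = 1437
PROTEIN_LENGTH_AA = 478


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1437
print(protein_length(GENE_LENGTH_BP))  # 478
```

Both results match the Gene Length and Protein Length fields, which suggests the record's coordinates follow the inclusive convention assumed here.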


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.21143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.755736
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT 
GCCTACTATG GTGTTCACAC TCTGAGAGCG ATTGAAAACT TCTATATCAG CAACAACAAA
ATCAGTGATA TTCCTGAATT TGTTCGCGGT ATGGTAATGG TTAAAAAAGC CGCAGCTATG
GCAAACAAAG AGCTGCAAAC CATTCCTAAA AGTGTAGCGA ATGCCATCAT TGCCGCATGT
GATGAAGTCC TGAACAACGG AAAATGCATG GATCAGTTCC CGGTAGACGT CTACCAGGGC
GGCGCAGGTA CTTCCGTAAA CATGAACACC AACGAAGTGC TGGCCAATAT CGGTCTGGAA
CTGATGGGTC ACCAGAAAGG TGAATATCAG TACCTGAACC CGAACGACCA TGTTAACAAA
TGTCAGTCCA CTAACGACGC CTACCCGACC GGTTTCCGTA TCGCAGTTTA CTCTTCCCTG
ATTAAGCTGG TAGATGCGAT TAACCAACTG CGTGAAGGCT TTGAACGTAA AGCTGTCGAA
TTCCAGGACA TCCTGAAAAT GGGTCGTACA CAGCTGCAGG ACGCAGTACC GATGACCCTC
GGCCAGGAAT TCCGCGCTTT CAGCATCCTG CTGAAAGAAG AAGTTAAAAA CATCCAACGT
ACCGCTGAAC TGCTGCTGGA AGTTAACCTT GGCGCAACAG CAATCGGTAC TGGTCTGAAC
ACGCCGAAAG AGTACTCTCC GCTGGCAGTG AAAAAACTGG CTGAAGTCAC TGGCTTCCCA
TGCGTACCGG CTGAAGACCT GATCGAAGCG ACCTCTGACT GCGGCGCTTA TGTTATGGTT
CACGGCGCGC TGAAACGCCT GGCTGTGAAG ATGTCCAAAA TCTGTAACGA CCTGCGCTTG
CTCTCTTCTG GCCCACGTGC CGGCCTGAAC GAGATCAACC TGCCGGAACT GCAGGCGGGC
TCTTCCATCA TGCCAGCTAA AGTAAACCCG GTTGTTCCGG AAGTGGTTAA CCAGGTATGC
TTCAAAGTCA TCGGTAACGA CACCACTGTT ACCATGGCAG CAGAAGCAGG TCAGCTGCAG
TTGAACGTTA TGGAGCCGGT CATTGGCCAG GCCATGTTTG AATCCGTTCA CATTCTGACC
AACGCTTGCT ACAACCTGCT GGAAAAATGC ATTAACGGCA TCACTGCTAA CAAAGAAGTG
TGCGAAGGTT ACGTTTACAA CTCTATCGGT ATCGTTACTT ACCTGAACCC GTTCATCGGT
CACCACAACG GTGACATCGT GGGTAAAATC TGTGCCGAAA CCGGTAAGAG TGTACGTGAA
GTCGTTCTGG AACGCGGTCT GTTGACTGAA GCGGAACTTG ACGATATTTT CTCCGTACAG
AATCTGATGC ACCCGGCTTA CAAAGCAAAA CGCTATACTG ATGAAAGCGA ACAGTAA
 
Protein sequence
MSNNIRIEED LLGTREVPAD AYYGVHTLRA IENFYISNNK ISDIPEFVRG MVMVKKAAAM 
ANKELQTIPK SVANAIIAAC DEVLNNGKCM DQFPVDVYQG GAGTSVNMNT NEVLANIGLE
LMGHQKGEYQ YLNPNDHVNK CQSTNDAYPT GFRIAVYSSL IKLVDAINQL REGFERKAVE
FQDILKMGRT QLQDAVPMTL GQEFRAFSIL LKEEVKNIQR TAELLLEVNL GATAIGTGLN
TPKEYSPLAV KKLAEVTGFP CVPAEDLIEA TSDCGAYVMV HGALKRLAVK MSKICNDLRL
LSSGPRAGLN EINLPELQAG SSIMPAKVNP VVPEVVNQVC FKVIGNDTTV TMAAEAGQLQ
LNVMEPVIGQ AMFESVHILT NACYNLLEKC INGITANKEV CEGYVYNSIG IVTYLNPFIG
HHNGDIVGKI CAETGKSVRE VVLERGLLTE AELDDIFSVQ NLMHPAYKAK RYTDESEQ
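
The sequence blocks above are printed in groups of ten characters, so they need to be stripped of whitespace before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction helper and a codon-by-codon translation. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); assumed to
# apply here because table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = clean(
    "ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT"
)
print(translate(first_line))  # MSNNIRIEEDLLGTREVPAD
```

The printed peptide matches the first twenty residues of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.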