Gene EcolC_3873 details


Gene Information

Locus tag: EcolC_3873
Symbol: aspA
ID: 6065160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4228747
End bp: 4230183
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 49%
IMG OID: 641603288
Product: aspartate ammonia-lyase
Protein accession: YP_001726804
Protein GI: 170021850
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1027] Aspartate ammonia-lyase
TIGRFAM ID: [TIGR00839] aspartate ammonia-lyase
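
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. The record itself does not state either convention.

```python
def check_cds(start_bp: int, end_bp: int, gene_length_bp: int, protein_length_aa: int) -> dict:
    """Cross-check the coordinate and length fields of a CDS record.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and one stop codon is not counted in the protein length.
    """
    span = end_bp - start_bp + 1   # inclusive coordinate span
    codons = span // 3             # whole codons in the span
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": span % 3 == 0,
        "protein_length_consistent": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields.
result = check_cds(4228747, 4230183, 1437, 478)
print(result)
```

Under these assumptions all three checks hold for this record: the span is 1437 bp, which is 479 codons, or 478 amino acids plus one stop codon.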


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT 
GCCTACTATG GTGTTCACAC TCTGAGAGCG ATTGAAAACT TCTATATCAG CAACAACAAA
ATCAGTGATA TTCCTGAATT TGTTCGCGGT ATGGTAATGG TTAAAAAAGC CGCAGCTATG
GCAAACAAAG AGCTGCAAAC CATTCCTAAA AGTGTAGCGA ATGCCATCAT TGCCGCATGT
GATGAAGTCC TGAACAACGG AAAATGCATG GATCAGTTCC CGGTAGACGT CTACCAGGGC
GGCGCAGGTA CTTCCGTAAA CATGAACACC AACGAAGTGC TGGCCAATAT CGGTCTGGAA
CTGATGGGTC ACCAAAAAGG TGAATATCAG TACCTGAACC CGAACGACCA TGTTAACAAA
TGTCAGTCCA CTAACGACGC CTACCCGACC GGTTTCCGTA TCGCAGTTTA CTCTTCCCTG
ATTAAGCTGG TAGATGCGAT TAACCAACTG CGTGAAGGCT TTGAACGTAA AGCTGTCGAA
TTCCAGGACA TCCTGAAAAT GGGTCGTACC CAGCTGCAGG ACGCAGTACC GATGACCCTC
GGTCAGGAAT TCCGCGCTTT CAGCATCCTG CTGAAAGAAG AAGTGAAAAA CATCCAACGT
ACCGCTGAAC TGCTGCTGGA AGTTAACCTT GGTGCAACAG CAATCGGTAC TGGTCTGAAC
ACGCCGAAAG AGTACTCTCC GCTGGCAGTG AAAAAACTGG CTGAAGTTAC TGGCTTCCCA
TGCGTACCGG CTGAAGACCT GATCGAAGCG ACCTCTGACT GCGGCGCTTA TGTTATGGTT
CACGGCGCGC TGAAACGCCT GGCTGTGAAG ATGTCCAAAA TCTGTAACGA CCTGCGCTTG
CTCTCTTCAG GCCCACGTGC CGGCCTGAAC GAGATCAACC TGCCGGAACT GCAGGCGGGC
TCTTCCATCA TGCCAGCTAA AGTAAACCCG GTTGTTCCGG AAGTGGTTAA CCAGGTATGC
TTCAAAGTCA TCGGTAACGA CACCACTGTT ACCATGGCAG CAGAAGCAGG TCAGCTGCAG
TTGAACGTTA TGGAGCCGGT CATTGGCCAG GCCATGTTCG AATCCGTTCA CATTCTGACC
AACGCTTGCT ACAACCTGCT GGAAAAATGC ATTAACGGCA TCACTGCTAA CAAAGAAGTG
TGCGAAGGTT ACGTTTACAA CTCTATCGGT ATCGTTACTT ACCTGAACCC GTTCATCGGT
CACCACAACG GTGACATCGT GGGTAAAATC TGTGCCGAAA CCGGTAAGAG TGTACGTGAA
GTCGTTCTGG AACGCGGTCT GTTGACTGAA GCGGAACTTG ACGATATTTT CTCCGTACAG
AATCTGATGC ACCCGGCTTA CAAAGCAAAA CGCTATACTG ATGAAAGCGA ACAGTAA
 
Protein sequence
MSNNIRIEED LLGTREVPAD AYYGVHTLRA IENFYISNNK ISDIPEFVRG MVMVKKAAAM 
ANKELQTIPK SVANAIIAAC DEVLNNGKCM DQFPVDVYQG GAGTSVNMNT NEVLANIGLE
LMGHQKGEYQ YLNPNDHVNK CQSTNDAYPT GFRIAVYSSL IKLVDAINQL REGFERKAVE
FQDILKMGRT QLQDAVPMTL GQEFRAFSIL LKEEVKNIQR TAELLLEVNL GATAIGTGLN
TPKEYSPLAV KKLAEVTGFP CVPAEDLIEA TSDCGAYVMV HGALKRLAVK MSKICNDLRL
LSSGPRAGLN EINLPELQAG SSIMPAKVNP VVPEVVNQVC FKVIGNDTTV TMAAEAGQLQ
LNVMEPVIGQ AMFESVHILT NACYNLLEKC INGITANKEV CEGYVYNSIG IVTYLNPFIG
HHNGDIVGKI CAETGKSVRE VVLERGLLTE AELDDIFSVQ NLMHPAYKAK RYTDESEQ
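
The sequences above are printed in space-separated blocks of 10 residues, so they need cleaning before any computation. The following is a minimal sketch of that step, with a GC-fraction helper that could be compared against the reported GC content once the full gene sequence is pasted in. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues, and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT"
last_line = "AATCTGATGC ACCCGGCTTA CAAAGCAAAA CGCTATACTG ATGAAAGCGA ACAGTAA"

print(clean_sequence(first_line)[:3])   # first codon of the CDS
print(clean_sequence(last_line)[-3:])   # last codon of the CDS
```

For this record the first codon is ATG and the last is TAA, which agrees with a complete CDS that ends in a stop codon.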