Gene EcDH1_3853 details

Gene Information

Locus tag: EcDH1_3853
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4146254
End bp: 4147690
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: aspartate ammonia-lyase
Protein accession: ACX41455
Protein GI: 260451033
COG category:
COG ID:
TIGRFAM ID:
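
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4146254
end_bp = 4147690
gene_length_bp = 1437
protein_length_aa = 478

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon, which does not contribute an amino acid.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # span matches the Gene Length field
print(span % 3 == 0)                  # length is a multiple of three
print(residues == protein_length_aa)  # matches the Protein Length field
```

Under these assumptions all three checks print True: 1437 bp is 479 codons, of which 478 encode residues.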


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT 
GCCTACTATG GTGTTCACAC TCTGAGAGCG ATTGAAAACT TCTATATCAG CAACAACAAA
ATCAGTGATA TTCCTGAATT TGTTCGCGGT ATGGTAATGG TTAAAAAAGC CGCAGCTATG
GCAAACAAAG AGCTGCAAAC CATTCCTAAA AGTGTAGCGA ATGCCATCAT TGCCGCATGT
GATGAAGTCC TGAACAACGG AAAATGCATG GATCAGTTCC CGGTAGACGT CTACCAGGGC
GGCGCAGGTA CTTCCGTAAA CATGAACACC AACGAAGTGC TGGCCAATAT CGGTCTGGAA
CTGATGGGTC ACCAAAAAGG TGAATATCAG TACCTGAACC CGAACGACCA TGTTAACAAA
TGTCAGTCCA CTAACGACGC CTACCCGACC GGTTTCCGTA TCGCAGTTTA CTCTTCCCTG
ATTAAGCTGG TAGATGCGAT TAACCAACTG CGTGAAGGCT TTGAACGTAA AGCTGTCGAA
TTCCAGGACA TCCTGAAAAT GGGTCGTACC CAGCTGCAGG ACGCAGTACC GATGACCCTC
GGTCAGGAAT TCCGCGCTTT CAGCATCCTG CTGAAAGAAG AAGTGAAAAA CATCCAACGT
ACCGCTGAAC TGCTGCTGGA AGTTAACCTT GGTGCAACAG CAATCGGTAC TGGTCTGAAC
ACGCCGAAAG AGTACTCTCC GCTGGCAGTG AAAAAACTGG CTGAAGTTAC TGGCTTCCCA
TGCGTACCGG CTGAAGACCT GATCGAAGCG ACCTCTGACT GCGGCGCTTA TGTTATGGTT
CACGGCGCGC TGAAACGCCT GGCTGTGAAG ATGTCCAAAA TCTGTAACGA CCTGCGCTTG
CTCTCTTCAG GCCCACGTGC CGGCCTGAAC GAGATCAACC TGCCGGAACT GCAGGCGGGC
TCTTCCATCA TGCCAGCTAA AGTAAACCCG GTTGTTCCGG AAGTGGTTAA CCAGGTATGC
TTCAAAGTCA TCGGTAACGA CACCACTGTT ACCATGGCAG CAGAAGCAGG TCAGCTGCAG
TTGAACGTTA TGGAGCCGGT CATTGGCCAG GCCATGTTCG AATCCGTTCA CATTCTGACC
AACGCTTGCT ACAACCTGCT GGAAAAATGC ATTAACGGCA TCACTGCTAA CAAAGAAGTG
TGCGAAGGTT ACGTTTACAA CTCTATCGGT ATCGTTACTT ACCTGAACCC GTTCATCGGT
CACCACAACG GTGACATCGT GGGTAAAATC TGTGCCGAAA CCGGTAAGAG TGTACGTGAA
GTCGTTCTGG AACGCGGTCT GTTGACTGAA GCGGAACTTG ACGATATTTT CTCCGTACAG
AATCTGATGC ACCCGGCTTA CAAAGCAAAA CGCTATACTG ATGAAAGCGA ACAGTAA
 
Protein sequence
MSNNIRIEED LLGTREVPAD AYYGVHTLRA IENFYISNNK ISDIPEFVRG MVMVKKAAAM 
ANKELQTIPK SVANAIIAAC DEVLNNGKCM DQFPVDVYQG GAGTSVNMNT NEVLANIGLE
LMGHQKGEYQ YLNPNDHVNK CQSTNDAYPT GFRIAVYSSL IKLVDAINQL REGFERKAVE
FQDILKMGRT QLQDAVPMTL GQEFRAFSIL LKEEVKNIQR TAELLLEVNL GATAIGTGLN
TPKEYSPLAV KKLAEVTGFP CVPAEDLIEA TSDCGAYVMV HGALKRLAVK MSKICNDLRL
LSSGPRAGLN EINLPELQAG SSIMPAKVNP VVPEVVNQVC FKVIGNDTTV TMAAEAGQLQ
LNVMEPVIGQ AMFESVHILT NACYNLLEKC INGITANKEV CEGYVYNSIG IVTYLNPFIG
HHNGDIVGKI CAETGKSVRE VVLERGLLTE AELDDIFSVQ NLMHPAYKAK RYTDESEQ
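
The sequences above are printed in space-separated groups of ten, so they need to be joined before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. The function names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon-to-amino-acid assignment (also used by translation
# table 11), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a
               for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence and the first two protein groups above.
first_line = ("ATGTCAAACA ACATTCGTAT CGAAGAAGAT "
              "CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT")
print(translate(first_line))  # MSNNIRIEEDLLGTREVPAD
print(translate(first_line) == clean("MSNNIRIEED LLGTREVPAD"))
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison with the Protein sequence and GC content fields of the record.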