Gene ECH74115_2543 details


Gene Information

Locus tag: ECH74115_2543
Symbol: sdaA
ID: 6968324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2405055
End bp: 2406419
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 54%
IMG OID: 643386411
Product: L-serine ammonia-lyase 1
Protein accession: YP_002270893
Protein GI: 209400418
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1760] L-serine deaminase
TIGRFAM ID: [TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.263173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTAGTC TATTCGACAT GTTTAAGGTG GGGATTGGTC CCTCATCTTC CCATACCGTA 
GGGCCTATGA AGGCGGGTAA ACAGTTCGTC GATGATCTGG TCGAAAAAGG CTTACTGGAT
AGCGTTACTC GCGTTGCCGT GGACGTTTAT GGTTCACTGT CGCTGACGGG TAAAGGCCAC
CACACCGATA TCGCCATTAT TATGGGTCTT GCAGGTAACG AACCTGCCAC CGTGGATATC
GACAGTATTC CCGGTTTTAT TCGCGACGTA GAAGAGCGCG AACGTCTGCT GCTGGCACAG
GGACGGCATG AAGTGGATTT CCCGCGCGAC AACGGGATGC GTTTTCATAA CGGCAACCTG
CCGCTGCATG AAAACGGTAT GCAAATCCAC GCCTATAACG GCGATGAAGT CGTCTACAGC
AAAACTTATT ATTCCATCGG CGGCGGTTTT ATCGTCGATG AAGAACACTT TGGTCAGGAT
ACTGCCAACG AAGTGAGCGT GCCGTATCCG TTCAAATCTG CCACCGAACT GCTCGCGTAC
TGTAATGAAA CCGGCTATTC GCTGTCTGGT CTCGCTATGC AGAACGAACT GGCGCTGCAC
AGCAAGAAAG AGATCGACGA GTATTTCGCG CATGTCTGGC AAACCATGCA GGCATGTATC
GATCGCGGGA TGAACACCGA AGGTGTACTG CCAGGCCCGC TGCGCGTGCC ACGTCGTGCG
TCTGCCCTGC GCCGGATGCT GGTTTCCAGC GATAAACTGT CTAACGATCC GATGAATGTC
ATTGACTGGG TAAACATGTT TGCGCTGGCA GTTAACGAAG AAAACGCCGC CGGTGGTCGT
GTGGTAACTG CGCCAACCAA CGGTGCCTGC GGTATCGTTC CGGCAGTGCT GGCTTACTAT
GACCACTTTA TTGAATCGGT CAGCCCGGAC ATCTATACCC GTTACTTTAT GGCAGCGGGC
GCGATTGGTG CATTGTATAA AATGAACGCC TCTATTTCCG GTGCGGAAGT TGGTTGCCAG
GGCGAAGTGG GTGTTGCCTG TTCAATGGCT GCTGCGGGTC TTGCAGAACT GCTGGGCGGT
AGCCCGGAAC AGGTTTGCGT GGCGGCGGAA ATTGGCATGG AACACAACCT TGGTTTAACC
TGCGACCCGG TTGCAGGGCA GGTTCAGGTG CCGTGCATTG AGCGTAATGC CATTGCCTCT
GTGAAGGCGA TTAACGCCGC ACGGATGGCT CTGCGCCGCA CCAGTGCACC GCGCGTCTCG
CTGGATAAGG TCATCGAAAC GATGTACGAA ACCGGTAAGG ACATGAACGC CAAATACCGC
GAAACCTCAC GCGGTGGTCT GGCAATCAAA GTCCAGTGTG ACTAA
 
Protein sequence
MISLFDMFKV GIGPSSSHTV GPMKAGKQFV DDLVEKGLLD SVTRVAVDVY GSLSLTGKGH 
HTDIAIIMGL AGNEPATVDI DSIPGFIRDV EERERLLLAQ GRHEVDFPRD NGMRFHNGNL
PLHENGMQIH AYNGDEVVYS KTYYSIGGGF IVDEEHFGQD TANEVSVPYP FKSATELLAY
CNETGYSLSG LAMQNELALH SKKEIDEYFA HVWQTMQACI DRGMNTEGVL PGPLRVPRRA
SALRRMLVSS DKLSNDPMNV IDWVNMFALA VNEENAAGGR VVTAPTNGAC GIVPAVLAYY
DHFIESVSPD IYTRYFMAAG AIGALYKMNA SISGAEVGCQ GEVGVACSMA AAGLAELLGG
SPEQVCVAAE IGMEHNLGLT CDPVAGQVQV PCIERNAIAS VKAINAARMA LRRTSAPRVS
LDKVIETMYE TGKDMNAKYR ETSRGGLAIK VQCD
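
The record's length fields can be cross-checked with simple arithmetic: a 1365 bp CDS holds 1365 / 3 = 455 codons, and dropping the terminal stop codon leaves the 454 aa listed above. The Python sketch below illustrates that check, plus a GC-fraction helper for sequences printed in space-separated blocks of 10. The function names (clean, gc_fraction, expected_protein_length) are hypothetical helpers introduced here for illustration and are not part of the source database. It assumes an unspliced CDS that ends in a single stop codon.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First row of the gene sequence as printed in the record.
first_row = "GTGATTAGTC TATTCGACAT GTTTAAGGTG GGGATTGGTC CCTCATCTTC CCATACCGTA"

print(len(clean(first_row)))           # number of bases in the first row
print(expected_protein_length(1365))   # protein length implied by the gene length
```

To check the listed GC content, pass the full gene sequence to gc_fraction and compare the result with the 54% in the record. Note that the gene starts with GTG, which is read as methionine when it serves as the initiation codon, consistent with the protein sequence beginning with M.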