Gene ECH74115_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3597 
SymboldsdA 
ID6967579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3311455 
End bp3312783 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content51% 
IMG OID643387394 
ProductD-serine dehydratase 
Protein accessionYP_002271853 
Protein GI209396399 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3048] D-serine dehydratase 
TIGRFAM ID[TIGR02035] D-serine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00277726 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAACG CTAAAATGAA CTCGCTCATC GCCCAGTATC CGTTGGTAAA GGATCTGGTT 
GCTCTTAAAG AAACCACCTG GTTTAATCCT GGCACGACCT CATTGGCTGA AGGTTTACCT
TATGTTGGCC TGACCGAACA GGATGTTCAG GACGCCCATG CGCGCTTATC CCGTTTTGCA
CCCTATCTGG CAAAAGCATT TCCTGAAACT GCTGCCACTG GGGGGATTAT TGAATCAGAA
CTGGTTGCCA TTCCGGCTAT GCAAAAACGG CTGGAAAAGG AATATCAACA ACCGATCAGC
GGGCAACTGT TACTGAAAAA AGATAGCCAT TTGCCCATTT CCGGCTCCAT AAAAGCACGC
GGCGGGATTT ATGAAGTCCT GGCACATGCA GAAAAACTGG CTCTGGAAGC GGGGTTGCTG
ACGCTTGAAG ATGACTACAG CAAACTGCTT TCTCCGGAGT TTAAACAGTT CTTTAGCCAG
TACAGCATTG CTGTGGGCTC AACCGGAAAT CTGGGGTTAT CAATCGGCAT TATGAGCGCC
CGCATTGGCT TTAAGGTGAC AGTGCATATG TCTGCTGATG CCCGGGCATG GAAAAAAGCG
AAACTGCGCA GCCATGGCGT TACGGTCGTG GAATATGAGC AAGATTATGG TGTTGCCGTC
GAGGAAGGAC GTAAAGCAGC GCAGTCTGAC CCGAACTGTT TCTTTATTGA TGACGAGAAT
TCCCGCACGT TGTTCCTTGG GTATTCCGTC GCAGGCCAGC GTCTTAAAGC GCAATTTGCC
GAGCAAGGTC GTATTGTCGA TGCTGATAAC CCTCTGTTTG TCTATCTGCC GTGTGGTGTT
GGCGGTGGTC CTGGTGGCGT CGCATTCGGA CTTAAGCTGG CGTTTGGCGA TCATGTTCAC
TGCTTTTTTG CCGAACCAAC GCACTCCCCT TGTATGTTGT TAGGCGTCCA TACAGGATTA
CACGATCAGA TTTCTGTTCA GGATATTGGT ATCGACAACC TTACCGCAGC GGATGGCCTT
GCAGTTGGTC GCGCATCAGG CTTTGTCGGG CGGGCAATGG AGCGTCTGCT GGATGGCTTC
TATACCCTTA GCGATCAAAC CATGTATGAC ATGCTTGGCT GGCTGGCGCA GGAAGAAGGT
ATTCGTCTTG AACCTTCGGC ACTGGCGGGT ATGGCCGGAC CTCAGCGCGT GTGTGCATCA
GTAAGTTACC AACAGATGCA CGGTTTCAGC GCAGAACAAC TGCGTAATGC CACTCATCTG
GTGTGGGCGA CGGGAGGTGG AATGGTGCCG GAAGAAGAGA TGGAGCAATA TCTGGCAAAA
GGCCGTTAA
 
Protein sequence
MENAKMNSLI AQYPLVKDLV ALKETTWFNP GTTSLAEGLP YVGLTEQDVQ DAHARLSRFA 
PYLAKAFPET AATGGIIESE LVAIPAMQKR LEKEYQQPIS GQLLLKKDSH LPISGSIKAR
GGIYEVLAHA EKLALEAGLL TLEDDYSKLL SPEFKQFFSQ YSIAVGSTGN LGLSIGIMSA
RIGFKVTVHM SADARAWKKA KLRSHGVTVV EYEQDYGVAV EEGRKAAQSD PNCFFIDDEN
SRTLFLGYSV AGQRLKAQFA EQGRIVDADN PLFVYLPCGV GGGPGGVAFG LKLAFGDHVH
CFFAEPTHSP CMLLGVHTGL HDQISVQDIG IDNLTAADGL AVGRASGFVG RAMERLLDGF
YTLSDQTMYD MLGWLAQEEG IRLEPSALAG MAGPQRVCAS VSYQQMHGFS AEQLRNATHL
VWATGGGMVP EEEMEQYLAK GR