Gene Information |
Locus tag | ECH74115_3597 |
Symbol | dsdA |
ID | 6967579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3311455 |
End bp | 3312783 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387394 |
Product | D-serine dehydratase |
Protein accession | YP_002271853 |
Protein GI | 209396399 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3048] D-serine dehydratase |
TIGRFAM ID | [TIGR02035] D-serine ammonia-lyase |
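
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length = gene_length(3311455, 3312783)
print(length, protein_length(length))  # 1329 442
```

Under these assumptions the computed values agree with the listed Gene Length (1329 bp) and Protein Length (442 aa).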
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00277726 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAACG CTAAAATGAA CTCGCTCATC GCCCAGTATC CGTTGGTAAA GGATCTGGTT GCTCTTAAAG AAACCACCTG GTTTAATCCT GGCACGACCT CATTGGCTGA AGGTTTACCT TATGTTGGCC TGACCGAACA GGATGTTCAG GACGCCCATG CGCGCTTATC CCGTTTTGCA CCCTATCTGG CAAAAGCATT TCCTGAAACT GCTGCCACTG GGGGGATTAT TGAATCAGAA CTGGTTGCCA TTCCGGCTAT GCAAAAACGG CTGGAAAAGG AATATCAACA ACCGATCAGC GGGCAACTGT TACTGAAAAA AGATAGCCAT TTGCCCATTT CCGGCTCCAT AAAAGCACGC GGCGGGATTT ATGAAGTCCT GGCACATGCA GAAAAACTGG CTCTGGAAGC GGGGTTGCTG ACGCTTGAAG ATGACTACAG CAAACTGCTT TCTCCGGAGT TTAAACAGTT CTTTAGCCAG TACAGCATTG CTGTGGGCTC AACCGGAAAT CTGGGGTTAT CAATCGGCAT TATGAGCGCC CGCATTGGCT TTAAGGTGAC AGTGCATATG TCTGCTGATG CCCGGGCATG GAAAAAAGCG AAACTGCGCA GCCATGGCGT TACGGTCGTG GAATATGAGC AAGATTATGG TGTTGCCGTC GAGGAAGGAC GTAAAGCAGC GCAGTCTGAC CCGAACTGTT TCTTTATTGA TGACGAGAAT TCCCGCACGT TGTTCCTTGG GTATTCCGTC GCAGGCCAGC GTCTTAAAGC GCAATTTGCC GAGCAAGGTC GTATTGTCGA TGCTGATAAC CCTCTGTTTG TCTATCTGCC GTGTGGTGTT GGCGGTGGTC CTGGTGGCGT CGCATTCGGA CTTAAGCTGG CGTTTGGCGA TCATGTTCAC TGCTTTTTTG CCGAACCAAC GCACTCCCCT TGTATGTTGT TAGGCGTCCA TACAGGATTA CACGATCAGA TTTCTGTTCA GGATATTGGT ATCGACAACC TTACCGCAGC GGATGGCCTT GCAGTTGGTC GCGCATCAGG CTTTGTCGGG CGGGCAATGG AGCGTCTGCT GGATGGCTTC TATACCCTTA GCGATCAAAC CATGTATGAC ATGCTTGGCT GGCTGGCGCA GGAAGAAGGT ATTCGTCTTG AACCTTCGGC ACTGGCGGGT ATGGCCGGAC CTCAGCGCGT GTGTGCATCA GTAAGTTACC AACAGATGCA CGGTTTCAGC GCAGAACAAC TGCGTAATGC CACTCATCTG GTGTGGGCGA CGGGAGGTGG AATGGTGCCG GAAGAAGAGA TGGAGCAATA TCTGGCAAAA GGCCGTTAA
|
Protein sequence | MENAKMNSLI AQYPLVKDLV ALKETTWFNP GTTSLAEGLP YVGLTEQDVQ DAHARLSRFA PYLAKAFPET AATGGIIESE LVAIPAMQKR LEKEYQQPIS GQLLLKKDSH LPISGSIKAR GGIYEVLAHA EKLALEAGLL TLEDDYSKLL SPEFKQFFSQ YSIAVGSTGN LGLSIGIMSA RIGFKVTVHM SADARAWKKA KLRSHGVTVV EYEQDYGVAV EEGRKAAQSD PNCFFIDDEN SRTLFLGYSV AGQRLKAQFA EQGRIVDADN PLFVYLPCGV GGGPGGVAFG LKLAFGDHVH CFFAEPTHSP CMLLGVHTGL HDQISVQDIG IDNLTAADGL AVGRASGFVG RAMERLLDGF YTLSDQTMYD MLGWLAQEEG IRLEPSALAG MAGPQRVCAS VSYQQMHGFS AEQLRNATHL VWATGGGMVP EEEMEQYLAK GR
|
| |
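
The gene and protein sequences above are shown in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing, translates codons, and computes a GC fraction. It builds the standard codon assignments; translation table 11 uses the same amino-acid assignments and differs in which start codons are permitted, which this sketch does not model. All function names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments in NCBI "TCAG" codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGAAAACG CTAAAATGAA CTCGCTCATC"
print(translate(first_blocks))  # MENAKMNSLI
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.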