Gene EcolC_1303 details

Gene Information

Locus tag: EcolC_1303
Symbol:
ID: 6068555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1430553
End bp: 1431881
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 51%
IMG OID: 641600724
Product: D-serine dehydratase
Protein accession: YP_001724296
Protein GI: 170019342
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3048] D-serine dehydratase
TIGRFAM ID: [TIGR02035] D-serine ammonia-lyase
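
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information entries above.
start_bp = 1430553
end_bp = 1431881
gene_length_bp = 1329
protein_length_aa = 442


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```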


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.29121
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAAACG CTAAAATGAA CTCGCTCATC GCCCAGTATC CGTTGGTAAA GGATCTGGTT 
GCTCTTAAAG AAACCACCTG GTTTAATCCT GGCACGACCT CATTGGCTGA AGGTTTACCT
TATGTTGGCC TGACCGAACA GGATGTTCAG GACGCCCATG CGCGCTTATC CCGTTTTGCA
CCCTATCTGG CAAAAGCATT TCCTGAAACT GCTGCCACTG GGGGGATTAT TGAATCAGAA
CTGGTTGCCA TTCCAGCTAT GCAAAAACGG CTGGAAAAAG AATATCAGCA ACCGATCAGC
GGGCAACTGT TACTGAAAAA AGATAGCCAT TTGCCCATTT CCGGCTCCAT AAAAGCACGC
GGCGGGATTT ATGAAGTCCT GGCACACGCA GAAAAACTGG CTCTGGAAGC GGGGTTGCTG
ACGCTTGATG ATGACTACAG CAAACTGCTT TCTCCGGAGT TTAAACAGTT CTTTAGCCAA
TACAGCATTG CTGTGGGCTC AACCGGAAAT CTGGGGTTAT CAATCGGCAT TATGAGCGCC
CGCATTGGCT TTAAGGTGAC AGTTCATATG TCTGCTGATG CCCGGGCATG GAAAAAAGCG
AAACTGCGCA GCCATGGCGT TACGGTCGTG GAATATGAGC AAGATTATGG TGTTGCCGTC
GAGGAAGGAC GTAAAGCAGC GCAGTCTGAC CCGAACTGTT TCTTTATTGA TGACGAAAAT
TCCCGCACGT TGTTCCTTGG GTATTCCGTC GCTGGCCAGC GTCTTAAAGC GCAATTTGCC
CAGCAAGGCC GTATCGTCGA TGCTGATAAC CCTCTGTTTG TCTATCTGCC GTGTGGTGTT
GGCGGTGGTC CTGGTGGCGT CGCATTCGGG CTTAAACTGG CGTTTGGCGA TCATGTTCAC
TGCTTTTTTG CCGAACCAAC GCACTCCCCT TGTATGTTGT TAGGCGTCCA TACAGGATTA
CACGATCAGA TTTCTGTTCA GGATATTGGT ATCGACAACC TTACCGCAGC GGATGGCCTT
GCAGTTGGTC GCGCATCAGG CTTTGTCGGG CGGGCAATGG AGCGTCTGCT GGATGGCTTC
TATACCCTTA GCGATCAAAC CATGTATGAC ATGCTTGGCT GGCTGGCGCA GGAAGAAGGT
ATTCGTCTTG AACCTTCGGC ACTGGCGGGT ATGGCCGGAC CTCAGCGCGT GTGTGCATCA
GTAAGTTACC AACAGATGCA CGGTTTCAGC GCAGAACAAC TGCGTAATAC CACTCATCTG
GTGTGGGCGA CGGGAGGTGG AATGGTGCCG GAAGAAGAGA TGAATCAATA TCTGGCAAAA
GGCCGTTAA
 
Protein sequence
MENAKMNSLI AQYPLVKDLV ALKETTWFNP GTTSLAEGLP YVGLTEQDVQ DAHARLSRFA 
PYLAKAFPET AATGGIIESE LVAIPAMQKR LEKEYQQPIS GQLLLKKDSH LPISGSIKAR
GGIYEVLAHA EKLALEAGLL TLDDDYSKLL SPEFKQFFSQ YSIAVGSTGN LGLSIGIMSA
RIGFKVTVHM SADARAWKKA KLRSHGVTVV EYEQDYGVAV EEGRKAAQSD PNCFFIDDEN
SRTLFLGYSV AGQRLKAQFA QQGRIVDADN PLFVYLPCGV GGGPGGVAFG LKLAFGDHVH
CFFAEPTHSP CMLLGVHTGL HDQISVQDIG IDNLTAADGL AVGRASGFVG RAMERLLDGF
YTLSDQTMYD MLGWLAQEEG IRLEPSALAG MAGPQRVCAS VSYQQMHGFS AEQLRNTTHL
VWATGGGMVP EEEMNQYLAK GR
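
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they might be processed, the code below joins the blocks, computes a GC percentage, and translates codons. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical; the codon table is built from the standard amino acid assignments, which translation table 11 shares for non-start codons.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = clean_sequence("ATGGAAAACG CTAAAATGAA CTCGCTCATC")
print(translate(fragment))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and GC content.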