Gene EcSMS35_4895 details

Gene Information

Locus tag: EcSMS35_4895
Symbol:
ID: 6143763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5014092
End bp: 5015312
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 55%
IMG OID: 641619698
Product: putative D-serine ammonia-lyase
Protein accession: YP_001746805
Protein GI: 170680265
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3616] Predicted amino acid aldolase or racemase
TIGRFAM ID:
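
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the record above.
START_BP = 5014092
END_BP = 5015312


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the computed values match the Gene Length and Protein Length fields listed in the record.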


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.333429
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATACC ACTCAGAACC TCTGGTTCCC CATAAATCTG CCCTGATGCA AATGCCCGCA 
AATCTCCTTG CCGAAGATGT CTGTTTACCT GCCGCGATCA TTAAAAAGCA GGCCCTGGAG
AACAACATTA CGTGGATGCA ACGCTACGCT GACGCGCGCG GCGTTTCACT GGCACCGCAC
GGTAAAACCA CCATGACGCC GTGGATTTTT CAGGCGCAGC AGGCGGCTGG TGCCTGGGCA
ATAGGTGTCG GTAGCGCATG GCAGGCTGGT GCGGCAATGG CGAGCGGTAT TCAGCGAGTG
CTGATGGTTA ACCAGCTGGT CGGCAAGGCG AATATGCAGT TGATTGCCCA GTTGCAACGA
CATTACCCGA CGGTCGATTT TATCAGCTGT ATCGACAGCA TTGAGAACGC CCGCGCTTTG
TCTGCCTTTT TTGCCAGTCA GCAACAGACA CTGAACGTGA TGATTGAGTT GGGTGTCCCC
GGCGGGCGCT GTGGTTGCCG CTCAAGGGAT GCCGCACTGA CACTGGCTAA ACAGGCAGCC
CAATTACCGG GGTTACGCCT GCGGGGGCTT GAGTTATATG AAGGCGTTTT ACACGGTGAC
GATCCACAAC CGCAGGTAGA AGCTCTGCTG CGGGATGCCG CACAACTGGC CTGTGATATG
GCAAGTCTGG TTGATGGCGA GTTTATTCTC ACCGGAGCAG GCACGGTCTG GTATGACGTG
GTGTGCAATA TCTGGCTGGC GGCAGAAAAG CCTGATAACT GCCGCATTGT TATTCGCCCT
GGCTGCTACA TCACTCACGA CACCGGGATC TACGACGCGG CACAACAACA GTTGCTTGCC
CGTGATCCTG TGGCTTGCGA TCTGGCAGGC GATCTCACAT CTGCACTGGA ACTGGTCGCC
ATGGTGCAAT CGGTGCCGGA AGCTGATCGT GCGGTGGTTA ATTTTGGTAA GCGTGATTGC
GCCTTCGACG CCGGACTGCC GCAACCAGTC GCCCACTATC GCAACGGCAA ATCACTGGCA
TTTGATCCAC AGGCGATTCG CAGTACAGGC ATTATGGATC AGCACTGCAT GTTGCAGTTG
GGTGCCGACA GCGACGTGCA AGTGGGGGAT ATTCTGGTGT TTGGCACATC GCATCCGTGC
CTGACCTTCG ACAAATGGAA AACGTTGTTA TTGACTGATG ACGACTACAA CGTACTGGCA
GAATTAGACA CTCTCTTCTA A
 
Protein sequence
MKYHSEPLVP HKSALMQMPA NLLAEDVCLP AAIIKKQALE NNITWMQRYA DARGVSLAPH 
GKTTMTPWIF QAQQAAGAWA IGVGSAWQAG AAMASGIQRV LMVNQLVGKA NMQLIAQLQR
HYPTVDFISC IDSIENARAL SAFFASQQQT LNVMIELGVP GGRCGCRSRD AALTLAKQAA
QLPGLRLRGL ELYEGVLHGD DPQPQVEALL RDAAQLACDM ASLVDGEFIL TGAGTVWYDV
VCNIWLAAEK PDNCRIVIRP GCYITHDTGI YDAAQQQLLA RDPVACDLAG DLTSALELVA
MVQSVPEADR AVVNFGKRDC AFDAGLPQPV AHYRNGKSLA FDPQAIRSTG IMDQHCMLQL
GADSDVQVGD ILVFGTSHPC LTFDKWKTLL LTDDDYNVLA ELDTLF
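
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming the sequence has been pasted into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short input shown is a toy example rather than the gene itself.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Toy input in the same blocked layout as the record (not the real gene).
toy = "ATGCGGCCTA ATGC\nGGCC"
cleaned = clean_sequence(toy)
print(len(cleaned), "bp")
print(f"GC fraction: {gc_fraction(cleaned):.2f}")
```

Applying the same two functions to the full gene sequence gives a length and GC percentage that can be compared against the Gene Length and GC content fields of the record.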