Gene EcSMS35_2937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2937 
SymbolsdaB 
ID6147516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3012826 
End bp3014193 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID641617806 
ProductL-serine ammonia-lyase 2 
Protein accessionYP_001744961 
Protein GI170680778 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1760] L-serine deaminase 
TIGRFAM ID[TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.427341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGCG TATTCGATAT TTTCAAAATC GGCATTGGCC CTTCCAGTTC TCATACCGTT 
GGACCAATGA AAGCGGGTAA ACAATTTACC GACGATCTGA TTGCCCGTGA CCTGCTTAAA
GACGTGACCC GCGTGGTGGT TGACGTGTAC GGCTCGCTCT CTCTGACCGG TAAAGGCCAC
CACACTGATA TCGCCATAAT TATGGGTCTG GCGGGTAACC TGCCGGATAC CGTGGATATC
GATTCCATCC CTGGTTTTAT TCAGGATGTG AATACTCATG GTCGCCTGAT GCTGGCAAAC
GGTCAGCATG AAGTGGAGTT CCCGGTTGAT CAGTGCATGA ACTTCCATGC CGACAACCTT
TCTCTGCATG AAAACGGTAT GCGCATTACC GCGCTGGCGG GCGATAAAGT CGTTTACAGC
CAGACTTATT ACTCTATCGG CGGTGGCTTT ATTGTTGATG AAGAACATTT TGGTCAGCAG
GATAGCGCAC CGGTTGAAGT TCCTTATCCG TACAGTTCAG CAGCCGATCT GCAAAAACAT
TGTCAGGAAA CCGGGCTGTC ACTCTCTGGC CTGATGATGA AAAACGAGCT GGCGCTGCAC
AGCAAAGAAG AGCTGGAACA GCACCTGGCG AACGTCTGGG AAGTCATGCG TGGCGGTATT
GAGCGCGGTA TTTCTACCGA AGGCGTGTTA CCTGGCAAAC TGCGCGTTCC ACGCCGTGCT
GCGGCACTAC GCCGGATGCT GGTCAGCCAG GATAAAACCA CCACTGACCC GATGGCGGTT
GTTGACTGGA TCAACATGTT TGCACTGGCA GTGAACGAAG AGAACGCTGC TGGCGGTCGC
GTGGTGACTG CGCCGACTAA CGGTGCGTGC GGGATTATCC CAGCTGTGCT GGCGTACTAC
GATAAGTTTA TCCGCGAAGT GAACGCTAAC TCACTGGCTC GTTACCTGCT GGTAGCCAGC
GCCATTGGTT CTCTTTATAA GATGAACGCG TCGATTTCTG GTGCCGAAGT GGGTTGCCAG
GGTGAAGTTG GCGTGGCGTG CTCAATGGCG GCGGCTGGTC TGGCAGAACT GTTAGGCGCA
AGCCCGGCGC AGGTGTGCAT CGCGGCGGAA ATCGCCATGG AGCACAACCT CGGTCTGACG
TGTGACCCGG TTGCTGGACA GGTCCAGGTG CCATGCATCG AGCGTAACGC CATTGCGGCA
GTGAAAGCGG TAAATGCTGC ACGTATGGCG CTGCGCCGTA CCAGCGAGCC GCGCGTCTGC
CTCGATAAAG TTATCGAGAC CATGTACGAA ACAGGTAAAG ATATGAACGC CAAGTACCGC
GAAACCTCTC GCGGTGGCCT GGCAATGAAG ATCGTTGCCT GCGATTAA
 
Protein sequence
MISVFDIFKI GIGPSSSHTV GPMKAGKQFT DDLIARDLLK DVTRVVVDVY GSLSLTGKGH 
HTDIAIIMGL AGNLPDTVDI DSIPGFIQDV NTHGRLMLAN GQHEVEFPVD QCMNFHADNL
SLHENGMRIT ALAGDKVVYS QTYYSIGGGF IVDEEHFGQQ DSAPVEVPYP YSSAADLQKH
CQETGLSLSG LMMKNELALH SKEELEQHLA NVWEVMRGGI ERGISTEGVL PGKLRVPRRA
AALRRMLVSQ DKTTTDPMAV VDWINMFALA VNEENAAGGR VVTAPTNGAC GIIPAVLAYY
DKFIREVNAN SLARYLLVAS AIGSLYKMNA SISGAEVGCQ GEVGVACSMA AAGLAELLGA
SPAQVCIAAE IAMEHNLGLT CDPVAGQVQV PCIERNAIAA VKAVNAARMA LRRTSEPRVC
LDKVIETMYE TGKDMNAKYR ETSRGGLAMK IVACD