Gene EcSMS35_1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1374 
SymbolsdaA 
ID6145711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1361797 
End bp1363161 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content55% 
IMG OID641616252 
ProductL-serine ammonia-lyase 1 
Protein accessionYP_001743432 
Protein GI170681757 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1760] L-serine deaminase 
TIGRFAM ID[TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.779144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0170145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTAGTC TATTCGACAT GTTTAAGGTG GGGATTGGTC CCTCATCTTC CCATACCGTA 
GGGCCTATGA AGGCGGGTAA ACAGTTCGTC GATGACCTGG TCGAAAAAGG CTTACTGGAT
AGCGTTACTC GCGTTGCCGT GGACGTTTAT GGTTCACTGT CGCTGACGGG TAAAGGCCAC
CACACCGATA TCGCCATTAT TATGGGTCTT GCAGGTAACG AACCTGCCAC CGTGGATATC
GACAGTATTC CCGGTTTTAT TCGCGACGTA GAAGAGCGCG AACGTCTGCT GCTGGCACAG
GGACGGCATG AAGTGGATTT CCCGCGCGAC AACGGGATGC GTTTTCATAA CGGTAACCTG
CCGCTGCATG AAAACGGTAT GCAAATCCAC GCCTATAACG GCGATGAAGT CGTCTACAGC
AAAACTTATT ATTCCATCGG CGGCGGTTTT ATCGTCGATG AAGAACACTT TGGTCAGGAT
GCCGCCAACG AAGTGAGCGT GCCGTATCCG TTCAAATCTG CCACCGAACT GCTCGCGTAC
TGCAATGAAA CCGGCTATTC GCTGTCTGGT CTCGCTATGC AGAATGAACT GGCGCTGCAC
AGCAAGAAAG AGATCGACGA GTATTTCGCG CATGTCTGGC AAACCATGCA GGCATGTATC
GATCGCGGGA TGAACACCGA AGGCGTACTG CCAGGCCCGC TGCGCGTGCC GCGTCGTGCG
TCTGCCCTGC GCCGGATGCT GGTTTCCAGC GATAAACTGT CTAACGATCC GATGAATGTC
ATTGACTGGG TAAACATGTT TGCGCTGGCA GTTAACGAAG AGAACGCCGC CGGTGGTCGC
GTGGTAACTG CGCCAACCAA CGGTGCTTGT GGCATTGTTC CGGCAGTGCT GGCTTACTAT
GACCACTTTA TTGAGTCCGT CAGCCCGGAC ATCTATACCC GCTACTTTAT GGCAGCGGGT
GCGATTGGTG CACTGTATAA AATGAACGCC TCTATTTCCG GTGCGGAAGT CGGTTGCCAG
GGCGAAGTGG GTGTTGCCTG TTCAATGGCT GCTGCGGGCC TTGCCGAGCT GTTGGGCGGT
AGCCCGGAAC AGGTTTGTGT GGCGGCGGAA ATTGGCATGG AGCACAACCT CGGTCTGACC
TGCGACCCGG TTGCAGGGCA GGTTCAGGTG CCGTGCATTG AGCGTAATGC TATTGCCTCT
GTGAAGGCGA TTAACGCCGC ACGGATGGCT CTGCGCCGCA CCAGCGCACC GCGCGTCTCG
CTGGATAAGG TCATCGAAAC GATGTACGAA ACCGGTAAGG ACATGAACGC CAAATACCGC
GAAACCTCAC GCGGTGGTCT GGCAATCAAA GTCCAGTGTG ACTAA
 
Protein sequence
MISLFDMFKV GIGPSSSHTV GPMKAGKQFV DDLVEKGLLD SVTRVAVDVY GSLSLTGKGH 
HTDIAIIMGL AGNEPATVDI DSIPGFIRDV EERERLLLAQ GRHEVDFPRD NGMRFHNGNL
PLHENGMQIH AYNGDEVVYS KTYYSIGGGF IVDEEHFGQD AANEVSVPYP FKSATELLAY
CNETGYSLSG LAMQNELALH SKKEIDEYFA HVWQTMQACI DRGMNTEGVL PGPLRVPRRA
SALRRMLVSS DKLSNDPMNV IDWVNMFALA VNEENAAGGR VVTAPTNGAC GIVPAVLAYY
DHFIESVSPD IYTRYFMAAG AIGALYKMNA SISGAEVGCQ GEVGVACSMA AAGLAELLGG
SPEQVCVAAE IGMEHNLGLT CDPVAGQVQV PCIERNAIAS VKAINAARMA LRRTSAPRVS
LDKVIETMYE TGKDMNAKYR ETSRGGLAIK VQCD