Gene ECH74115_4061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4061 
SymbolsdaB 
ID6968449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3756373 
End bp3757740 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID643387820 
ProductL-serine ammonia-lyase 2 
Protein accessionYP_002272263 
Protein GI209396781 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1760] L-serine deaminase 
TIGRFAM ID[TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.830001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGCG TATTCGATAT TTTCAAAATC GGCATTGGCC CTTCCAGTTC TCATACCGTT 
GGACCAATGA AAGCGGGTAA ACAATTTACC GACGATCTGA TTGCCCGTAA CCTGCTTAAA
GACGTGACCC GCGTGGTGGT TGACGTGTAC GGCTCGCTCT CTCTGACTGG TAAAGGCCAC
CACACTGATA TCGCCATTAT TATGGGCCTG GCGGGTAACC TGCCGGATAC CGTGGATATC
GATTCCATCC CTGGTTTTAT TCAGGATGTG AATACTCATG GTCGCCTGAT GCTGGCAAAC
GGTCAGCATG AAGTGGAGTT CCCGGTTGAT CAGTGCATGA ACTTCCATGC CGACAACCTT
TCTCTGCATG AAAACGGTAT GCGCATTACC GCGCTGGCGG GTGATAAAGT CGTTTACAGT
CAGACTTACT ACTCTATTGG CGGTGGCTTT ATCGTTGATG AAGAGCATTT TGGCCAGCAG
GATAGCGCAC CGGTTGAAGT TCCTTATCCG TACAGTTCAG CAGCCGATCT GCAAAAACAT
TGTCAGGAAA CCGGGCTGTC ACTCTCTGGC CTGATGATGA AAAACGAGCT GGCGCTGCAC
AGCAAAGAAG AGCTGGAACA GCACCTGGCG AACGTCTGGG AAGTGATGTG TGGCGGTATT
GAGCGCGGTA TTTCCACCGA AGGTGTGTTA CCTGGCAAAC TGCGTGTTCC ACGCCGTGCT
GCGGCACTAC GCCGGATGCT GGTCAGCCAG GATAAAACCA CCACTGACCC GATGGCGGTT
GTTGACTGGA TCAACATGTT TGCACTGGCA GTGAACGAAG AGAACGCTGC TGGCGGTCGC
GTGGTGACTG CGCCGACTAA CGGTGCGTGC GGGATTATCC CGGCAGTGCT GGCGTACTAC
GATAAGTTTA TCCGCGAAGT GAACGCTAAC TCACTGGCTC GTTACCTGCT GGTAGCCAGC
GCCATTGGTT CTCTTTATAA GATGAACGCG TCGATTTCTG GTGCCGAAGT GGGTTGCCAG
GGTGAAGTTG GTGTGGCGTG CTCAATGGCG GCGGCTGGTC TGGCAGAACT GTTAGGTGCA
AGCCCGGCGC AGGTGTGCAT CGCGGCGGAA ATCGCCATGG AGCACAACCT CGGTCTGACG
TGTGACCCGG TCGCCGGACA GGTACAGGTG CCATGCATCG AGCGTAACGC CATTGCGGCA
GTAAAAGCGG TGAACGCCGC ACGTATGGCG CTGCGCCGTA CCAGCGAGCC GCGCGTCTGC
CTCGATAAAG TTATCGAAAC CATGTACGAA ACAGGTAAAG ATATGAACGC CAAGTACCGC
GAAACCTCTC GCGGCGGCCT GGCAATGAAG ATCGTTGCCT GCGATTAA
 
Protein sequence
MISVFDIFKI GIGPSSSHTV GPMKAGKQFT DDLIARNLLK DVTRVVVDVY GSLSLTGKGH 
HTDIAIIMGL AGNLPDTVDI DSIPGFIQDV NTHGRLMLAN GQHEVEFPVD QCMNFHADNL
SLHENGMRIT ALAGDKVVYS QTYYSIGGGF IVDEEHFGQQ DSAPVEVPYP YSSAADLQKH
CQETGLSLSG LMMKNELALH SKEELEQHLA NVWEVMCGGI ERGISTEGVL PGKLRVPRRA
AALRRMLVSQ DKTTTDPMAV VDWINMFALA VNEENAAGGR VVTAPTNGAC GIIPAVLAYY
DKFIREVNAN SLARYLLVAS AIGSLYKMNA SISGAEVGCQ GEVGVACSMA AAGLAELLGA
SPAQVCIAAE IAMEHNLGLT CDPVAGQVQV PCIERNAIAA VKAVNAARMA LRRTSEPRVC
LDKVIETMYE TGKDMNAKYR ETSRGGLAMK IVACD