Gene EcHS_A2941 details

Gene Information

Locus tag: EcHS_A2941
Symbol: sdaB
ID: 5594010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2947785
End bp: 2949152
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 54%
IMG OID: 640922059
Product: L-serine ammonia-lyase
Protein accession: YP_001459569
Protein GI: 157162251
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1760] L-serine deaminase
TIGRFAM ID: [TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form
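
The Start bp, End bp, Gene Length, and Protein Length values listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration.

```python
# Values copied from the gene record.
start_bp = 2947785
end_bp = 2949152
gene_length_bp = 1368
protein_length_aa = 455


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length == gene_length_bp)       # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)   # codon count agrees with Protein Length
```

Under these assumptions, the coordinate span and the codon count both agree with the lengths reported in the record.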


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value: 0.73805
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGATTAGCG TATTCGATAT TTTCAAAATC GGCATTGGCC CTTCCAGTTC TCATACCGTT 
GGACCAATGA AAGCGGGTAA ACAATTTACC GACGATCTGA TTGCCCATAA TCTGCTTAAA
GACGTGACCC GCGTGGTGGT TGACGTGTAC GGCTCGCTCT CTCTGACCGG TAAAGGCCAC
CACACTGATA TCGCCATTAT TATGGGCCTG GCGGGTAACC TGCCGGATAC CGTGGATATC
GATTCCATCC CTGGTTTTAT TCAGGATGTG AATACTCATG GTCGCCTGAT GCTGGCAAAC
GGTCAGCATG AAGTGGAGTT CCCGGTTGAT CAGTGCATGA ACTTCCATGC AGACAACCTT
TCTCTGCATG AAAACGGTAT GCGCATTACC GCGCTGGCGG GCGATAAAGT CGTTTACAGC
CAGACTTACT ACTCTATTGG CGGTGGCTTT ATCGTTGATG AAGAGCATTT TGGCCAGCAG
AATAGCGCAC CGGTTGAAGT TCCTTATCCG TACAGTTCAG CAGCCGATCT GCAAAAACAT
TGTCAGGAAA CCGGGCTGTC ACTCTCTGGC CTGATGATGA AAAATGAGCT GGCGCTGCAC
AGCAAAGAAG AGCTGGAACA GCACCTGGCG AACGTATGGG AAGTCATGCG TGGCGGTATT
GAGCGCGGTA TTTCCACCGA AGGCGTGTTG CCTGGCAAAC TGCGCGTTCC ACGCCGTGCT
GCGGCACTAC GCCGGATGCT GGTCAGCCAG GATAAAACCA CCACTGACCC GATGGCGGTT
GTTGACTGGA TCAACATGTT TGCACTGGCA GTGAACGAAG AGAACGCTGC TGGCGGACGC
GTGGTGACTG CGCCGACTAA CGGTGCGTGC GGGATTATCC CGGCAGTGCT GGCGTACTAC
GACAAGTTTA TCCGCGAAGT GAACGCTAAC TCACTGGCTC GTTACCTGCT GGTAGCCAGT
GCCATTGGTT CTCTTTATAA GATGAACGCG TCAATTTCTG GTGCTGAAGT CGGCTGCCAG
GGTGAAGTTG GTGTGGCGTG CTCAATGGCG GCGGCTGGTC TGGCAGAGCT GTTAGGTGCA
AGCCCGGCGC AGGTGTGCAT CGCGGCGGAA ATCGCCATGG AGCACAACCT CGGTCTGACG
TGTGACCCGG TCGCCGGACA GGTCCAGGTG CCATGCATCG AGCGTAACGC CATTGCGGCA
GTAAAAGCGG TGAACGCCGC ACGTATGGCG CTACGCCGTA CCAGCGAGCC GCGCGTCTGC
CTCGATAAAG TTATCGAAAC CATGTACGAA ACAGGTAAAG ATATGAACGC CAAGTACCGC
GAAACCTCTC GCGGCGGCCT GGCAATGAAG ATCGTTGCCT GCGATTAA
 
Protein sequence
MISVFDIFKI GIGPSSSHTV GPMKAGKQFT DDLIAHNLLK DVTRVVVDVY GSLSLTGKGH 
HTDIAIIMGL AGNLPDTVDI DSIPGFIQDV NTHGRLMLAN GQHEVEFPVD QCMNFHADNL
SLHENGMRIT ALAGDKVVYS QTYYSIGGGF IVDEEHFGQQ NSAPVEVPYP YSSAADLQKH
CQETGLSLSG LMMKNELALH SKEELEQHLA NVWEVMRGGI ERGISTEGVL PGKLRVPRRA
AALRRMLVSQ DKTTTDPMAV VDWINMFALA VNEENAAGGR VVTAPTNGAC GIIPAVLAYY
DKFIREVNAN SLARYLLVAS AIGSLYKMNA SISGAEVGCQ GEVGVACSMA AAGLAELLGA
SPAQVCIAAE IAMEHNLGLT CDPVAGQVQV PCIERNAIAA VKAVNAARMA LRRTSEPRVC
LDKVIETMYE TGKDMNAKYR ETSRGGLAMK IVACD