Gene EcHS_A3300 details


Gene Information

Locus tag: EcHS_A3300
Symbol: sdaA1
ID: 5592210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3307123
End bp: 3308487
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 53%
IMG OID: 640922418
Product: L-serine ammonia-lyase
Protein accession: YP_001459912
Protein GI: 157162594
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1760] L-serine deaminase
TIGRFAM ID: [TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form
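
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3307123, 3308487)
print(length_bp)                  # 1365
print(protein_length(length_bp))  # 454
```

Both values agree with the Gene Length and Protein Length fields of this record.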


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAGTG CATTCGATAT TTTCAAAATT GGGATTGGTC CCTCCAGTTC GCATACCGTG 
GGGCCAATGA ATGCCGGAAA AAGTTTTATT GATCGGCTGG AAAGTAGCGG CTTATTAACC
GCGACGAGCC ATATTGTGGT CGATCTGTAC GGGTCGTTGT CACTGACGGG CAAAGGCCAT
GCCACGGATG TCGCCATCAT CATGGGACTG GCAGGAAACA GTCCGCAGGA TGTTGTCATT
GATGAGATCC CTGCATTTAT AGAGTTAGTA ACGCGCAGCG GGCGGCTGCC AGTGGCATCT
GGTGCGCATA TTGTTGATTT TCCTGTAGCA AAGAACATTA TCTTCCATCC CGAAATGTTG
CCTCGCCATG AGAACGGAAT GCGGATCACT GCCTGGAAGG GACAGGAAGA GCTATTAAGT
AAAACCTATT ACTCTGTCGG CGGCGGGTTT ATTGTCGAAG AAGAACACTT CGGCCTGTCG
CACGATGTCG AAACGTCCGT ACCTTACGAT TTCCACTCAG CAGGTGAACT GCTGAAAATG
TGTGATTACA ACGGCCTGTC TATATCTGGT CTGATGATGC ACAACGAGCT AGCGCTGCGC
AGCAAAGCGG AAATTGACGC CGGTTTTGCC CGTATCTGGC AAGTGATGCA TGACGGTATT
GAACGTGGGA TGAACACTGA AGGCGTGCTG CCTGGTCCGC TCAATGTGCC GCGCCGTGCC
GTAGCGCTGC GTCGTCAGCT GGTTTCCAGC GATAACATCT CTAACGATCC GATGAATGTC
ATCGACTGGA TCAACATGTA CGCGCTGGCG GTTAGTGAAG AAAACGCAGC TGGCGGGCGC
GTGGTAACGG CACCGACTAA CGGTGCGTGC GGCATTATTC CGGCAGTACT GGCTTATTAC
GATAAGTTCC GTCGTCCGGT AAACGAGCGG TCAATTGCCC GCTATTTTCT GGCCGCGGGG
GCTATTGGCG CGCTGTATAA AATGAACGCC TCCATCTCTG GCGCGGAAGT CGGCTGTCAG
GGGGAGATTG GCGTGGCCTG TTCAATGGCG GCGGCAGGGT TAACTGAACT ACTGGGCGGC
AGTCCGGCGC AGGTATGCAA TGCGGCGGAA ATCGCGATGG AGCATAACCT TGGGCTGACC
TGCGATCCGG TTGCCGGACA GGTACAAATC CCGTGCATTG AACGTAATGC CATTAATGCC
GTGAAAGCAG TAAACGCCGC GCGGATGGCG ATGCGCCGCA CCTCGGCACC GCGTGTTTCA
CTCGATAAAG TGATCGAGAC GATGTATGAA ACCGGCAAAG ATATGAACGA TAAATACCGC
GAAACATCAC GCGGAGGACT GGCCATTAAA GTGGTCTGCG GCTGA
 
Protein sequence
MISAFDIFKI GIGPSSSHTV GPMNAGKSFI DRLESSGLLT ATSHIVVDLY GSLSLTGKGH 
ATDVAIIMGL AGNSPQDVVI DEIPAFIELV TRSGRLPVAS GAHIVDFPVA KNIIFHPEML
PRHENGMRIT AWKGQEELLS KTYYSVGGGF IVEEEHFGLS HDVETSVPYD FHSAGELLKM
CDYNGLSISG LMMHNELALR SKAEIDAGFA RIWQVMHDGI ERGMNTEGVL PGPLNVPRRA
VALRRQLVSS DNISNDPMNV IDWINMYALA VSEENAAGGR VVTAPTNGAC GIIPAVLAYY
DKFRRPVNER SIARYFLAAG AIGALYKMNA SISGAEVGCQ GEIGVACSMA AAGLTELLGG
SPAQVCNAAE IAMEHNLGLT CDPVAGQVQI PCIERNAINA VKAVNAARMA MRRTSAPRVS
LDKVIETMYE TGKDMNDKYR ETSRGGLAIK VVCG
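
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it builds the amino-acid assignments of NCBI translation table 11 by hand, strips the 10-base spacing used in the listing, and translates up to the first stop codon. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The sketch does not apply table 11's alternative start-codon rule, which does not matter here because the gene begins with ATG.

```python
# Amino-acid assignments of NCBI translation table 11, with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to format the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First seven codons of the gene sequence listed above.
print(translate("ATGATTAGTG CATTCGATAT T"))  # MISAFDI
# Last five codons, ending in the TGA stop.
print(translate("GTGGTCTGCG GCTGA"))         # VVCG
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks the whole record in the same way, and `gc_percent` on the full gene sequence can be compared with the GC content field.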