Gene ECH74115_5502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5502 
SymbollysC 
ID6970937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5149498 
End bp5150847 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content54% 
IMG OID643389146 
Productaspartate kinase III 
Protein accessionYP_002273543 
Protein GI209399821 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA TTGTTGTCTC CAAATTTGGC GGTACCAGCG TAGCTGATTT TGACGCCATG 
AACCGCAGCG CTGATATTGT GCTTTCTGAT GCCAACGTGC GTTTAGTTGT CCTCTCGGCT
TCTGCTGGTA TCACTAATCT GCTGGTCGCT TTAGCGGAAG GACTGGAACC TGGCGAGCGA
TTCGAAAAAC TCGACGCTAT TCGCAACATC CAGTTTGCCA TTCTGGAACG TCTGCGTTAC
CCGAACGTTA TCCGTGAAGA GATTGAACGT CTGCTGGAGA ACATTACTGT TCTGGCAGAA
GCGGCGGCGC TGGCAACGTC TCCGGCGCTG ACAGATGAAC TGGTCAGCCA CGGCGAGCTG
ATGTCGACCC TGCTGTTTGT CGAGATCCTG CGCGAACGCG ATGTTCAGGC ACAGTGGTTT
GATGTACGTA AAGTGATGCG TACCAACGAC CGATTTGGTC GTGCAGAGCC AGATGTAGCC
GCGCTGGCGG AACTGGCCGC GCTGCAGCTG CTCCCACGTC TCAATGAAGG CTTAGTGATC
ACCCAGGGAT TTATCGGTAG CGAAAATAAA GGTCGTACAA CGACGCTTGG CCGTGGAGGC
AGCGATTATA CGGCAGCCTT GCTGGCGGAG GCTTTACACG CATCTCGTGT TGATATCTGG
ACCGACGTCC CGGGCATCTA CACCACCGAT CCACGCGTAG TTTCCGCAGC AAAACGCATT
GATGAAATCG CGTTTGCCGA AGCGGCAGAG ATGGCAACTT TTGGTGCAAA AGTACTGCAT
CCGGCAACGT TGCTACCCGC AGTACGCAGC GATATCCCAG TCTTTGTCGG CTCCAGCAAA
GACCCACGCG CAGGTGGTAC GCTGGTGTGC AATAAAACTG AAAATCCGCC GCTGTTCCGC
GCGCTGGCGC TTCGTCGCAA TCAGACTCTG CTCACTTTGC ACAGCCTGAA TATGCTGCAT
TCTCGCGGTT TCCTCGCGGA AGTTTTCGGC ATCCTCGCGC GGCATAATAT TTCGGTAGAC
TTAATCACCA CGTCAGAAGT GAGCGTGGCA TTAACCCTTG ATACCACCGG TTCAACCTCC
ACTGGCGATA CGTTGCTGAC GCAATCTCTG CTGATGGAGC TTTCCGCACT GTGCCGGGTG
GAGGTGGAAG AAGGTCTGGC GCTGGTCGCG TTGATTGGCA ATGACCTGTC AAAAGCCTGC
GGCGTTGGCA AAGAGGTATT CGGCGTACTG GAACCGTTCA ACATTCGCAT GATTTGTTAC
GGCGCATCCA GCCATAACCT GTGCTTCCTG GTGCCCGGCG AAGATGCCGA GCAGGTGGTG
CAAAAACTGC ATAGTAATTT GTTTGAGTAA
 
Protein sequence
MSEIVVSKFG GTSVADFDAM NRSADIVLSD ANVRLVVLSA SAGITNLLVA LAEGLEPGER 
FEKLDAIRNI QFAILERLRY PNVIREEIER LLENITVLAE AAALATSPAL TDELVSHGEL
MSTLLFVEIL RERDVQAQWF DVRKVMRTND RFGRAEPDVA ALAELAALQL LPRLNEGLVI
TQGFIGSENK GRTTTLGRGG SDYTAALLAE ALHASRVDIW TDVPGIYTTD PRVVSAAKRI
DEIAFAEAAE MATFGAKVLH PATLLPAVRS DIPVFVGSSK DPRAGGTLVC NKTENPPLFR
ALALRRNQTL LTLHSLNMLH SRGFLAEVFG ILARHNISVD LITTSEVSVA LTLDTTGSTS
TGDTLLTQSL LMELSALCRV EVEEGLALVA LIGNDLSKAC GVGKEVFGVL EPFNIRMICY
GASSHNLCFL VPGEDAEQVV QKLHSNLFE