Gene ECH74115_5400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5400 
SymbolmetL 
ID6969769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5039830 
End bp5042262 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content59% 
IMG OID643389053 
Productbifunctional aspartate kinase II/homoserine dehydrogenase II 
Protein accessionYP_002273462 
Protein GI209399480 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.324557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGA TTGCGCAGGC AGGGGCGAAA GGTCGTCAGC TGCATAAATT TGGTGGCAGT 
AGTCTGGCTG ATGTGAAGTG TTATTTGCGT GTCGCGGGCA TTATGGCGGA GTACTCTCAG
CCTGACGATA TGATGGTGGT TTCCGCCGCT GGTAGCACCA CTAACCAGTT GATTAACTGG
TTGAAACTAA GCCAGACCGA TCGTCTCTCT GCGCATCAGG TTCAACAAAC GCTGCGTCGC
TATCAGTGCG ATCTGATTAG CGGTCTGCTA CCCGCCGAAG AAGCCGATAG CCTCATTAGC
GCTTTTGTCA GCGATCTTGA GCGCCTGGCG GCGCTGCTCG ACAGCGGTAT TAACGACGCA
GTGTATGCGG AAGTGGTGGG CCACGGCGAA GTGTGGTCGG CGCGCCTGAT GTCTGCGGTG
CTTAATCAAC AAGGGCTGCC AGCGGCCTGG CTTGATGCCC GCGAGTTTTT ACGCGCTGAA
CGCGCTGCGC AACCGCAGGT TGATGAAGGG CTTTCTTACC CGTTGCTGCA ACAGCTACTG
GTACAACATC CGGGTAAACG TCTGGTGGTG ACCGGGTTTA TCAGCCGTAA CAACGCCGGT
GAAACGGTGC TGCTGGGGCG TAACGGTTCC GACTATTCCG CGACACAAAT CGGTGCGCTG
GCGGGTGTTT CTCGTGTAAC CATCTGGAGC GACGTCGCCG GGGTATACAG TGCCGATCCG
CGTAAAGTGA AAGATGCCTG TCTGCTGCCA TTGCTGCGTC TGGATGAAGC CAGCGAACTG
GCGCGTCTGG CGGCTCCCGT TCTTCACGCC CGTACTTTAC AGCCAGTATC CGGTAGCGAA
ATCGACCTGC AGCTGCGCTG TAGCTACACG CCGGATCAAG GCTCCACGCG CATTGAACGC
GTGCTGGCCT CCGGTACTGG TGCGCGTATT GTCACCAGCC ACGATGATGT CTGCTTGATT
GAGTTTCAGG TGCCCACCAG CCAGGATTTC AAACTGGCGC ATAAAGAGAT CGACCAGATC
CTGAAACGTG CTCAGGTGCG CCCACTGGCG GTTGGCGTGC ATAACGATCG CCAGTTGCTG
CAATTTTGCT ACACCTCAGA AGTGGCCGAC AGTGCGCTGA AAATCCTCGA CGAAGCGGGA
TTACCTGGCG AACTGCGCCT GCGTCAGGGG CTGGCGCTGG TGGCGATGGT CGGGGCGGGT
GTCACCCGTA ACCCGCTGCA TTGCCACCGC TTCTGGCAGC AACTGAAAGG CCAGCCGGTC
GAGTTTACCT GGCAGTCCGA TGACGGCATC AGCCTGGTGG CAGTACTGCG CACCGGCCCG
ACCGAAAGCC TGATTCAGGG CCTGCATCAG TCCGTCTTCC GCGCAGAAAA ACGCATCGGC
CTGGTATTGT TCGGCAAAGG CAATATCGGT TCCCGCTGGC TGGAACTGTT CGCCCGCGAG
CAGAGCACGC TTTCGGCGCG TACCGGCTTT GAATTTGTGC TGGCAGGCGT GGTGGACAGC
CGCCGCAGTC TGTTGAGTTA TGACGGGCTG GACGCCAGCC GCGCGTTAGC CTTCTTCAAC
GATGAAGCGG TTGAGCAGGA TGAAGAGTCG TTGTTCCTGT GGATGCGCGC CCATCCGTAT
GATGATTTAG TGGTGCTGGA CGTTACCGCC AGCCAGCAGC TGGCTGACCA GTATCTTGAT
TTCGCCAGCC ACGGTTTCCA CGTCATCAGT GCCAACAAAC TGGCGGGAGC AAGCGACAGC
AATAAATATC GCCAGATCCA CGACGCCTTC GAAAAAACCG GGCGTCACTG GCTATACAAC
GCCACCGTCG GTGCGGGCTT GCCGATCAAC CATACCGTGC GCGATCTGAT CGACAGCGGC
GATACTATTT TGTCGATCAG TGGGATCTTC TCCGGCACGC TCTCCTGGCT GTTCCTGCAA
TTCGACGGTA GCGTGCCGTT TACCGAGCTG GTGGATCAGG CGTGGCAGCA GGGCTTAACC
GAGCCTGATC CGCGTGACGA CCTCTCCGGC AAAGACGTAA TGCGCAAGCT GGTGATTCTG
GCGCGTGAAG CGGGTTACAA CATTGAACCG GACCAGGTAC GTGTGGAATC GCTGGTGCCT
GCTCATTGCG AAGGCGGCAG CATCGACCAT TTCTTTGAAA ATGGCGATGA ACTTAACGAG
CAGATGGTGC AACGGCTGGA AGCGGCCCGC GAAATGGGGC TGGTGCTGCG CTACGTGGCG
CGTTTCGATG CCAACGGCAA AGCGCGTGTG GGCGTGGAGG CGGTGCGTGA AGACCATCCG
CTGGCATCAC TGCTACCGTG CGATAACGTC TTTGCCATCG AAAGCCGCTG GTATCGCGAC
AACCCGCTGG TGATCCGTGG GCCTGGCGCG GGGCGCGACG TCACCGCCGG GGCGATTCAG
TCGGATATCA ACCGGCTGGC ACAGTTGTTG TAA
 
Protein sequence
MSVIAQAGAK GRQLHKFGGS SLADVKCYLR VAGIMAEYSQ PDDMMVVSAA GSTTNQLINW 
LKLSQTDRLS AHQVQQTLRR YQCDLISGLL PAEEADSLIS AFVSDLERLA ALLDSGINDA
VYAEVVGHGE VWSARLMSAV LNQQGLPAAW LDAREFLRAE RAAQPQVDEG LSYPLLQQLL
VQHPGKRLVV TGFISRNNAG ETVLLGRNGS DYSATQIGAL AGVSRVTIWS DVAGVYSADP
RKVKDACLLP LLRLDEASEL ARLAAPVLHA RTLQPVSGSE IDLQLRCSYT PDQGSTRIER
VLASGTGARI VTSHDDVCLI EFQVPTSQDF KLAHKEIDQI LKRAQVRPLA VGVHNDRQLL
QFCYTSEVAD SALKILDEAG LPGELRLRQG LALVAMVGAG VTRNPLHCHR FWQQLKGQPV
EFTWQSDDGI SLVAVLRTGP TESLIQGLHQ SVFRAEKRIG LVLFGKGNIG SRWLELFARE
QSTLSARTGF EFVLAGVVDS RRSLLSYDGL DASRALAFFN DEAVEQDEES LFLWMRAHPY
DDLVVLDVTA SQQLADQYLD FASHGFHVIS ANKLAGASDS NKYRQIHDAF EKTGRHWLYN
ATVGAGLPIN HTVRDLIDSG DTILSISGIF SGTLSWLFLQ FDGSVPFTEL VDQAWQQGLT
EPDPRDDLSG KDVMRKLVIL AREAGYNIEP DQVRVESLVP AHCEGGSIDH FFENGDELNE
QMVQRLEAAR EMGLVLRYVA RFDANGKARV GVEAVREDHP LASLLPCDNV FAIESRWYRD
NPLVIRGPGA GRDVTAGAIQ SDINRLAQLL