Gene EcolC_4075 details

Gene Information

Locus tag: EcolC_4075
Symbol: metL
ID: 6065484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4499561
End bp: 4501993
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 59%
IMG OID: 641603498
Product: bifunctional aspartate kinase II/homoserine dehydrogenase II
Protein accession: YP_001727001
Protein GI: 170022047
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase; [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00657] aspartate kinase
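
The start and end coordinates, gene length, and protein length listed above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither is stated on this page); the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 4499561
end_bp = 4501993

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2433 810
```

Under these assumptions the computed values agree with the listed Gene Length (2433 bp) and Protein Length (810 aa).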


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTGA TTGCGCAGGC AGGGGCGAAA GGTCGTCAGC TGCATAAATT TGGTGGCAGT 
AGTCTGGCTG ATGTGAAGTG TTATTTGCGT GTCGCGGGCA TTATGGCGGA GTTCTCTCAG
CCTGACGATA TGATGGTGGT TTCCGCCGCC GGTAGCACCA CTAACCAGTT GATTAACTGG
TTGAAACTAA GCCAGACCGA TCGTCTCTCT GCGCATCAGG TTCAACAAAC GCTGCGTCGC
TATCAGTGCG ATCTGATTAG CGGTCTGCTA CCCGCTGAAG AAGCCGATAG CCTCATTAGC
GCTTTTGTCA GCGATCTGGA ACGCCTGGCG GCGTTGCTCG ACAGTGGTAT TAACGATGCG
GTGTATGCGG AAGTGGTGGG CCACGGTGAA GTGTGGTCGG CGCGCCTGAT GTCTGCGGTA
CTTAATCAAC AAGGGCTGCC AGCGGCCTGG CTTGATGCCC GCGAGTTTTT ACGCGCTGAA
CGCGCCGCAC AACCGCAGGT TGATGAAGGG CTTTCTTACC CGTTGCTGCA ACAGCTGCTG
GTGCAACATC CGGGCAAACG TCTGGTGGTG ACCGGATTTA TCAGCCGCAA CAACGCCGGT
GAAACGGTGC TGCTGGGGCG TAACGGTTCC GACTATTCCG CGACACAAAT CGGTGCGCTG
GCGGGTGTTT CTCGCGTAAC CATCTGGAGC GACGTCGCCG GGGTATACAG TGCCGACCCG
CGTAAAGTGA AAGATGCCTG CCTGCTGCCG TTGCTGCGTC TGGATGAGGC CAGCGAACTG
GCGCGCCTGG CGGCTCCCGT TCTTCACGCC CGTACTTTAC AGCCGGTTTC TGGCAGCGAA
ATCGACCTGC AACTGCGCTG TAGCTACACG CCGGATCAAG GTTCCACGCG CATTGAACGC
GTGCTGGCCT CCGGTACTGG TGCGCGTATT GTCACCAGCC ACGATGATGT CTGTTTGATT
GAGTTTCAGG TGCCCGCCAG TCAGGATTTC AAACTGGCGC ATAAAGAGAT CGACCAAATC
CTGAAACGCG CGCAGGTACG CCCGCTGGCG GTTGGCGTAC ATAACGATCG CCAGTTGCTG
CAATTTTGCT ACACCTCAGA AGTGGCCGAC AGTGCGCTGA AAATCCTCGA CGAAGCGGGA
TTACCTGGCG AACTGCGCCT GCGTCAGGGG CTGGCGCTGG TGGCGATGGT CGGTGCAGGC
GTCACCCGTA ACCCGCTGCA TTGCCACCGC TTCTGGCAGC AACTGAAAGG CCAGCCGGTC
GAATTTACCT GGCAGTCCGA TGACGGCATC AGCCTGGTGG CAGTACTGCG CACCGGCCCG
ACCGAAAGCC TGATTCAGGG GCTGCATCAG TCCGTCTTCC GCGCAGAAAA ACGCATCGGC
CTGGTATTGT TCGGTAAGGG CAATATCGGT TCCCGTTGGC TGGAACTGTT CGCCCGTGAG
CAGAGCACGC TTTCGGCACG TACCGGCTTT GAGTTTGTGC TGGCAGGTGT GGTGGACAGC
CGCCGCAGCC TGTTGAGCTA TGACGGGCTG GACGCCAGCC GCGCGTTAGC CTTCTTCAAC
GATGAAGCGG TTGAGCAGGA TGAAGAGTCG TTGTTCCTGT GGATGCGCGC CCATCCGTAT
GATGATTTAG TGGTGCTGGA CGTTACCGCC AGCCAGCAGC TTGCTGATCA GTATCTTGAT
TTCGCCAGCC ACGGTTTCCA CGTTATCAGC GCCAACAAAC TGGCGGGAGC CAGCGACAGC
AATAAATATC GCCAGATCCA CGACGCCTTC GAAAAAACCG GGCGTCACTG GCTGTACAAT
GCCACCGTCG GTGCGGGCTT GCCGATCAAC CACACCGTGC GCGATCTGAT CGACAGCGGC
GATACTATTT TGTCGATCAG CGGGATCTTC TCCGGCACGC TCTCCTGGCT GTTCCTGCAA
TTCGACGGTA GCGTGCCGTT TACCGAGCTG GTGGATCAGG CGTGGCAGCA GGGCTTAACC
GAGCCTGATC CGCGTGACGA CCTCTCCGGC AAAGACGTAA TGCGCAAGCT GGTAATTCTG
GCGCGTGAAG CGGGTTACAA CATTGAACCG GACCAGGTAC GTGTGGAATC GCTGGTGCCT
GCTCATTGCG AAGGCGGCAG CATCGACCAT TTCTTTGAAA ATGGCGATGA ACTTAACGAG
CAGATGGTGC AACGGCTGGA AGCGGCCCGC GAAATGGGGC TGGTGCTACG CTACGTGGCG
CGTTTCGATG CCAACGGCAA AGCGCGTGTG GGCGTGGAGG CGGTGCGTGA AGACCATCCG
CTGGCATCAC TGCTGCCGTG CGATAACGTC TTTGCCATCG AAAGCCGCTG GTATCGCGAC
AACCCGCTGG TGATCCGTGG GCCTGGCGCG GGGCGCGACG TCACCGCCGG GGCGATTCAG
TCGGATATCA ACCGGCTGGC ACAGTTGTTG TAA
 
Protein sequence
MSVIAQAGAK GRQLHKFGGS SLADVKCYLR VAGIMAEFSQ PDDMMVVSAA GSTTNQLINW 
LKLSQTDRLS AHQVQQTLRR YQCDLISGLL PAEEADSLIS AFVSDLERLA ALLDSGINDA
VYAEVVGHGE VWSARLMSAV LNQQGLPAAW LDAREFLRAE RAAQPQVDEG LSYPLLQQLL
VQHPGKRLVV TGFISRNNAG ETVLLGRNGS DYSATQIGAL AGVSRVTIWS DVAGVYSADP
RKVKDACLLP LLRLDEASEL ARLAAPVLHA RTLQPVSGSE IDLQLRCSYT PDQGSTRIER
VLASGTGARI VTSHDDVCLI EFQVPASQDF KLAHKEIDQI LKRAQVRPLA VGVHNDRQLL
QFCYTSEVAD SALKILDEAG LPGELRLRQG LALVAMVGAG VTRNPLHCHR FWQQLKGQPV
EFTWQSDDGI SLVAVLRTGP TESLIQGLHQ SVFRAEKRIG LVLFGKGNIG SRWLELFARE
QSTLSARTGF EFVLAGVVDS RRSLLSYDGL DASRALAFFN DEAVEQDEES LFLWMRAHPY
DDLVVLDVTA SQQLADQYLD FASHGFHVIS ANKLAGASDS NKYRQIHDAF EKTGRHWLYN
ATVGAGLPIN HTVRDLIDSG DTILSISGIF SGTLSWLFLQ FDGSVPFTEL VDQAWQQGLT
EPDPRDDLSG KDVMRKLVIL AREAGYNIEP DQVRVESLVP AHCEGGSIDH FFENGDELNE
QMVQRLEAAR EMGLVLRYVA RFDANGKARV GVEAVREDHP LASLLPCDNV FAIESRWYRD
NPLVIRGPGA GRDVTAGAIQ SDINRLAQLL
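
As a spot check on the two sequences above, the first 60 bases of the gene sequence can be translated and compared with the first 20 residues of the protein sequence. This is a minimal sketch, not the method used to generate this page. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons are permitted as starts). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_60 = "ATGAGTGTGA TTGCGCAGGC AGGGGCGAAA GGTCGTCAGC TGCATAAATT TGGTGGCAGT"
print(translate(first_60))  # MSVIAQAGAKGRQLHKFGGS
```

The printed residues match the first two 10-residue groups of the protein sequence. Passing the full gene sequence to the same function would allow the whole 810-residue translation to be compared.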