Gene EcHS_A4174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4174 
SymbolmetL 
ID5594263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4158468 
End bp4160900 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content58% 
IMG OID640923276 
Productbifunctional aspartate kinase II/homoserine dehydrogenase II 
Protein accessionYP_001460735 
Protein GI157163417 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTGA TTGCGCAGGC AGGGGCGAAA GGTCGTCAGC TGCATAAATT TGGTGGCAGT 
AGTCTGGCTG ATGTGAAGTG TTATTTGCGT GTCGCGGGCA TTATGGCGGA GTACTCTCAG
CCTGACGATA TGATGGTGGT TTCCGCCGCC GGTAGCACCA CTAACCAGTT GATTAACTGG
TTGAAACTAA GCCAGACCGA TCGTCTCTCT GCGCATCAGG TTCAACAAAC GCTGCGTCGC
TATCAGTGCG ATCTGATTAG CGGTCTGCTA CCCGCTGAAG AAGCCGATAG CCTCATTAGC
GCTTTTGTCA GCGATCTGGA ACGCCTGGCG GCGTTGCTCG ACAGTGGTAT TAACGATGCG
GTGTATGCGG AAGTGGTGGG CCACGGTGAA GTGTGGTCGG CGCGCCTGAT GTCTGCGGTA
CTTAATCAAC AAGGGCTGCC AGCGGCCTGG CTTGATGCCC GCGAGTTTTT ACGCGCTGAA
CGCGCCGCAC AACCGCAGGT TGATGAAGGG CTTTCTTACC CGTTGCTGCA ACAGCTGCTG
GTGCAACATC CGGGCAAACG TCTGGTGGTG ACCGGATTTA TCAGCCGCAA CAACGCCGGT
GAAACGGTGC TGCTGGGGCG TAACGGTTCC GACTATTCCG CGACACAAAT CGGTGCGCTG
GCGGGTGTTT CTCGCGTAAC CATCTGGAGC GACGTCGCCG GGGTATACAG TGCCGACCCG
CGTAAAGTGA AAGATGCCTG CCTGCTGCCG TTGCTGCGTC TGGATGAGGC CAGCGAACTG
GCGCGCCTGG CGGCTCCCGT TCTTCACGCC CGTACTTTAC AGCCGGTTTC TGGCAGCGAA
ATCGACCTGC AACTGCGCTG TAGCTACACG CCGGATCAAG GTTCCACGCG CATTGAACGC
GTGCTGGCCT CCGGTACTGG TGCGCGTATT GTCACCAGCC ACGATGATGT CTGTTTGATT
GAGTTTCAGG TGCCCGCCAG TCAGGATTTC AAACTGGCGC ATAAAGAGAT CGACCAAATC
CTGAAACGCG CGCAGGTACG CCCGCTGGCG GTTGGCGTAC ATAACGATCG CCAGTTGCTG
CAATTTTGCT ACACCTCAGA AGTGGCCGAC AGTGCGCTGA AAATCCTCGA CGAAGCGGGA
TTACCTGGCG AACTGCGCCT GCGTCAGGGG CTGGCGCTGG TGGCGATGGT CGGTGCAGGC
GTCACCCGTA ACCCGCTGCA TTGCCACCGC TTCTGGCAGC AACTGAAAGG CCAGCCGGTC
GAATTTACCT GGCAGTCCGA TGACGGCATC AGCCTGGTGG CAGTACTGCG CACCGGCCCG
ACCGAAAGCC TGATTCAGGG GCTGCATCAG TCCGTCTTCC GCGCAGAAAA ACGCATCGGC
CTGGTATTGT TCGGTAAGGG CAATATCGGT TCCCGTTGGC TGGAACTGTT CGCCCGTGAG
CAGAGCACGC TTTCGGCACG TACCGGCTTT GAGTTTGTGC TGGCAGGTGT GGTGGACAGC
CGCCGCAGCC TGTTGAGCTA TGACGGGCTG GACGCCAGCC GCGCGTTAGC CTTCTTCAAC
GATGAAGCGG TTGAGCAGGA TGAAGAGTCG TTGTTCCTGT GGATGCGCGC CCATCCGTAT
GATGATTTAG TGGTGCTGGA CGTTACCGCC AGCCAGCAGC TTGCTGATCA GTATCTTGAT
TTCGCCAGCC ACGGTTTCCA CGTTATCAGC GCCAACAAAC TGGCGGGAGC CAGCGACAGC
AATAAATATC GCCAGATCCA CGACGCCTTC GAAAAAACCG GGCGTCACTG GCTGTACAAT
GCCACCGTCG GTGCGGGCTT GCCGATCAAC CACACCGTGC GCGATCTGAT CGACAGCGGC
GATACTATTT TGTCGATCAG CGGGATCTTC TCCGGCACGC TCTCCTGGCT GTTCCTGCAA
TTCGACGGTA GCGTGCCGTT TACCGAGCTG GTGGATCAGG CGTGGCAGCA GGGCTTAACC
GAACCTGACC CGCGTGACGA TCTCTCTGGC AAAGACGTGA TGCGCAAGCT GGTGATTCTG
GCGCGTGAAG CAGGTTACAA CATCGAACCG GATCAGGTAC GTGTGGAATC GCTGGTGCCT
GCTCATTGCG AAGGCGGCAG CATCGACCAT TTCTTTGAAA ATGGCGATGA ACTGAACGAG
CAGATGGTGC AACGGCTGGA AGCGGCCCGC GAAATGGGGC TGGTGCTGCG CTACGTGGCG
CGTTTCGATG CCAACGGTAA AGCGCGTGTA GGCGTGGAAG CGGTGCGTGA AGATCATCCG
TTGGCATCAC TGCTGCCGTG CGATAACGTC TTTGCCATCG AAAGCCGCTG GTATCGCGAT
AACCCTCTGG TGATCCGCGG ACCTGGCGCT GGGCGCGACG TCACCGCCGG GGCGATTCAG
TCGGATATCA ACCGGCTGGC ACAGTTGTTG TAA
 
Protein sequence
MSVIAQAGAK GRQLHKFGGS SLADVKCYLR VAGIMAEYSQ PDDMMVVSAA GSTTNQLINW 
LKLSQTDRLS AHQVQQTLRR YQCDLISGLL PAEEADSLIS AFVSDLERLA ALLDSGINDA
VYAEVVGHGE VWSARLMSAV LNQQGLPAAW LDAREFLRAE RAAQPQVDEG LSYPLLQQLL
VQHPGKRLVV TGFISRNNAG ETVLLGRNGS DYSATQIGAL AGVSRVTIWS DVAGVYSADP
RKVKDACLLP LLRLDEASEL ARLAAPVLHA RTLQPVSGSE IDLQLRCSYT PDQGSTRIER
VLASGTGARI VTSHDDVCLI EFQVPASQDF KLAHKEIDQI LKRAQVRPLA VGVHNDRQLL
QFCYTSEVAD SALKILDEAG LPGELRLRQG LALVAMVGAG VTRNPLHCHR FWQQLKGQPV
EFTWQSDDGI SLVAVLRTGP TESLIQGLHQ SVFRAEKRIG LVLFGKGNIG SRWLELFARE
QSTLSARTGF EFVLAGVVDS RRSLLSYDGL DASRALAFFN DEAVEQDEES LFLWMRAHPY
DDLVVLDVTA SQQLADQYLD FASHGFHVIS ANKLAGASDS NKYRQIHDAF EKTGRHWLYN
ATVGAGLPIN HTVRDLIDSG DTILSISGIF SGTLSWLFLQ FDGSVPFTEL VDQAWQQGLT
EPDPRDDLSG KDVMRKLVIL AREAGYNIEP DQVRVESLVP AHCEGGSIDH FFENGDELNE
QMVQRLEAAR EMGLVLRYVA RFDANGKARV GVEAVREDHP LASLLPCDNV FAIESRWYRD
NPLVIRGPGA GRDVTAGAIQ SDINRLAQLL