Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0003 |
Symbol | thrA |
ID | 6967964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 360 |
End bp | 2816 |
Gene Length | 2457 bp |
Protein Length | 818 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384087 |
Product | bifunctional aspartokinase I/homeserine dehydrogenase I |
Protein accession | YP_002268610 |
Protein GI | 209398223 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00657] aspartate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.726282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGAAGT TCGGCGGTAC ATCAGTGGCA AATGCAGAAC GTTTTCTGCG GGTTGCCGAT ATTCTGGAAA GCAATGCCAG GCAGGGGCAG GTGGCCACCG TCCTCTCTGC CCCCGCCAAA ATCACCAACC ACCTGGTGGC GATGATTGAA AAAACCATTA GCGGCCAGGA TGCTTTACCC AATATCAGCG ATGCCGAACG TATTTTTGCC GAACTTCTGA CGGGACTCGC CGCCGCCCAG CCGGGATTCC CGCTGGCGCA ATTGAAAACT TTCGTCGACC AGGAATTTGC CCAAATAAAA CATGTCCTGC ATGGCATTAG TTTGTTAGGG CAGTGCCCGG ATAGCATTAA CGCTGCGCTG ATTTGCCGTG GCGAGAAAAT GTCGATCGCC ATTATGGCCG GCGTATTAGA AGCGCGCGGT CACAACGTTA CCGTTATCGA TCCGGTCGAA AAACTGCTGG CAGTGGGGCA TTACCTCGAA TCTACTGTCG ATATTGCAGA GTCCACCCGC CGTATTGCGG CAAGTCGTAT TCCGGCTGAT CACATGGTGC TGATGGCAGG TTTCACCGCC GGTAATGAAA AAGGCGAACT GGTGGTACTT GGACGCAACG GTTCCGACTA CTCCGCGGCG GTGCTGGCTG CCTGTTTACG CGCCGATTGT TGCGAGATTT GGACGGACGT TGACGGGGTA TATACCTGCG ACCCGCGTCA GGTGCCCGAT GCGAGGTTGT TGAAATCGAT GTCCTACCAG GAAGCGATGG AGCTTTCCTA CTTCGGCGCT AAAGTTCTTC ACCCCCGCAC CATTACCCCC ATCGCCCAGT TCCAGATCCC TTGCCTGATT AAAAATACCG GAAATCCTCA AGCTCCAGGT ACGCTCATTG GTGCCAGTCG TGATGAAGAC GAATTACCGG TCAAGGGCAT TTCCAATCTG AATAATATGG CAATGTTCAG CGTTTCCGGC CCGGGGATGA AAGGAATGGT CGGCATGGCG GCGCGCGTCT TTGCTGCAAT GTCACGCGCC CGTATTTCCG TGGTGCTGAT TACGCAATCA TCTTCCGAAT ACAGTATCAG TTTCTGCGTT CCGCAAAGCG ACTGTGTGCG AGCTGAACGG GCAATGCAGG AAGAGTTCTA CCTGGAACTG AAAGAAGGCT TACTGGAGCC GCTGGCGGTG ACGGAACGGC TGGCCATTAT CTCGGTGGTA GGTGATGGTA TGCGCACCTT GCGTGGGATC TCGGCGAAAT TCTTTGCCGC GCTGGCCCGC GCCAATATCA ACATTGTCGC TATTGCTCAG GGATCTTCTG AACGCTCAAT CTCTGTCGTG GTAAATAACG ATGATGCGAC CACTGGCGTG CGCGTTACTC ATCAGATGCT GTTCAATACC GATCAGGTTA TCGAAGTGTT TGTGATTGGC GTCGGTGGCG TTGGCGGTGC GCTGCTGGAG CAACTGAAGC GTCAGCAAAG CTGGTTGAAG AATAAACATA TCGACTTACG TGTCTGCGGT GTTGCTAACT CGAAGGCTCT GCTCACCAAT GTGCATGGCC TAAATCTGGA AAACTGGCAG GAAGAACTGG CGCAAGCCAA AGAGCCGTTT AATCTCGGGC GCTTAATTCG CCTCGTGAAA GAATATCATC TGCTGAACCC GGTCATTGTT GACTGCACCT CCAGCCAGGC AGTGGCGGAT CAATATGCCG ACTTCCTGCG CGAAGGTTTC CACGTTGTCA CGCCGAACAA AAAGGCCAAC ACCTCGTCGA TGGATTACTA CCATCTGTTG CGTCATGCGG CTGAAAAATC GCGGCGTAAA TTCCTCTATG ACACCAACGT TGGGGCTGGA TTACCGGTTA TTGAGAACCT GCAAAATCTG CTCAATGCTG GTGATGAATT GATGAAGTTC TCCGGCATTC TTTCAGGTTC GCTTTCTTAT ATCTTCGGCA AGTTAGACGA AGGCATGAGT TTCTCCGAGG CGACTACGCT GGCGCGGGAA ATGGGTTATA CCGAACCGGA TCCGCGAGAT GATCTTTCTG GTATGGATGT AGCGCGTAAA CTATTAATTC TCGCTCGTGA AACGGGACGT GAACTGGAGC TGGCGGATAT TGAAATTGAA CCTGTGCTGC CCGCAGAGTT TAACGCTGAG GGTGATGTTG CCGCTTTTAT GGCGAATCTG TCACAGCTCG ACGATCTCTT TGCCGCGCGC GTGGCGAAGG CCCGTGATGA AGGAAAAGTT TTGCGCTATG TTGGCAATAT TGATGAAGAT GGCGTCTGCC GCGTGAAGAT TGCCGAAGTG GATGGTAATG ATCCGCTGTT CAAAGTGAAA AATGGCGAAA ACGCCCTGGC CTTTTATAGC CACTATTATC AGCCGCTGCC GTTGGTGCTG CGCGGATATG GTGCGGGCAA TGACGTTACC GCTGCCGGTG TCTTTGCCGA TCTGCTACGT ACCCTCTCAT GGAAGTTAGG AGTCTGA
|
Protein sequence | MLKFGGTSVA NAERFLRVAD ILESNARQGQ VATVLSAPAK ITNHLVAMIE KTISGQDALP NISDAERIFA ELLTGLAAAQ PGFPLAQLKT FVDQEFAQIK HVLHGISLLG QCPDSINAAL ICRGEKMSIA IMAGVLEARG HNVTVIDPVE KLLAVGHYLE STVDIAESTR RIAASRIPAD HMVLMAGFTA GNEKGELVVL GRNGSDYSAA VLAACLRADC CEIWTDVDGV YTCDPRQVPD ARLLKSMSYQ EAMELSYFGA KVLHPRTITP IAQFQIPCLI KNTGNPQAPG TLIGASRDED ELPVKGISNL NNMAMFSVSG PGMKGMVGMA ARVFAAMSRA RISVVLITQS SSEYSISFCV PQSDCVRAER AMQEEFYLEL KEGLLEPLAV TERLAIISVV GDGMRTLRGI SAKFFAALAR ANINIVAIAQ GSSERSISVV VNNDDATTGV RVTHQMLFNT DQVIEVFVIG VGGVGGALLE QLKRQQSWLK NKHIDLRVCG VANSKALLTN VHGLNLENWQ EELAQAKEPF NLGRLIRLVK EYHLLNPVIV DCTSSQAVAD QYADFLREGF HVVTPNKKAN TSSMDYYHLL RHAAEKSRRK FLYDTNVGAG LPVIENLQNL LNAGDELMKF SGILSGSLSY IFGKLDEGMS FSEATTLARE MGYTEPDPRD DLSGMDVARK LLILARETGR ELELADIEIE PVLPAEFNAE GDVAAFMANL SQLDDLFAAR VAKARDEGKV LRYVGNIDED GVCRVKIAEV DGNDPLFKVK NGENALAFYS HYYQPLPLVL RGYGAGNDVT AAGVFADLLR TLSWKLGV
|
| |