Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3653 |
Symbol | thrA |
ID | 6065835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3999502 |
End bp | 4001964 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603068 |
Product | bifunctional aspartokinase I/homeserine dehydrogenase I |
Protein accession | YP_001726591 |
Protein GI | 170021637 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00657] aspartate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT TTACCCAATA TCAGCGATGC TGAACGTATT TTTGCCGAAC TTCTGACGGG ACTCGCCGCC GCCCAGCCGG GATTCCCGCT GGCGCAATTG AAAACTTTCG TCGACCAGGA ATTTGCTCAA ATAAAACATG TCCTGCATGG CATTAGTTTG TTAGGGCAGT GCCCGGATAG CATCAACGCT GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT GTTAGAAGCG CGTGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TACTGGCAGT GGGGCATTAC CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGTATTCCG GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG GTGCTTGGAC GTAACGGTTC CGACTACTCC GCTGCGGTGC TGGCTGCCTG TTTACGCGCC GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTTTATA CCTGCGACCC GCGTCAGGTG CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC GGCGCTAAAG TTCTTCACCC CCGCACCATT ACCCCCATCG CCCAGTTCCA GATCCCTTGC CTGATTAAAA ATACCGGAAA TCCTCAAGCA CCAGGTACGC TCATTGGTGC CAGCCGTGAT GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ACATGGCAAT GTTCAGCGTT TCCGGCCCGG GGATGAAAGG AATGGTCGGC ATGGCGGCGC GCGTCTTTGC TGCAATGTCA CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCATCTT CCGAATACAG TATCAGTTTC TGCGTTCCGC AAAGCGACTG TGTGCGAGCT GAACGGGCAA TGCAGGAAGA GTTCTACCTG GAACTGAAAG AAGGCTTACT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT GTCGTGGTAA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC AATACCGATC AGGTTATCGA AGTGTTTGTG ATTGGCGTCG GTGGCGTTGG CGGTGCGCTG CTGGAGCAAC TGAAGCGTCA ACAAAGCTGG CTGAAGAATA AACATATCGA CTTACGTGTC TGCGGCGTTG CCAACTCGAA GGCTCTGCTT ACCAATGTGC ATGGCCTAAA CCTGGAAAAC TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC GTGAAAGAAT ATCATCTGCT AAACCCGGTC ATTGTTGACT GCACCTCCAG CCAGGCAGTG GCGGATCAAT ATGCCGACTT CTTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG GCCAACACCT CGTCGATGGA TTACTACCAT CTGTTGCGTC ATGCGGCGGA AAAATCGCGG CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA AATCTGCTCA ATGCTGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC AGGTTCGCTT TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC TACTCTGGCG CGGGAAATGG GTTATACCGA ACCGGATCCG CGAGATGATC TTTCTGGTAT GGATGTAGCG CGTAAGCTAT TGATTCTCGC TCGTGAAACG GGACGTGAAC TGGAGCTGGC GGATATTGAA ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCCGAGGGTG ATGTTGCCGC TTTTATGGCG AATCTGTCAC AGCTCGACGA GCTCTTTGCC GCGCGCGTGG CGAAGGCCCG TGATGAAGGA AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGCG TCTGCCGCGT GAAGATTGCC GAAGTGGATG GGAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTC TATAGCCACT ATTATCAGCC GCTGCCGTTG GTGCTGCGCG GATATGGTGC GGGCAATGAC GTTACCGCTG CTGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC TGA
|
Protein sequence | MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH LLRHAAEKSR RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA NLSQLDELFA ARVAKARDEG KVLRYVGNID EDGVCRVKIA EVDGNDPLFK VKNGENALAF YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV
|
| |