Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0001 |
Symbol | thrA |
ID | 6142593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 336 |
End bp | 2798 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641614902 |
Product | bifunctional aspartokinase I/homeserine dehydrogenase I |
Protein accession | YP_001742118 |
Protein GI | 170684018 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00657] aspartate kinase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.880494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT TTACCCAATA TCAGCGATGC CGAACGTATT TTTGCCGAAC TTTTGACGGG ACTCGCCGCC GCCCAGCCGG GGTTCCCGCT GGCGCAATTG AAAACTTTCG TCGATCAGGA ATTTGCCCAA ATAAAACATG TCCTGCATGG CATTAGTTTG TTGGGGCAGT GCCCGGATAG CATCAACGCT GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT ATTAGAAGCG CGCGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TGCTGGCAGT GGGGCATTAC CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGTATTCCG GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG GTGCTTGGAC GCAACGGTTC CGACTACTCT GCTGCGGTGC TGGCTGCCTG TCTACGCGCC GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTCTATA CCTGCGACCC GCGTCAGGTG CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC GGCGCTAAAG TTCTTCACCC CCGCACAATT ACCCCTATCG CCCAGTTCCA GATCCCTTGC CTGATTAAAA ATACCGGAAA TCCTCAAGCA CCAGGTACGC TCATTGGTGC CAGCCGTGAT GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ACATGGCAAT GTTCAGCGTT TCCGGCCCGG GGATGAAAGG GATGGTTGGC ATGGCGGCGC GTGTCTTTGC AGCGATGTCA CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCGTCTT CTGAATACAG TATCAGTTTC TGCGTTCCAC AAAGCGACTG TGTGCGAGCT GAACGGGCGA TGCAGGAAGA GTTCTATCTT GAACTGAAGG AAGGCTTGCT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT GTCGTGGTCA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC AACACCGATC AGGTTATCGA AGTGTTTGTG ATTGGTGTCG GTGGCGTTGG CGGTGCGCTG CTGGAGCAAC TGAAGCGTCA GCAAAGCTGG TTGAAGAATA AACATATCGA CTTACGTGTC TGCGGTGTTG CCAACTCGAA GGCACTGCTC ACCAATGTAC ATGGCCTTAA TCTGGAAAAC TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC GTGAAAGAAT ATCATCTGCT GAACCCGGTC ATTGTTGACT GCACTTCCAG CCAGGCAGTG GCGGATCAAT ATGCCGACTT CCTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG GCCAACACCT CGTCGATGGA TTACTACCAT CAGTTGCGTT ATGCGGCGGA AAAATCGCGG CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA AATCTGCTCA ATGCAGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC TGGTTCGCTT TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC CACGCTGGCG CGGGAAATGG GTTATACCGA ACCGGACCCG CGAGATGATC TTTCTGGTAT GGATGTGGCG CGTAAGCTAT TGATTCTCGC CCGTGAAACG GGACGTGAAC TGGAACTGGC GGATATTGAA ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCAGAGGGTG ATGTTGCCGC TTTTATGGCG AATCTGTCAC AGCTCGACGA TCTCTTTGCC GCACGCGTGG CGAAGGCTCG TGATGAGGGC AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGCA TCTGCCGCGT GAAGATTGCC GAAGTGGATG GCAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTC TATAGCCACT ATTATCAGCC GCTGCCGTTG GTTCTGCGCG GATATGGCGC GGGCAATGAC GTTACAGCTG CTGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC TGA
|
Protein sequence | MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH QLRYAAEKSR RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA NLSQLDDLFA ARVAKARDEG KVLRYVGNID EDGICRVKIA EVDGNDPLFK VKNGENALAF YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV
|
| |