Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0009 |
Symbol | |
ID | 3831881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 9450 |
End bp | 10340 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637827936 |
Product | pyridoxal biosynthesis lyase PdxS |
Protein accession | YP_428892 |
Protein GI | 83588883 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0214] Pyridoxine biosynthesis enzyme |
TIGRFAM ID | [TIGR00343] pyridoxal 5'-phosphate synthase, synthase subunit Pdx1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000000232044 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000788382 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCAGCAG CAGAAGTAGG AACCTGGACG GTCAAAAAGG GCCTGGCCGA GATGCTCAAG GGCGGCGTCA TTATGGATGT GACCACCCCG GAACAGGCTA AAATCGCCGA GGAGGCGGGG GCCTGCGCGG TTATGGCCCT GGAACGGGTG CCGGCCGACA TCCGGGCCGC CGGCGGGGTG GCCCGGATGG CCGATCCCAC GGTTATCTTA AGAATTATGG ACGCCGTAAC CATTCCGGTC ATGGCCAAGG CCAGGATCGG CCACTTTGTC GAGGCCCAGA TCCTGGAGGC CCTGGGTGTC GATTATATTG ATGAGAGCGA AGTTCTTACC CCGGCTGACG AGGACTTCCA TATCAATAAG CACGAGTTCA AGGTTCCCTT TGTCTGTGGC GCCCGCAATC TCGGTGAGGC TCTAAGACGC ATCGGCGAGG GAGCAGCCAT GATCCGCACC AAGGGTGAAC CCGGTACCGG CAACGTGGTC GAAGCTGTGC GCCACATGCG CCGGGTAATG AGCGAGATCC GGCGCCTGCA GAATCTACCC GACGAGGAAC TGATGACCTT TGCCAAAGAA ATCCAGGCAC CCTATGAATT AGTGAAACAG GTAAAGGAAC TGGGACGGTT GCCGGTGGTC AATTTCGCCG CCGGCGGCAT CGCCACCCCG GCCGATGCGG CTCTAATGAT GCAACTGGGG GCCGATGGCA TATTTGTGGG CTCTGGAATC TTTAAATCCA GCGATCCGAG GAAACGGGCC CGGGCCATTG TCGCCGCCAC CACCCACTTC CGTGAACCAG AGGTTTTGGC CGAGGTCTCC CGGGACCTGG GCGAAGCTAT GCCTGGCATA GAGATTGCTA CCATAAAACC TGAAGAACGC ATGCAGGAAC GCGGTTGGTA A
|
Protein sequence | MAAAEVGTWT VKKGLAEMLK GGVIMDVTTP EQAKIAEEAG ACAVMALERV PADIRAAGGV ARMADPTVIL RIMDAVTIPV MAKARIGHFV EAQILEALGV DYIDESEVLT PADEDFHINK HEFKVPFVCG ARNLGEALRR IGEGAAMIRT KGEPGTGNVV EAVRHMRRVM SEIRRLQNLP DEELMTFAKE IQAPYELVKQ VKELGRLPVV NFAAGGIATP ADAALMMQLG ADGIFVGSGI FKSSDPRKRA RAIVAATTHF REPEVLAEVS RDLGEAMPGI EIATIKPEER MQERGW
|
| |