Gene Information |
Locus tag | Moth_1335 |
Symbol | |
ID | 3831045 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1381337 |
End bp | 1382365 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829271 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_430191 |
Protein GI | 83590182 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
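
The start, end, gene-length, and protein-length fields above are linked by simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon. Both assumptions are consistent with the values in this record, but the record does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1381337, 1382365)
print(length_bp)                  # 1029
print(protein_length(length_bp))  # 342
```

The strand field does not enter this calculation: on the minus strand the start and end coordinates are still reported in ascending order in this record.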
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0331189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.222136 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATAG TCATGCAACC CGGAGCCACC CGCGAAGAAG AGCAGAGAGT TATCAGTCAC CTGGAACGGG AGGGTTTCAA GGTGCACCTG TCACGGGGTA TAGAACGGAC TATAATCGGC GTAATAGGCG ATAAAACCCG GTTAAAGGCC GAAACTGTAA CGGCCCTGGC CGGAGTGGAG AAGGTTGTCC CCATTTTGCA ACCCTATAAG CTGGCCAGCC GGGATTTCCA TCCCGAGGAT ACGGTGGTAG AAACAGGCAA TCGGACCATA GGCGGCGCGG CGGTGCAGAT AATCGCCGGG CCATGTGCCG TGGAAAGCCG TGACCAGCTC CTGGAGGCTG CCGACGCTGT CCGGGAGGCG GGGGCCACCA TGCTCCGCGG CGGCGCCTAT AAACCCCGCA GCTCACCTTA CTCCTTCCAG GGGCTGGCTG CCAAGGGTCT GGAAATCCTG GCTGAGGCCC GGGAACGTAC CGGCCTGCCG GTGGTCACCG AGGTTATGGA CCAGAACCTG GTAGAAGATG TAGCCGCCGT TGCCGACGTC CTGCAGATCG GCTCCCGCAA TATGCAGAAC TTTGCCCTCC TGCAGGCCGT AGGCCAGACA AACAAACCCG TCCTCCTAAA GCGCGGTCTG GCAGCCACCA TTGAAGAATG GCTTCTGGCT GCCGAGTACA TCCTCAACGC AGGCAATAGC CGGGTGATTT TATGCGAGAG AGGTATCCGC ACCTTTGAGA CCTATACCCG CAATACCCTG GACCTCAGCG CGGTACCGGC GGTAAAGCAT CTCTCCCACC TGCCTGTAAT TGTCGATCCC AGCCACGGGA CCGGGCGCCG GTTTATGGTG GCCCCTATGG CCCGGGCCGC CCTGGCAGCC GGGGCCGATG GGATCATGGT TGAGGTACAC CCCCGACCCC AGGAGGCCCT CTCCGACGGC TCCCAGTCCC TGGACCCGGA ACAGTTCGCC GCCCTGGTCA GGGAGATCCG GCCCATCATC ACCGCCAGCG GGCGTGAACT GGAGCGGCAG GCCGTATGA
|
Protein sequence | MIIVMQPGAT REEEQRVISH LEREGFKVHL SRGIERTIIG VIGDKTRLKA ETVTALAGVE KVVPILQPYK LASRDFHPED TVVETGNRTI GGAAVQIIAG PCAVESRDQL LEAADAVREA GATMLRGGAY KPRSSPYSFQ GLAAKGLEIL AEARERTGLP VVTEVMDQNL VEDVAAVADV LQIGSRNMQN FALLQAVGQT NKPVLLKRGL AATIEEWLLA AEYILNAGNS RVILCERGIR TFETYTRNTL DLSAVPAVKH LSHLPVIVDP SHGTGRRFMV APMARAALAA GADGIMVEVH PRPQEALSDG SQSLDPEQFA ALVREIRPII TASGRELERQ AV
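
The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a few lines of Python. This is a minimal sketch, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand, as the leading ATG suggests, and that whitespace is only formatting. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, so the standard codon table is used here. The helper names are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased;
    this record starts with ATG, so that does not matter here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 9 bases of the gene sequence in this record.
print(translate("ATGATCATAG TCATGCAACC CGGAGCCACC"))  # MIIVMQPGAT
print(translate("GCCGTATGA"))                          # AV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the GC content field listed in this record.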