Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1043 |
Symbol | |
ID | 3831849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1070579 |
End bp | 1071658 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828971 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_429900 |
Protein GI | 83589891 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000130354 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCCTGGCA GGCGCCGTCC CACCCGGCGA ATCCAGGTGG GTAAGGTTGC TATTGGGGGC GGGGCTCCTA TCTCCGTCCA GTCTATGACC AATACCGATA CCCGGGATAT TACCGCTACT GTCGCCCAGA TCAGGAGGCT GGCCGCCGCC GGCTGTGAAA TCGTCCGCCT GGCCGTACCG GATCAAGAAG CGGCCCTGGC CCTGGCGAAA ATAAAGGCCC AGGTAGAGAT ACCTCTTATC GCCGATATCC ACTTCGACTA CCGCCTGGCC CTGGCGGCCC TGGAGGCCGG GGTTGACGGC TTGCGTTTAA ATCCGGGCAA CATTGGCGGG CCTGAGCGGG TAAAGGCGGT AGTCAAAGAG GCTGCTGCCC GCCGGGTGCC CATCCGCATC GGCGTTAACG CCGGTTCCCT GGAGAAAGAA GTCCTGGCGG CCCATGGCGG GGTGACGGCG GAAGCCATGG TTGCCAGTGC CCTAAAACAC ATCCGCCTCC TGGAGGATCT GGATTTCCGG GAGATTAAAG TTTCCCTTAA AGCCTCCGAG GTGCCTTTAA TGCTGGCAGC CTACCGCCTC ATGGCGGAAA AGGTAGATTA CCCTCTGCAC CTGGGGGTTA CCGAAGCCGG CCGGGGGCTG GAAGGAGCGG TAAAATCGGC CGTAGGCATC GGCATTTTAC TCGCAGAGGG GATTGGCGAC ACCATCAGGG TCTCCCTCAC CGGCGACCCG GTCCAGGAGG TTATTGCCGG CTTTGCCATT CTGCGCGCCT TGGGCCTGCG CCAGCAGGGC ATTGAGTTGA TCTCCTGTCC CACCTGCGGC CGCTGCCAGC TGGACCTGGA CGCGGTGGCG GCCAGGGTTC AGGAGGAACT GCGGGGCATT AAACAGCCCC TGAAGGTGGC TATCATGGGC TGCGCCGTCA ACGGCCCCGG GGAGGCCCGC CAGGCTGACG TCGGTATTGC CGGCGGTCCG GGCTTCGGCC TCCTTTTTCG CCACGGTCGC CCGGTACGCA AGGTGAAAGA AGAAGATCTG GCCCGGGCCC TGGTGGAGGA AGTGAAACGC CTGGCGGCAG AGAGGCGGGA ACAGGGATAA
|
Protein sequence | MPGRRRPTRR IQVGKVAIGG GAPISVQSMT NTDTRDITAT VAQIRRLAAA GCEIVRLAVP DQEAALALAK IKAQVEIPLI ADIHFDYRLA LAALEAGVDG LRLNPGNIGG PERVKAVVKE AAARRVPIRI GVNAGSLEKE VLAAHGGVTA EAMVASALKH IRLLEDLDFR EIKVSLKASE VPLMLAAYRL MAEKVDYPLH LGVTEAGRGL EGAVKSAVGI GILLAEGIGD TIRVSLTGDP VQEVIAGFAI LRALGLRQQG IELISCPTCG RCQLDLDAVA ARVQEELRGI KQPLKVAIMG CAVNGPGEAR QADVGIAGGP GFGLLFRHGR PVRKVKEEDL ARALVEEVKR LAAERREQG
|
| |