Gene Moth_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1043 
Symbol 
ID3831849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1070579 
End bp1071658 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content63% 
IMG OID637828971 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_429900 
Protein GI83589891 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000130354 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCCTGGCA GGCGCCGTCC CACCCGGCGA ATCCAGGTGG GTAAGGTTGC TATTGGGGGC 
GGGGCTCCTA TCTCCGTCCA GTCTATGACC AATACCGATA CCCGGGATAT TACCGCTACT
GTCGCCCAGA TCAGGAGGCT GGCCGCCGCC GGCTGTGAAA TCGTCCGCCT GGCCGTACCG
GATCAAGAAG CGGCCCTGGC CCTGGCGAAA ATAAAGGCCC AGGTAGAGAT ACCTCTTATC
GCCGATATCC ACTTCGACTA CCGCCTGGCC CTGGCGGCCC TGGAGGCCGG GGTTGACGGC
TTGCGTTTAA ATCCGGGCAA CATTGGCGGG CCTGAGCGGG TAAAGGCGGT AGTCAAAGAG
GCTGCTGCCC GCCGGGTGCC CATCCGCATC GGCGTTAACG CCGGTTCCCT GGAGAAAGAA
GTCCTGGCGG CCCATGGCGG GGTGACGGCG GAAGCCATGG TTGCCAGTGC CCTAAAACAC
ATCCGCCTCC TGGAGGATCT GGATTTCCGG GAGATTAAAG TTTCCCTTAA AGCCTCCGAG
GTGCCTTTAA TGCTGGCAGC CTACCGCCTC ATGGCGGAAA AGGTAGATTA CCCTCTGCAC
CTGGGGGTTA CCGAAGCCGG CCGGGGGCTG GAAGGAGCGG TAAAATCGGC CGTAGGCATC
GGCATTTTAC TCGCAGAGGG GATTGGCGAC ACCATCAGGG TCTCCCTCAC CGGCGACCCG
GTCCAGGAGG TTATTGCCGG CTTTGCCATT CTGCGCGCCT TGGGCCTGCG CCAGCAGGGC
ATTGAGTTGA TCTCCTGTCC CACCTGCGGC CGCTGCCAGC TGGACCTGGA CGCGGTGGCG
GCCAGGGTTC AGGAGGAACT GCGGGGCATT AAACAGCCCC TGAAGGTGGC TATCATGGGC
TGCGCCGTCA ACGGCCCCGG GGAGGCCCGC CAGGCTGACG TCGGTATTGC CGGCGGTCCG
GGCTTCGGCC TCCTTTTTCG CCACGGTCGC CCGGTACGCA AGGTGAAAGA AGAAGATCTG
GCCCGGGCCC TGGTGGAGGA AGTGAAACGC CTGGCGGCAG AGAGGCGGGA ACAGGGATAA
 
Protein sequence
MPGRRRPTRR IQVGKVAIGG GAPISVQSMT NTDTRDITAT VAQIRRLAAA GCEIVRLAVP 
DQEAALALAK IKAQVEIPLI ADIHFDYRLA LAALEAGVDG LRLNPGNIGG PERVKAVVKE
AAARRVPIRI GVNAGSLEKE VLAAHGGVTA EAMVASALKH IRLLEDLDFR EIKVSLKASE
VPLMLAAYRL MAEKVDYPLH LGVTEAGRGL EGAVKSAVGI GILLAEGIGD TIRVSLTGDP
VQEVIAGFAI LRALGLRQQG IELISCPTCG RCQLDLDAVA ARVQEELRGI KQPLKVAIMG
CAVNGPGEAR QADVGIAGGP GFGLLFRHGR PVRKVKEEDL ARALVEEVKR LAAERREQG