Gene Moth_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0808 
Symbol 
ID3832139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp838139 
End bp839275 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content64% 
IMG OID637828739 
Productgeranylgeranyl reductase 
Protein accessionYP_429669 
Protein GI83589660 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR02032] geranylgeranyl reductase family 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000424296 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCAATACG ATGTTTTAAT CTGCGGCGCC GGCCCGGCCG GGAGCACCTG CGGTCGCCTG 
CTGGCTCGCC AGGGTCTTAA GGTGGCCATA TTTGACCGGG CGCGCTTTCC CCGTTATAAA
CCCTGTGGCG GCGGTTTGAC CGGTAAAGCC CAGGGTGAGC TGGAAGCGGG TTGGGAAGAC
TTAATAGAAG ATACTACCCG TGAAGTTATT TTTTACCATC GCCAGGAACG TCCCCTAAAG
ATAACCTGCG AGCAGCCGGT AATAAAAATG GTCAGCAGGG AAAAGTTCGA TTCCTGGCTT
CTCACAGAGG CAGCCAGGGC CGGAGCCGAG GTCAGGGACG GCTACCGGGT GACCGGAGTG
ACGGAAACAG CCGGGGGGGT GAAGGTTCAA GGGCAAGACG GCTGCACCTG GGAAGGACGC
TTCCTGGTCG GGGCCGACGG CGCCCTGAGC CTGGTGCGGC GCAGCCTCCC CTTTAAACCC
GGGGGAACGG CCGGAATAAC CCTGGAGTGC GAAGTGCCGG TTGACGCCGG CCTCCTTACG
AGTTATCGGG GCCAGGTCCA CCTGAGCTAT GGAGGTATTC CTTACGGCTA CGGCTGGGTC
TTTCCCAAGG GGGACCACCT CTCGGTGGGA ATAGGCTCCT TTACCCGCCG GGTCAAAGGC
CTGAGGCGCT ACTTCGATAC CTTTTGTCGC GGGCTGGGGT TGGCGGTGCC GGCGAACTTA
CGCTGTCGCG GCGCGGTTAT CCCGGCGGCC GACGGCCAGG CGGGCGTCTT TCACACCGGC
CGGGCCCTCC TGGTGGGGGA TGCCGCAGGC CTGGTGGATC CCTTCTCCGG GGAGGGAATT
TACTATGCCC TCCGGAGCGG CCGCCTGGCG GCGGAAACCA TCATGGCAAC CCTGGCAGGT
ACCGGGGAGC CGGGGGCTTA TTCCCGCCGG CTCTACGATG AATTATTACA GCCCCTCCAC
TACGCCCGGC GCATCGCCAG GGTGGTTTAT GCCCTGACCC CGGTGGTCCA TCGCCTGGTG
ACGGCCAACC CCGGGATAGC CAGGCGCCTG GTGGAGGTCC TCTTCGGCCG GGATACCTAC
CCCGACCTCT GGCAGTACCT GACCCGGCGC TACGCCATCT TTCGCCTGGC CCGCTAA
 
Protein sequence
MQYDVLICGA GPAGSTCGRL LARQGLKVAI FDRARFPRYK PCGGGLTGKA QGELEAGWED 
LIEDTTREVI FYHRQERPLK ITCEQPVIKM VSREKFDSWL LTEAARAGAE VRDGYRVTGV
TETAGGVKVQ GQDGCTWEGR FLVGADGALS LVRRSLPFKP GGTAGITLEC EVPVDAGLLT
SYRGQVHLSY GGIPYGYGWV FPKGDHLSVG IGSFTRRVKG LRRYFDTFCR GLGLAVPANL
RCRGAVIPAA DGQAGVFHTG RALLVGDAAG LVDPFSGEGI YYALRSGRLA AETIMATLAG
TGEPGAYSRR LYDELLQPLH YARRIARVVY ALTPVVHRLV TANPGIARRL VEVLFGRDTY
PDLWQYLTRR YAIFRLAR