Gene Moth_0095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0095 
Symbol 
ID3832666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp92267 
End bp93703 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content64% 
IMG OID637828027 
Product3-octaprenyl-4hydroxybenzoate decarboxylase 
Protein accessionYP_428977 
Protein GI83588968 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.218951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTCACC AGGATTTGCA GGCCTACCTG GCCTATCTAG AAGCCCACAA ACTGTTGCAT 
CGCGTTAAGG TAGAGGTCGA CCCCATCTTT GAGATTGCGG CCATCAGCGA CCGGGTGGTC
AAACGGGGCG GCCCGGCCCT GCTCTTCGAG CGGGTAAAGG GTTCGACCCT GCCCGTGGCC
ACCAACCTCT TCGGCAGTAT AGACTTGGTA AAGGCGGCCC TGGAAGTGAC CGACCTGGAG
GAACCGGCCC GGCGCCTCCG GGCCCTCCTG GAACTGCCGG CCGATTCGGG TGGCTGGCTG
GATAAGCTGC GCTTTCTGCC CCGCCTGGCA GAACTGGGCC GTTACCTCCC CCGCCGGGTA
AAGGAGGCTC CCTGCCAGGA GGTCAGGGTA GAACCGCCAT CTTTGGAGGA ACTGCCGGTA
CTGCAACTCT GGCCGGGAGA CGGCGGCCGT TTCCTTACCC TGCCCCTGGT CTTTACCCAT
GACCCCCTGA CCGGCCGCCG GAATGTAGGC ATGTACCGAA TGCAGGTGTT TGACGCGGTC
ACCACCGGCA TGCACTGGCA TATCCACAAG GACGGGGCCG AGCACCTGCG CCGCAGCGGG
GACCGCCTGG AAGTAGCTGT CGCCCTGGGA GCTGACCCGG CGGTGATCTA CGCCGCCACT
GCCCCCCTGC CTCCGGGCCT GGACGAGATG CTCCTGGCCG GGTTTTTACG CCGGGAACCG
GTGGAGATGG TACCGGCCCT GACGGTGAAT ATCGATGTCC CGGCCCGGGC GGAGATTATC
CTCGAGGGTT ATGTCGACCC TGGGGAAACC CGCCTAGAGG GCCCCTTCGG CGACCATACG
GGTTATTACT CCCCGGCTGA CAATTATCCC GTTTTTCACC TGACCTGCCT GACCCGGCGC
CGCCGGGCGG TTTACCCGGC TACGGTGGTG GGACCGCCGC CCATGGAGGA CGCCTACCTG
GGGAAAGTAA CGGAACGCCT CTTCCTGCCT TTGATCCAGC TCCAGCTCCC GGAGGTGGTG
GACATCAACT TCCCCCCCGC AGGGGTTTTC CATAACTGCG TCATTGTCGC CATCCGTAAA
GCTTACCCCG GCCAGGCGCG CAAGGTCATG CATGCCCTCT GGGGGATGGG GCAGATGATG
TTTACCAAGC TCATCATCGT AGTCGATGCC GATGTCAACG TCCATGATCT TCAAGAGGTC
GCCTGGCGCG TCCTGGGTAA TATCGACCCC CGCCGGGATG CCGTTATAGT CGACGGGCCG
GTGGATATCC TGGATCACGC CGCTCCCCGC AGGGGTTTCG GTAGCAAGAT GGGACTGGAT
GCCACCCGGA AACTGCCGGA AGAAGGAGCC TCGCGTCCCT GGCCGGAGGA GGCCCGGGCT
GCCCGGGAGG TCCTGGAACT CATCGACAGG AGGTGGCAGG AGTATGGTCT GGCGTAA
 
Protein sequence
MAHQDLQAYL AYLEAHKLLH RVKVEVDPIF EIAAISDRVV KRGGPALLFE RVKGSTLPVA 
TNLFGSIDLV KAALEVTDLE EPARRLRALL ELPADSGGWL DKLRFLPRLA ELGRYLPRRV
KEAPCQEVRV EPPSLEELPV LQLWPGDGGR FLTLPLVFTH DPLTGRRNVG MYRMQVFDAV
TTGMHWHIHK DGAEHLRRSG DRLEVAVALG ADPAVIYAAT APLPPGLDEM LLAGFLRREP
VEMVPALTVN IDVPARAEII LEGYVDPGET RLEGPFGDHT GYYSPADNYP VFHLTCLTRR
RRAVYPATVV GPPPMEDAYL GKVTERLFLP LIQLQLPEVV DINFPPAGVF HNCVIVAIRK
AYPGQARKVM HALWGMGQMM FTKLIIVVDA DVNVHDLQEV AWRVLGNIDP RRDAVIVDGP
VDILDHAAPR RGFGSKMGLD ATRKLPEEGA SRPWPEEARA AREVLELIDR RWQEYGLA