Gene Moth_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0233 
Symbol 
ID3832561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp232197 
End bp233861 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content61% 
IMG OID637828169 
Product2-octaprenylphenol hydroxylase 
Protein accessionYP_429111 
Protein GI83589102 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID[TIGR01982] 2-polyprenylphenol 6-hydroxylase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTGC GGCGTCGCTA CATTCATTTG AACCGTTATC GCCAGGTGGT CAATGTCCTG 
GCCCGCTATG GTTTTGGTTA TCTCCTGGAT CAGGTGGGGC TGGGAGAATT AATTCTCCGG
CGGTCACGGG AAGAAGCACC TCTGTCCCTG GGACAGCGCC TCCGCCTGGC CCTGGAGGAA
CTCGGACCTA CCTTTATCAA GCTGGGACAG CTGCTGAGCA CCAGGCCTGA TCTGTTGCCG
GCGGATATCA TCAGCGAACT GACCCGCCTT CAGGACCGGG TGCCTCCCTT TCCCTTCGCT
GACGTCCGAA AAGCGGTGGA GGAGGAACTG GGCCAGCCCC TGGAGGAACT CTTCGCCTCC
TTTGACCCCG AGCCCCTGGC GGTGGCATCC ATCGGCCAGG TCCACCTGGC TACCCTCCCG
GACGGCAGCC AGGTAATCGT CAAGGTCCAG CGACCCGGAA TTGCCAGGCA GGTTCGGGTG
GATCTGGAGA TCCTCTTCGA CCTGGCCCGC CTGGCCCAGC GCCACACTCC CTACGGCAAG
ATATACGATT TCAACCAGAT GGCCGCCGAG TTTGCCCGGG CCCTGACCGA AGAACTGGAT
TATACCCGGG AAGGCCGCAA CGCCGACCGC TTCCGGGAAA ACTTCGCGGG GGACGCAAGC
GTCTATTTTC CGGCCGTTTA CTGGGACTAT ACCACCAGGG GAGTGCTGAC CCAGGAGTAT
GTAGAGGCGG TAAAACTCAA TAACCTGGAA GAGATTGACC GCCGGGGTTA TAGCCGCCGC
CGGATAGCCG TTAACCTCGC CCGGGCCGTT TACCAGCAGG TTCTGGTGGA CGGCTTTTTC
CATGGCGACC CCCACCCGGG AAACCTGGCC GTCCTGCCAG GGGAAGTTAT TGTCTTTATG
GATTTCGGTC TCACGGGGAC TTTGACGGAG GAACTTAAGG AGCAGTTTGT CAACCTGGTC
CTGGGGATTA TCCGCCGCCG CAGCCAGGAC GTCCTCAGGA CCATCATAGC CATGGGCATG
GTGCCGGCCG AAGTGGACCG GGGTGCCCTG CGGCGGGAAA TTGAAGCCCT GCGGGATAAA
TACTACCACC TTTCCTTTCG CCAGATCAGC CTGGGCCAGG CTATTGAGGA ACTCCTCCAG
CTGGCCTTCA GGTATCACCT GCGCATGCCC CCGGAATTAA CCCTGCTGGG AAAAACCCTC
TTGACCCTGG AGGGCCTGGT CAGGAAACTT GACCCGGAGC TGGAACTGGC CGAACTGGCC
GAACCCTATG GCCGGGAGCT CTTGCGGCGT CGCTTCAGCG TTCGTTTCCT GTGGCGGGCG
TTGACGGAAA ACCTGGCTTC CGGCTGGGAG GTTATGCAGA GCCTCCCCCG GCAGTTCCAG
CACCTCCTGG ACCTGGCGGA ACGGGGCGAA CTGACCCTCA GGGTGGAACC CCTGCACCTG
CGGGGCCTGG TGCGGCAAAT AGACCGGATT ATCAATAAGC TGACGATGAG CGTAGTTTTG
CTCGCCTTCA GTATCATTAT GGCCAGCCTG ATCATCAGCA CGGCCCTGGG GGCCCCGACC
AACAGCCTTT TCTTCCGCCT GCCCACCCTG GAGGTCGGCT TCGGGGCGGC CGGCATGATG
CTCCTGTGGT TACTGGTGAC CATCTGGCGG GGCGGCCGCG ACTGA
 
Protein sequence
MSLRRRYIHL NRYRQVVNVL ARYGFGYLLD QVGLGELILR RSREEAPLSL GQRLRLALEE 
LGPTFIKLGQ LLSTRPDLLP ADIISELTRL QDRVPPFPFA DVRKAVEEEL GQPLEELFAS
FDPEPLAVAS IGQVHLATLP DGSQVIVKVQ RPGIARQVRV DLEILFDLAR LAQRHTPYGK
IYDFNQMAAE FARALTEELD YTREGRNADR FRENFAGDAS VYFPAVYWDY TTRGVLTQEY
VEAVKLNNLE EIDRRGYSRR RIAVNLARAV YQQVLVDGFF HGDPHPGNLA VLPGEVIVFM
DFGLTGTLTE ELKEQFVNLV LGIIRRRSQD VLRTIIAMGM VPAEVDRGAL RREIEALRDK
YYHLSFRQIS LGQAIEELLQ LAFRYHLRMP PELTLLGKTL LTLEGLVRKL DPELELAELA
EPYGRELLRR RFSVRFLWRA LTENLASGWE VMQSLPRQFQ HLLDLAERGE LTLRVEPLHL
RGLVRQIDRI INKLTMSVVL LAFSIIMASL IISTALGAPT NSLFFRLPTL EVGFGAAGMM
LLWLLVTIWR GGRD