Gene Moth_0513 details

Gene Information

Locus tag: Moth_0513
Symbol:
ID: 3831815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 532657
End bp: 533955
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 58%
IMG OID: 637828447
Product: phenylacetate-CoA ligase
Protein accession: YP_429386
Protein GI: 83589377
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1541] Coenzyme F390 synthetase
TIGRFAM ID: [TIGR02155] phenylacetate-CoA ligase
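
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(532657, 533955)
print(length, protein_length(length))  # prints "1299 432"
```

Both values match the Gene Length and Protein Length fields of this record.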


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTACTGGA ACGAGAGATA TGAATGTATG CCGGAGGCAG AACTGCAGGA ACTCCAGCTG 
GAACGCCTCC AGGCAACAGT CAGGAGGGCC TTTTTTGATG TCCCCTTTTA TCGCCGGGCC
TTCCAGGAGA TCGGCCTGGA ACCGGGGGAC ATCAAGAGCC TGGACGACCT GCAAAGACTG
CCTTTCACCA CCAAACAGGA TTTGCGGGAT AACTACCCCT ACGGCATGTT CGCCGTGCCC
ATGAGCGAGA TCGTACGTAT CCACTCCTCC TCGGGCACTA CCGGTAAACC GACGGTGGTT
GGTTACACCC GCCACGACAT CGACGTCTGG TCCGAACTCA TGGCCCGGGC CCTCGTCTGC
GGCGGCGCCA CCCGGCATGA TATCATTCAA AACGCCTACG GGTATGGCCT CTTTACCGGT
GGCCTGGGAA TCCACTACGG CTCCGAACGC CTGGGGGCCT CCGTAATTCC AATTTCCGGC
GGCAACACCA AGCGCCAGGT AATGATCATG AAGGACTATG GTAGCACCGT CCTGACCTGT
ACTCCTTCCT ATGCCCTGCA TATCGCGGAA GTAATGGCGG AGATGGGGAT CAAACCCGAG
GAAATAAAGC TCCGCTGCGG GATCTTCGGT GCCGAACCCT GGTCGGAAAG CATGCGCCAG
GAGATCGAGA AGCGCCTGGG CATCAGCGCC GTCGACATCT ACGGCCTGAG CGAGGTTATC
GGCCCCGGGG TGGGGATTGA GTGCCAGGAG AAAAACGGTC TCCATATCTT TGCCGACCAC
TTTTTGATTG AAATTATCGA TCCGGTGACC GGGAAGCAAC TGCCCCCGGG CCAGAGGGGC
GAACTGGTGA TTACCTCCCT CACCAAGGAA GCCCTGCCGG TCATCCGTTA CCGCACCCGG
GATATCACCA GCCTGATCCC CGGACCCTGT CCCTGTGGCC GCACCCACCC GCGGGTAGCC
CGTTTCACCG GTCGTACCGA CGACATGCTC ATTATCCGCG GGGTCAATGT CTTCCCCTCC
CAGGTGGAGA GCGTCCTCCT GGAGATGGGG GGTACCGAAC CCCACTACCT CCTCATCGTC
GACCGCCAGG GCTCCCTGGA TACCCTGGAG ATCAAAGTAG AAGTCTCTGA AACCCTTTTC
TCCGATAAAG TCCGGCGCCT GGAAGATTTG GAAAAACGCC TGCGTCATGA ACTGGAAAGC
ACCCTGGGCA TCAGCGTCAA AGTAACCCTG GTCGAGCCTA AATCTATCCA GCGGAGCGAA
GGCAAAGCCG TAAGGGTTAT CGATAAAAGG AAGATTTAG
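
The GC content reported in the gene information can be recomputed from the gene sequence. This is a minimal sketch that assumes the sequence is pasted as plain text, with whitespace between the 10-base blocks. `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence; paste all rows to check the whole gene.
first_row = "TTGTACTGGA ACGAGAGATA TGAATGTATG CCGGAGGCAG AACTGCAGGA ACTCCAGCTG"
print(round(gc_content(first_row)))
```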
 
Protein sequence
MYWNERYECM PEAELQELQL ERLQATVRRA FFDVPFYRRA FQEIGLEPGD IKSLDDLQRL 
PFTTKQDLRD NYPYGMFAVP MSEIVRIHSS SGTTGKPTVV GYTRHDIDVW SELMARALVC
GGATRHDIIQ NAYGYGLFTG GLGIHYGSER LGASVIPISG GNTKRQVMIM KDYGSTVLTC
TPSYALHIAE VMAEMGIKPE EIKLRCGIFG AEPWSESMRQ EIEKRLGISA VDIYGLSEVI
GPGVGIECQE KNGLHIFADH FLIEIIDPVT GKQLPPGQRG ELVITSLTKE ALPVIRYRTR
DITSLIPGPC PCGRTHPRVA RFTGRTDDML IIRGVNVFPS QVESVLLEMG GTEPHYLLIV
DRQGSLDTLE IKVEVSETLF SDKVRRLEDL EKRLRHELES TLGISVKVTL VEPKSIQRSE
GKAVRVIDKR KI
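
The protein sequence can be re-derived from the gene sequence using translation table 11. This sketch assumes the standard NCBI codon-to-amino-acid layout, which table 11 shares with table 1, and, to the best of my knowledge, the table 11 start codon set. The gene begins with TTG, an alternative start codon that is read as M at the first position. `translate_cds` is an illustrative name, not a library function.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # the initiator codon is always read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("TTGTACTGGA ACGAGAGATA TGAATGTATG"))  # prints "MYWNERYECM"
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way should stop at the final TAG codon.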