Gene Moth_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0933 
Symbol 
ID3832934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp966547 
End bp967833 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID637828864 
Productphenylacetate-CoA ligase 
Protein accessionYP_429793 
Protein GI83589784 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT CCGAAGCAAA AAACTGCCAA CGCAAGTTCG ACTACGATTG CCTGTACCGG 
CAGCAGGAAC CGCACTTGCA GGCCCTGGTG CAGCGCCTCT TTGCCCATTC CCCTTACTAC
CGGGAAAAGC TGGCGGCGGC CGGCCTTACA CCCGGAGACA TCCGGACGGT AGCCGACCTG
GAGCATGTCC CCCTGACGGA CAAATGGGAG CTACGCAATG GTAAACCCCT GGCCCTGATG
GCCGTTCCCG AAGAAAAAGT CGTCCGCATC CACTCCTCTT CGGGAACTAC GGGTAAACCC
ATTATCATTC CCTACACGGC CTACGACGTG GCCGTCTGGG CGCAGATGAT GGCCCGCTGC
TTCGCCATGG CCGGGGTCAC CAACCGTGAC CGGGTCCAGG TTACCCCTGG GTACGGCCTC
TGGACGGCGG GCATCGGCTT CCAGGCCGGT ATTGAGTACC TGGGGGCCAT GGTGATCCCC
ATGGGACCGG GGAATACTGA AAAACAACTG GAGATGATGG TCGATCTCCA GGCTACCGTC
CTTGCGGCCA CGGCTTCCTA CGCTCTCTTC CTGGCCGAAG AGATCGACCG CCGGGGCCTT
AAGGATCAAC TGGCCCTACG GGTAGGGGTC CTGGGCTCCG AGCGCTGGGG CGAGAAGATG
CGGCAGCGAA TTGAAGACCT TCTGGGCATC GAAACCTTTG ATATTTACGG CTTAACGGAG
ATCTACGGCC CGGGCATCGG CATCGACTGC CCGGCCCATG AGGGTATTCA TATGTGGACG
GATCACCTGC TCCTGGAGGT TATCGACCCG GCGACAGGCA AGCAATTACC TCCGGGGGAG
ACTGGTGAGC TGGTGATAAC TACCCTCACC AAAGAGGGTA TGCCCCTCCT CCGTTACCGC
ACCCACGACC TGACCTGCCT AAAGAGGGAA GCCTGCTCCT GCGGTTCGCC CTACCCCATG
ATTGAGCGCG TCCTGGGCCG GACCGACGAC ATGGTCAAGA TCAAGGGTGT CAACATCTTC
CCGGGCCAGG TAGATCATGT CCTCCACCTC ACCCCCGGCG CCGGGAGCGA GTACCAGCTT
ATCCTCACCC GGCAGGAAGG TAAAGACCGG CTGCTGGTAA AAATAGAGTA CCTGCCCGGT
TATGATGGTG AGTCCACGGC AGCCGAGTGT CGCCGCCAGA TCAAGACCCG GATCGGTATC
CTTGCCGACG TGGAAGCCGT GCCCCTGGGA ACCCTGCCCC GCAGCGAAAA GAAAACCCGG
CGCGTCTACG ACTACCGGGA GACTTAG
 
Protein sequence
MSLSEAKNCQ RKFDYDCLYR QQEPHLQALV QRLFAHSPYY REKLAAAGLT PGDIRTVADL 
EHVPLTDKWE LRNGKPLALM AVPEEKVVRI HSSSGTTGKP IIIPYTAYDV AVWAQMMARC
FAMAGVTNRD RVQVTPGYGL WTAGIGFQAG IEYLGAMVIP MGPGNTEKQL EMMVDLQATV
LAATASYALF LAEEIDRRGL KDQLALRVGV LGSERWGEKM RQRIEDLLGI ETFDIYGLTE
IYGPGIGIDC PAHEGIHMWT DHLLLEVIDP ATGKQLPPGE TGELVITTLT KEGMPLLRYR
THDLTCLKRE ACSCGSPYPM IERVLGRTDD MVKIKGVNIF PGQVDHVLHL TPGAGSEYQL
ILTRQEGKDR LLVKIEYLPG YDGESTAAEC RRQIKTRIGI LADVEAVPLG TLPRSEKKTR
RVYDYRET