Gene Moth_1007 details


Gene Information

Locus tag: Moth_1007
Symbol:
ID: 3833310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1035657
End bp: 1036706
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 58%
IMG OID: 637828936
Product: 2-hydroxyglutaryl-CoA dehydratase, D-component
Protein accession: YP_429865
Protein GI: 83589856
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB
TIGRFAM ID:
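
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1035657, 1036706)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

Under these assumptions the results agree with the listed gene length of 1050 bp and protein length of 349 aa.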


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000530359
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000167288
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCATT TAGTGGGTTG GCTGTGCCAC TATACGCCGG TAGAAATTTT TACGGCTCTC 
GGGTACACCC CTTACCGGCT GCTGGGCCGG GAGGGCAATA ACCCCCGCGC CGCTACATAT
CTGGTGGGCA ACCTCTGCCC TTATGTCCAG AGCTGCCTCG AGGCAGCCAT CAAACAGGAG
TTACCCCTCC TGGCCGGCGT CGTAATCGCC CGATCTTGCA ATGCCATGAT CCACCTGGCT
AATGTCTGGC CCCATTATGG TGCAGGGGGT AAGACCGTAG TCCTCGATGT ACCCCGCCGG
TTCGACGAGG ACGCAGTTTT CTATTTCAGC CAGAATCTAC GCCGTCTGGC GGCAGAACTA
GCTCCCCCTG GGGAGCGACA ATTAAATGAA GAACGTTTGT GGGCGGCCAT TACCTGGTGG
GAAGAACAAC GGGCAGCCTG GCGCGAGCTT TTAGCCTGCC GGGCTGCGTG GGAAGACCCT
CCCGGCGGGA AAGAAATCAT GGACGGGCTG GCCAGGTGGC AAACTCCCCT TAATCCCCAG
GAGGCAGATA AATGGGAGAC TGTAATTACT CGGTTAACCA GGCAACCCCG GGTGAAAACC
GCCCGGCGGC CGCGCCTGCT GCTAGCGGGA AGCATCCTCC CGCGGGAATT GATTACTATG
GTGGAAGAGT GCGGCGGTTT ATCTGTTTTC GAGGACAGCT GCAATGGCAT GCGGCTATTA
CTGGCCCCCG CCGCCAGGGC CGAGGGTGAC CCTTACCTCT ACCTGGCCAG GCTCTACCTG
GGAGGACCCC CATGTCCCCG GATGGTAGGC GACCGGGAGA AACGACGCCA GTACTGGGCC
GCAGCTGTAG ACCGGTTCCG TATCCAGGGA ATCATTTACC ATGCCATGAA GTTCTGCGAC
GCCGCTATGT ACGACTACAT AGCCCTGAAG GTTTTTAGTG AAAAAAAGGG GTTACCCCTT
CTTCGTCTTG ATGGGGATTT CAGCGGCGGT AATCGGGGTC AATGGCAGAC GCGGCTGGAA
GCCTTTTTAG AAATGCTAGG GGTGGAGTAA
 
Protein sequence
MSHLVGWLCH YTPVEIFTAL GYTPYRLLGR EGNNPRAATY LVGNLCPYVQ SCLEAAIKQE 
LPLLAGVVIA RSCNAMIHLA NVWPHYGAGG KTVVLDVPRR FDEDAVFYFS QNLRRLAAEL
APPGERQLNE ERLWAAITWW EEQRAAWREL LACRAAWEDP PGGKEIMDGL ARWQTPLNPQ
EADKWETVIT RLTRQPRVKT ARRPRLLLAG SILPRELITM VEECGGLSVF EDSCNGMRLL
LAPAARAEGD PYLYLARLYL GGPPCPRMVG DREKRRQYWA AAVDRFRIQG IIYHAMKFCD
AAMYDYIALK VFSEKKGLPL LRLDGDFSGG NRGQWQTRLE AFLEMLGVE
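
The protein sequence follows from the gene sequence by codon-wise translation. As a minimal sketch, the code below builds a codon table and translates the first and last stretches of the gene sequence shown above. It assumes the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs mainly in the start codons it permits). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCCATT TAGTGGGTTG GCTGTGCCAC"))  # MSHLVGWLCH
# Last 30 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GCCTTTTTAG AAATGCTAGG GGTGGAGTAA"))  # AFLEMLGVE
```

Both fragments match the corresponding ends of the protein sequence listed above.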