Gene Moth_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0033 
Symbol 
ID3830899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp33358 
End bp34449 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content60% 
IMG OID637827966 
Productpyruvate flavodoxin/ferredoxin oxidoreductase-like 
Protein accessionYP_428916 
Protein GI83588907 
COG category[C] Energy production and conversion 
COG ID[COG0674] Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.80564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000011646 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGCAA AACCCATTGA AGGAGAGCAA AGGGCCTTTA TGACCGGCAA CGAGGTTGTC 
GCCTGGGCCG CCCTGGCGGC AGGGGCTGAC ATCATGTACG GTTACCCCAT TACGCCCCAA
AACGAGATCA TGCACTACTG GACCCGCATG GCTCCCAGGT ACGACCGGGG TTTTTTACAG
ACCGAGGACG AAATATCAGC CGGGTTTACT ACTGTGGGCG GGGTTCTGGC CGGCAAGAGG
GCCTTTACGG CCACCGCCGG GCCGGGCAAT GTCCTCATGC AGGAGGCCAT GTCCATGGCC
GAGATGATGC GCCTGCCCAC CGTGGTGGTC GTGACCCAGC GGGGCGGCCC TTCGACGGCC
ACGGTCATCT ATTCCCAGCA GGAACTCAAC CTGACCTGTT TCGGCGGCAA TGGGGAGGGA
CTCAGGATTG TTTATTCCAC CTCCTCCCAT CAGGACCTTT TTAACTATAC CATCAAGGCC
TTCAACACTG CCTGGAAATA TCGTTTCCCT ACCTTTGTCC TGGGTGACGG TTACCAGTCC
AAGATGAGGG AACCGGTAAC CATCTATGAC CCCGCCACCA GGGGTATTGT CATGGAAGAG
TGCCGGCCGA TGGTAGGCCT GCCGGGTATA GCCGGGATAG ATCGTGAGCC TGCCCACCTG
CGCAATACCT ACAACCTCGA GGATGAACTT TATGACCGGC TTAGCGCCTC AATTAAAGAC
TACCAGGCCA TGCTCCCGGA AGTAGTCGAA TGGGAGGCCT ACGCTGTGGA CGATGCCGAG
TTCCTGGTCA TTGCCCACGG AGTTGTTTCC AGGGCCGCCC GGGCAGCCGT AGACTCCCTC
CGGGAAGCCG GCATCAAGGC CGGGTACTTC CGGCCCATTA CCCTTAGACC CTTCCCGGAG
GAAGCCTTGC AGCCCCTGGC TGCCAGGGCG CAAAGGATCC TGGTGGTCGA GTCCGCCCAC
GGCCAGCTGG AACGCCAGGT CAGGGCCAGC CTCTATGGCC TGGAAACACC CGTCAGCGGC
TACCTGCGGC CGGGCATGGG CATAACCCCG GAGGAGATAA TCGGCGCCGT CCAACAAACT
ATAAGGAGCT GA
 
Protein sequence
MAAKPIEGEQ RAFMTGNEVV AWAALAAGAD IMYGYPITPQ NEIMHYWTRM APRYDRGFLQ 
TEDEISAGFT TVGGVLAGKR AFTATAGPGN VLMQEAMSMA EMMRLPTVVV VTQRGGPSTA
TVIYSQQELN LTCFGGNGEG LRIVYSTSSH QDLFNYTIKA FNTAWKYRFP TFVLGDGYQS
KMREPVTIYD PATRGIVMEE CRPMVGLPGI AGIDREPAHL RNTYNLEDEL YDRLSASIKD
YQAMLPEVVE WEAYAVDDAE FLVIAHGVVS RAARAAVDSL REAGIKAGYF RPITLRPFPE
EALQPLAARA QRILVVESAH GQLERQVRAS LYGLETPVSG YLRPGMGITP EEIIGAVQQT
IRS