Gene Moth_0935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0935 
Symbol 
ID3832936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp968417 
End bp970195 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content61% 
IMG OID637828866 
Productthiamine pyrophosphate enzyme 
Protein accessionYP_429795 
Protein GI83589786 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0199927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGAAT TGCTCATGGG CAACGAAGCC ATAGCCAGGG GAGCGCTGGA GGCCGGTATC 
CGGGTGGCCA CCGCTTATCC CGGGACGCCG GCTTCGGAAA TCATGGTTAC GCTGATGCGT
TTTGGTCCGG AGGCAGGAGT CTATACTGAA TGGTCCGTTA ACGAGAAAGT AGCCGTAGAA
ATTGCTGCCG GCGCTGCCTA TGCCGGTGCC AGAGCCATGG CCAGTATGAA GCAGATGGGC
CTCAATGTTG CCGCCGATGC CATCATGAGC CTGGCTTACA TCGGCGTCAA GGGCGGCCTG
GTCCTGGTGG TGGCCGATGA TCCAGGCCCC CATAGTTCCC AGACAGAACA GGATACGCGC
CTTTTCGCCC GCTTTGCCAA GCTGCCGGTC CTGGACCCCT CCTGTCCCCG GGAAGCCTAC
GAAATGACCA AATACGCTTT TGATCTCTCG GAAACCCTAG GGCTGCCCGT CATTGTCCGT
CCTACCACCC GTACCTGCCA TGCCTGCCAG GATGTTGCCT TAGGAACCAT TCCCCCCCGG
CCTCCAGTAC CCGGCTTCGA GAAAGACCCG CGCTGGGTCA TTATGCCTTC CCTCTCGGCC
AGGCAGCACG TCTGGTTGAA CCAGCAGCAG CTACGTGCCG GGGAGGAGTT CGCCAACAGT
CCCTTTAACG AGGTCTACTA TAATGGCCCC GCCGGAGTTA TCACCAGCGG CCTGTCTTAC
TACTACGTCA CCGAGGCCGG GGAGCGCCTG GGAGTAAAAC TATCCCTGTT AAAGATCGGT
ACCCCCTACC CCTTGCCCGA AAAACTGGTG ATCGATTTTT TAAAGCAGGT TGAGCGGGTA
CTCATCGTAG AGGAGCAGGA GCCTGTTGTC GAAGATCAAG TCATTCGCCT GGCCTGGCGC
CACCGCCTGC CGGTAGAGAT AGACGGTAAA CACAACGGTT TTCTCCCCCG GGAAGGTGAG
TTTAATCCCG ATATTGTCAC CGGGGCCCTG GCCAAGTTCC TGGCAATACA ACCGGCGGGT
ACCCACGGCC GCCCCGGGAC ACCACCCCTG CCGGTGCGTC CGCCCCTACT CTGCGCCGGT
TGTCCCCATC GCGGCTCCTT CTACGCCTTT AAACAGGCCG CCCGGGACCG TAAAGTCATC
TTCACCGGCG ATATCGGCTG CTATACCCTG GGAGCGGCCC CGCCCCTGGA AGCCATGGAT
ACCTGCCTGT GCATGGGAGC GGGCCTGGGT CTGGCCCAGG GCCTGGCGCG GGTCCAGCCA
GATACCCGGC TGGTAGCCTT CGTCGGGGAC TCTACCTTTT TCCATGCCGG CTTGCCATCC
CTTGTCAACG CCGTCCACCA GCAGACGCCC ATAGTTGTCG TCGTCCTTGA TAATGAAACC
ACAGCCATGA CCGGGCACCA GCCCCACCCA GGCCTGGCCA CCGATACCCA CCATAAGAAG
ATTGATATCA GCCAGGTAGG CCGGGCCTGC GGGGTAGAAA CTATCTTGAC CGCCGACCCC
CTGAACCTGG AGGAAACCCT GACTGTAGCC AATCAGGCCC TGGCGGCCCC GGGACCCGTC
CTGGTCATCC TAAGTCACCC CTGCCCGCAA GTAGCCAAAC CTGCGGGACG CTACCAGGTT
GACCAAATCG CCTGTATCAG CTGTCATACC TGTATTAAAG AGCTTGGCTG CCCGGCCCTG
AGGCCGGACG GCAACGGTGT TCAAATCGCA GCCACCTGTA CCGGCTGCGG CCTCTGTAGC
CAGGTCTGCC CGGTGGCAGC CATCGAGGAG GTACTTTAA
 
Protein sequence
MPELLMGNEA IARGALEAGI RVATAYPGTP ASEIMVTLMR FGPEAGVYTE WSVNEKVAVE 
IAAGAAYAGA RAMASMKQMG LNVAADAIMS LAYIGVKGGL VLVVADDPGP HSSQTEQDTR
LFARFAKLPV LDPSCPREAY EMTKYAFDLS ETLGLPVIVR PTTRTCHACQ DVALGTIPPR
PPVPGFEKDP RWVIMPSLSA RQHVWLNQQQ LRAGEEFANS PFNEVYYNGP AGVITSGLSY
YYVTEAGERL GVKLSLLKIG TPYPLPEKLV IDFLKQVERV LIVEEQEPVV EDQVIRLAWR
HRLPVEIDGK HNGFLPREGE FNPDIVTGAL AKFLAIQPAG THGRPGTPPL PVRPPLLCAG
CPHRGSFYAF KQAARDRKVI FTGDIGCYTL GAAPPLEAMD TCLCMGAGLG LAQGLARVQP
DTRLVAFVGD STFFHAGLPS LVNAVHQQTP IVVVVLDNET TAMTGHQPHP GLATDTHHKK
IDISQVGRAC GVETILTADP LNLEETLTVA NQALAAPGPV LVILSHPCPQ VAKPAGRYQV
DQIACISCHT CIKELGCPAL RPDGNGVQIA ATCTGCGLCS QVCPVAAIEE VL