Gene Moth_1329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1329 
Symbol 
ID3831039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1374196 
End bp1376406 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content59% 
IMG OID637829265 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase/S1 RNA-binding domain protein 
Protein accessionYP_430185 
Protein GI83590176 
COG category[I] Lipid transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0539] Ribosomal protein S1
[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming)
[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000119457 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0289615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTGA TTGTGGCTGA TTACGCGGGA TTTTGCTTCG GCGTCAAGAG GGCCGTAGAA 
AAGGCCAGAG AAGCCCCCAA ACCGGTGGCT TCCCTGGGCA TGCTCATCCA TAACCGGCAG
GTGGTGGATC AGCTTGCCCG GGAGGGAGTC CGGCCGGTGG CTTCCCTGGA TGAAGTCAAC
TGCGGCCGGA TCCTGATCCG TTCCCACGGG ACAACACCGG AGACCTTGGT CCGGGCCCGG
GATAAAGGGC TGGAGATAAT TGATGCTACT TGCCCCTATG TAAAAAGGGC CCAGCATCTG
GCCCAAAAAA TGGCCGCCGA AGGTTACCAG GTGATTATCG TCGGCGATGC CGCCCACCCT
GAGGTCCAGG GAATGCTGGG ATGGGCCGGA CCGGGGGCGG TGGTAGTTCC TGATAAAAAA
GCGGCTGCAG CCTTGCCCTT CCATCCCCGG CGGGCGGTCT TAGCCCAAAC CACTCAGCCC
GAATCCAGGT TGGAAGAAGT CGCAGCAGCT TTACGTTCCA ATACGGATAC TCTGGTAGTA
CACAATACCG TTTGCCAGGC CACCCACCAG CGTCAGGAAG CGGCAGTCAG GTTGGCGCGC
CAGGTAGACG TAATGGTCGT CGTCGGTGGT AAGGAAAGCG CCAATACGCG GCATTTAGCC
GAACTTTGCC GGGCTACCGG GACGCCTACT TACCATGTGG AAAGGGCCGG GGAGTTGTGT
CGCCAGTGGT TCATAAATAC CCGGAGGGTC GGGGTCACAG GTGGCGCTTC GACCCCAGCG
TGGATTATTG AGGAGGTAGT TGGAATGTTG ACTGGAGAGG AAAAAATTAT GGTTCCGGAA
GAAGAAAAGG AGCCGGTAAC CACTCAAGAA CCGGTGGCCT CCGGGCAACC GGTAGCCCCG
GCTGAGCCGG AAGGTAACCC GGCAATTACC GGACCTGAAG GCGGGGCAGA ACCCCCGGCG
GCTGAGGCCG CGAAGTCCGG CGCGTTCGGG GGGCAGGAGG GACCCGGGGT AGCCCCAGGG
GAGACTGGAA CCCCTGAAGA GCAAAGAAAT GAGATGCAGC AGATAACCGC AAGGATGCCA
GAACTACGCC GCGGCAACGT CGTTGCGGGT ACGGTCGTCC AGATCACCGA TAACGAAGTG
ATGGTAGATG TAGGGGGCAA ATCCGAAGGC GTTATCCCCC TGAACGAGCT CTCCCATCGT
AACGTAACCG ACCCGCATGA GGTGGTAGCC GTTGGCGATA CGATAAACGT CATGGTTTTA
CGGCCGGAGA ACGAAGAGGG ACACCCCGTA CTTTCCAAGA GGCGTGCCGA TCGCCGGGTA
GCCTGGGAAA AATTGGAGGA ACACCTGGCC AGCGGGGAAG AGATCCAGGG TGAGGTAATC
GAGGTCGTCA AGGGCGGCCT GCTGGTAGAC GTGGGCGTCA GGGGCTTTTT GCCGGCTTCC
CTGGTGGAAC GGGGTTATGT GGAAGACCTC AATGCCTACC TGGGGCAAAC CCTGCGCCTG
CGGGTAATCG AGCTCGATCG CAGCAAGAAT AAGGTCGTCC TTTCCCGGAA AGCCATCCTG
GAAGAAGAAT ATGAAAAGCA GCGCCAGGCT ACCTGGAACA GCCTGGAGGT GGGCCAGGTA
CGCAAGGGTA TTGTCCGGAG GTTGACCAAC TTCGGTGCCT TTGTTGACCT GGGCGGTGTC
GATGGCCTGC TCCATGTATC CGAAATCTCC TGGGGTCGGG TGGAACACCC GCGGGACGCC
CTGAGCGAAG GCCAGGAGAT CGAGGTCAAG ATCCTGGGCA TCGACCGCGA AGAAGGCAAG
GTTTCCCTGG GCCGCAAGCA ACTCCTACCC AATCCCTGGG ATACGGCGGC CGAGCGCTAC
CCGGTAGGGA CCATTGTCGA GGGTAAGATT TTGCGACTGG CTCCCTTCGG TGCCTTTGTG
GAAGTTGAAC CGGGTATTGA AGGCCTGGTC CACATCTCCC AGCTGGCCGA CCGCCATGTC
GATAAACCGG AAGATGTCGT CAGTATCGGC GATATCATCC CGGTCAAGGT CCTGGGGGTG
GACCAGCAGG CCCAGCGGAT GAGCCTCAGC CTGCGCCAGG CCATCCGGGA GAAGAACAGA
AAACAGGCCA AACCGGCCGA AAAGCCCGCC GAGAATCAGA ACGAGAGCGG GGTTAAGTTA
GGCGATCTCT TTGGCGATCT CTTTGAGGCG AATCAATCTA CAAACCAATG A
 
Protein sequence
MEVIVADYAG FCFGVKRAVE KAREAPKPVA SLGMLIHNRQ VVDQLAREGV RPVASLDEVN 
CGRILIRSHG TTPETLVRAR DKGLEIIDAT CPYVKRAQHL AQKMAAEGYQ VIIVGDAAHP
EVQGMLGWAG PGAVVVPDKK AAAALPFHPR RAVLAQTTQP ESRLEEVAAA LRSNTDTLVV
HNTVCQATHQ RQEAAVRLAR QVDVMVVVGG KESANTRHLA ELCRATGTPT YHVERAGELC
RQWFINTRRV GVTGGASTPA WIIEEVVGML TGEEKIMVPE EEKEPVTTQE PVASGQPVAP
AEPEGNPAIT GPEGGAEPPA AEAAKSGAFG GQEGPGVAPG ETGTPEEQRN EMQQITARMP
ELRRGNVVAG TVVQITDNEV MVDVGGKSEG VIPLNELSHR NVTDPHEVVA VGDTINVMVL
RPENEEGHPV LSKRRADRRV AWEKLEEHLA SGEEIQGEVI EVVKGGLLVD VGVRGFLPAS
LVERGYVEDL NAYLGQTLRL RVIELDRSKN KVVLSRKAIL EEEYEKQRQA TWNSLEVGQV
RKGIVRRLTN FGAFVDLGGV DGLLHVSEIS WGRVEHPRDA LSEGQEIEVK ILGIDREEGK
VSLGRKQLLP NPWDTAAERY PVGTIVEGKI LRLAPFGAFV EVEPGIEGLV HISQLADRHV
DKPEDVVSIG DIIPVKVLGV DQQAQRMSLS LRQAIREKNR KQAKPAEKPA ENQNESGVKL
GDLFGDLFEA NQSTNQ