Gene Moth_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1042 
Symbol 
ID3831848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1069558 
End bp1070568 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content58% 
IMG OID637828970 
Productpeptidase RseP 
Protein accessionYP_429899 
Protein GI83589890 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000013997 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGACCATTA TCCTGGCCCT GGTTATCTTC AGCATTCTGG TTATCGTCCA TGAAGGGGGG 
CACTACCTGG CGGCCAAGCG CGCCGGGATT AAGGTGGAGG AGTTCGCTAT CGGCATGGGC
CCGGCCCTGT GGCAGGTTAA AAAGGGAGAA ACCATTTATT CCCTGCGGGC CTTTCCCCTG
GGAGGGTTTA ACCGGATGGC CGGCATGGAG GGGCCAGACC TTGACGACCC ACGTGGCTTC
AACCGCCAGC CGGTACTCGC CCGGATGGGG GTCATCGGCG CCGGTTCTGG TATGAACTTC
CTCCTGGCGT TGTTCCTGTT TATTCTGGTC TTTATGGTCC TGGGGATACC GGCTGATATC
AATATTATTG GCCGGGTCGA GCCGGGTATG CCGGCCGCCC TGGCCGGCTT GCAACCCGGG
GATAAAATCC TTCAGGTTAA CGATACCCCG GTGAATACCT GGCGCGATAT GGTCGACCTG
ATTTATAAAC ACCCGGAAGA AAAAATAACC CTGGTGATTG AACGGGACGG CCGGCAACAA
CAGATCAACC TCACCACCGC CAGGGATCCC CAGACGGGGG TGGGATTGAT CGGCATCGGC
CCCACCTGGG AGAGGCAGGG TTTCTGGCGC TCTATTGTCC TAGGCACCAG GCAGGCAATA
GAGATCACCA GGCTCATTAT CCTGAGCTTG GTAGAGATGG TGACCGGCAA GGTGGCGGCG
GAGGTAGTCG GTCCGGTGGG TATCGTCCAG CTGGTGGGCC AAGCGGCAGC CTTCGGCCTG
GCCAATGTTT TGAACTTTAT GGCCGTCCTG AGCCTTGACC TGGGGATTAT TAACCTGCTG
CCGGTCCCCG CCCTGGATGG CAGCCGGCTG GTGTTCCTGG GCCTGGAAGC AGTGCGCGGG
CGACCCATTA ACCCGGAAAA GGAGAATTTT ATCCACCTGA TCGGCTTTGC CATCCTGATG
GGCCTGTTAA TTCTCATTAC CTATAAGGAT TTAATCCGGA TCTTCAGCTG A
 
Protein sequence
MTIILALVIF SILVIVHEGG HYLAAKRAGI KVEEFAIGMG PALWQVKKGE TIYSLRAFPL 
GGFNRMAGME GPDLDDPRGF NRQPVLARMG VIGAGSGMNF LLALFLFILV FMVLGIPADI
NIIGRVEPGM PAALAGLQPG DKILQVNDTP VNTWRDMVDL IYKHPEEKIT LVIERDGRQQ
QINLTTARDP QTGVGLIGIG PTWERQGFWR SIVLGTRQAI EITRLIILSL VEMVTGKVAA
EVVGPVGIVQ LVGQAAAFGL ANVLNFMAVL SLDLGIINLL PVPALDGSRL VFLGLEAVRG
RPINPEKENF IHLIGFAILM GLLILITYKD LIRIFS