Gene Moth_1542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1542 
Symbol 
ID3831928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1585374 
End bp1586777 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content59% 
IMG OID637829474 
Productmicrocin-processing peptidase 2 
Protein accessionYP_430394 
Protein GI83590385 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0708612 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTAC CAGAAACAGA TCTCAAAGCC ATCCTCCAGG CAGCCCTGCA GCAGGGAGGC 
GATTTTGCCG AAATTTTCCT GGAACGCCGC CGGACTACCA GTATCGGCTG CGAAGAAAAT
AAAATCGAGC GGGTGACATC CGGCTTTGAC CAGGGAGCGG GCATCCGTGT CATCGCCGGG
GAGAATACCG CCTACGGCTA CAGTAATGAC CTGAGCCGCG AGAGCCTTAT CGAAACTGCC
ATCCTGGTGG GTAAAGCTGT GCAGAGCCAG CCACGGCCAC GGGAGATTAA CTTTACCCGG
CCCAAGTCCC GGGTAAAGTT CACCATAAAA AAACGGCCCG ACCAGGTGGC GGTAGAAGAA
AAGGTGAACC TGGTACGCAG GGCCAATGAC GCCGCCCGGG CCGTCGATAA ACGCATTGCC
CAGGTCACGG TTGGTTACGG CGACGTTATT CAGGAAGTAA CCATTGCCAA CAGCGACGGG
ACCCTGGTGG ATGATGAGCG GGTCCGCTGT CGCCTGGTCG TCAACGCCGT GGCCACCGAC
GGTAAACATA TCCAGACGGG CTACGAATCA GCCGGGGGCC ACCAGGGCTG GGAGCTTTTT
GAGACCGTGA ATCCTGAAGA AATGGCCCGG CTGGCGGCCC GGCGGGCGGT GATGATGCTG
GAGGCCCGAC CGGCCCCGGC CGGAAAGATG CCGGTAGTCA TGGCCGGTGA GGCCGGCGGT
ACCATGATTC ACGAGGCCTG CGGTCACGGC CTGGAGGCCG ACCTGGTCCA GAAGCAGCTT
TCCGTCTACG CCGGTAAAAA AGGGCAAAAG GTGGCCGCCG ATATCGTCAC GGTTATCGAC
GACGCTACCA TTCCGGGGAA ATACGGCTCC TACTCCTTTG ACGATGAGGG CAACCCCGGG
CAAAAGACGG TCCTGATTGA AAACGGTGTC CTCAAAGAGT ACATGTACGA TTATCTCACC
GCCCGTAAAG AGGGTCGGCG CTCCACCGGC AACGGCCGCC GCGAGTCCTA CCAGGACCGG
CCCATTCCCA GGATGACCAA TACTTATATT GCTCCTGGCA AGGATGATGC CGCGGCAATC
CTCAGGGACA CTAAATACGG CTTGCTGGTC AAACGTATGG GCGGTGGGCA GGTCAATACC
ACCAACGGTG ATTTTGTTTT TGACGTTGCT GAGGGTTACC TGATCCAGGA TGGCCAGATC
GGCCCGGCCG TCAGGGGTGC GACCCTCACC GGCAACGGTC CCGAGGTTTT ACGAATCATT
GACCGGGTGG CCGGCGATCT GGGCTTCAGC CTGGGCATCT GCGGCAAAGA CGGCCAGGGT
GTTCCGGTCG GGGACGCGCA GCCAACCATC AGGATCCCCG AGCTGGTAGT TGGCGGTATC
CTGGATGAGG GGGATAAAGA ATAA
 
Protein sequence
MLLPETDLKA ILQAALQQGG DFAEIFLERR RTTSIGCEEN KIERVTSGFD QGAGIRVIAG 
ENTAYGYSND LSRESLIETA ILVGKAVQSQ PRPREINFTR PKSRVKFTIK KRPDQVAVEE
KVNLVRRAND AARAVDKRIA QVTVGYGDVI QEVTIANSDG TLVDDERVRC RLVVNAVATD
GKHIQTGYES AGGHQGWELF ETVNPEEMAR LAARRAVMML EARPAPAGKM PVVMAGEAGG
TMIHEACGHG LEADLVQKQL SVYAGKKGQK VAADIVTVID DATIPGKYGS YSFDDEGNPG
QKTVLIENGV LKEYMYDYLT ARKEGRRSTG NGRRESYQDR PIPRMTNTYI APGKDDAAAI
LRDTKYGLLV KRMGGGQVNT TNGDFVFDVA EGYLIQDGQI GPAVRGATLT GNGPEVLRII
DRVAGDLGFS LGICGKDGQG VPVGDAQPTI RIPELVVGGI LDEGDKE