Gene Information |
Locus tag | Moth_2364 |
Symbol | |
ID | 3832544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2484301 |
End bp | 2488005 |
Gene Length | 3705 bp |
Protein Length | 1234 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637830283 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_431189 |
Protein GI | 83591180 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
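
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2484301
end_bp = 2488005
gene_length_bp = 3705
protein_length_aa = 1234


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold: 2488005 - 2484301 + 1 = 3705 bp, and 3705 / 3 - 1 = 1234 aa.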
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00562436 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000368528 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGAAAA GTCTACATAA TACCAGAAGC GCTACCCGGA TTCTCATCTT GATCCTTTTG CTCTTAATTC CAATCCTGGC GGGCAGGCCG CCGGCCCTGG GGATAAGGGG CTTTTTAAAC GATCGCGCCG CGGACATCAT CGGCGCCCGG CCCCTGGCGG CGCCGGGCCT GGTGGTCCCG GAGGGTCTCA CCGGCAAGGG GGAGATCGTC GGCCTGGCCG ACAGCGGCCT GGACGCAGGC AGCGTGGACG ATATCCACCC CGACCTGCAC AGTACCCCTG GCCAGATGCC CAAGGTAGTC ATGCTGAAGA GCTGGGCCGG CCGGGAGAAG GCCGACGACC CCATCGGCCA CGGTACCCAC CTGGCCGGCA TCATTGCCGG CAGCGGCGCC GCCTCAGGGG GGCAGTACCG GGGCATCGCC CCAGGGGCCA GTATCTATTT CCAGGGCCTC CTGGATAAAG AGGGGGCACT GTCCCCTCCG GGAGACCTGG CCAGCCTCTT CCGGGCGGCC TATGATGCCG GCGTCCGCAT CCACGTCGAC GGCTGGGGCG GCGGCCCCAA CGCCTACCGG GATGTGAGCG CCCAGATCGA CGCCTTTGTC CGCCAGAACC CCGACTTCCT CCCCATCTTT GGCGCCGGCA ACGGAGGCCC GGTCCAGGGC TCCCTCACCG CCGAGGCAAA CAGCAAAAAC GCCCTGGTGA TCGGGGCGAG CGAATCGGTA CGGCCGGCCT TCAGTCCCGA CGCCAACGAC GCCCACCGCC AGGCCGATTT CTCCAGCCGC GGCCCGGCCG GGGACGGCCG CCTCCGGCCC GACCTCCTGG CTCCGGGTTC AGCCCTGGTT TCTACCCGCT CCCGCCTGGT GGATTCCAAT TTTCCCGCCA ACCAGCAGTA CACCGTCATG GGCGGCACCA GCCAGGCGGC GGCCGTGGCC GGCGGCGCGG CCGCCGTCCT GCGCCAGTAC CTGAGGGGTA ACGGCTTTCC CGACCCACCG GCGGCCCTCA TGAAGGCCGC CCTGGTCAAC GCTGCCTGGA CCCCACAGGA AGGTCCCACG GCCGCCTTTC CCGGTATCCT CGACCTGGCC GGCACCATCC TGGCCTTGGA GGAAGGGACC ATGCACCTGG CGGACGGCAA CCGGGGCTTG GGCGAGGGGC AGGAGGCGAC CTTCCAGTTC CAGGTGACGG ACAGCGGCGC TCCCCTCCGG GCCACCCTGG CCTGGACCGA CCCGGCCCCG GCCCCGGGAG CTACCACCGC CTTGGTCAAC GACCTGGATC TCACGGTCAT TGCCCCCGAC GGCAGAACGT ACGCCGGTAA CGACTTCGCC GGGGAAGGCC AGCTTGATCG GCGTAACAAC TTGGAAAAAG TTTACATAAA GAATCCGACC CCCGGCACAT ACACCATCAA AGTAAAGGCA GCGGCCGTCA GGAAAAACGC CCTTCCCGGC AACCCCGTCC CCGCCCAGGA CTTCGCCCTG GCCTACGGCC AGCCCCTGGA GCGGGGAGTG GTGGCCGGGG CCACCGCCGA TGGCCGCGTA ACCTTGAGCG ACGGCCGGAC CATAACCGCC CCGTCGGGGG GGATAAAAAA CTGCGTCGAC GGTGCCATAG CCCCGGCCGA CGCTGCCTAT ATCCTCCCCG GTGCCGACGC CTACCTGGGG CCCCGCACCC TGTACATAGT GGCCCGGCGC TGGCAGGCCG CTGGCGTTCA GGCCCTGGCC ACCAGCGGGG GGGCCCTCTT CCTGGAGATA AACAGCCAGG CCCGTACCGG CGGCTATTAC TTCAATACCG CGGGCAGCCT GGCCCTCAAC 
GGCCGGCCGG TGGGGGTGGC CGACCTGCAG CCCGGCTTTG ACCTGGCGGC CACCATCAAC CCCTCCACCG GGACCCTGTG GCAGGTCCGG GCCGGTTACC AGGAAAAGGA GGGCTTCCTG GCCTACGTCA ACCTGGCCAA AAGGGAGCTC CGGATCCTGG GGGATGACAG GTCCTATTCC TTTAGCCCCC GGGTCGCCGT CACCTTCGCC GACAGCCTGG TGGAGGCCGC CCCGGCCGAC CTGCCATACG GTGCTGCCGA CCGGGCCGAC CCCTCCACCC TGGTGCCCGG CATGGCTGTG CGCCTGGTCC TGGACCCGGA GACGGGCCAG GTCCAGTATA TCGCCGTCAA AAGGGAACTG GCCCTGGGGA CGATCGCCGG GGTGAAGGGC GATACCCTCA CCCTGGCCAC GGGAACTGAC TATACCCTCT TTCCCGGCGC GCCGGTCCGG CGGGACGGCC GGGAGGCCGG CGTGGCGGAC CTCCAGCCCG GCGACTGGGT CATCCTGAAC CTGATGCCCG GCAGCCACCA GATTATAGCC CTGACGGCCT ACAGCAACGT AACCTACGGC CGCGTCCTTT ACCTCAGCGG CGACCGTCAC AGTCTCTACC TCATGGACTG CACCAACCAG TTCCGGAGCT ATAACCTGGA TGACAGCACC AGGGTCTTCC GCTGGGGGCT GCCCGTCAGC GCCGCCACCC TGAGCCCCGG AGACTGGGTG CGCCTGACGG CGGTGCCGGG GGAGACGACG GCCTGGCGGG TGGACGCCGT CTCCCCTGCC GGTGAAGTGG ATAGAGTCCT GGCCGGGGTC GATGGGGAAA AGGGTCTCCT GCGTACCGCC GACGGCAGCA CCTATACCCT CACCAGTCGT ACCCTGGTGA CTCTGAACGG CTACCGGGTG GCGGCCGCTG ACCTGCCGGC CTGGCTGCCC GTCAGCCTTA CCCTCCTGGA AGGACCGGAA AGGCCGATCC TGGCCCGGGT GGCGGCCAGC AGCTTCCCAG GCAGCCAGCC GCCCCGCCTG GAGGTTTCCG TTCTGCAATC AGCAGGTGAG GTGGTCTTGA AAGGGACTAC CAGCGCCGAC CGTCTCTACC TGCAGCACGA TAACGGGGCG CAGGAGGTCG TGCCGGTAGG AACCGGCGGC AGCTTTCAGT GGCGGGTGCC CACCGGGGAA AAGAACGTCC TCCTGATCGC CCTGGATCGA ACCACCAGCG GGGTGACCGG CCAGAAGCTC GACCTTACCA GCATTCAGGA TGAGGGCTTC TGGGATACCC GCGGCCACTG GGCCGAGAAT GATATTAACA AAATGGCCGC CAGGGGCCTG GTCGCCGGGT ATGAAGACGG CAGCTTCCGG CCCGACAACC CCGTCACCCG CGTGGAACTC ATCGCCTTCC TGGTACGCCT GGCCGGCTGG CGGGTGCCCG CGGGCAGTCA GCCGGACTTT ACCGACCGCC AGATCATCCC TGACTGGGCC CGGGCGACGG TAGCCGTGGC CTTAGAGCGC GGCCTGGTGA GCGGGTATCC TGACGGCAGC TTCCGGCCGG GCCAGGTAGT CAGCCGGGTC GAAGCCGCCG CCTTCTTTAC CCGCTATCTG GAGATCGCGG GCAAGCTACC CTCCGGCAGC GCGCCGGGAG CTTCAGGCCC GGCCACCTCC TCGTCGGGCA CAACGGCGGC AAGTAATAGT GCCTCCGCGG GTACGGCGGG CCCGGGGAGC CGGTCGGCAG GCGCTTTATC GTCAAGCCAG CCCCCGCCCT TTACCGACTG GGACTCCGTC CCGGTGTGGG GCCGGGAGGC CGTGGCCCGC GCCTACGCCG 
CCGGCCTCAT GGGCGGCATG GCCCCGGGCG TCTTCGCCCC CCTTTCCCCC CTCACCCGCG CCCAGGCCGC CGCCATTATG GCGCGGATGC TGTAG
|
Protein sequence | MGKSLHNTRS ATRILILILL LLIPILAGRP PALGIRGFLN DRAADIIGAR PLAAPGLVVP EGLTGKGEIV GLADSGLDAG SVDDIHPDLH STPGQMPKVV MLKSWAGREK ADDPIGHGTH LAGIIAGSGA ASGGQYRGIA PGASIYFQGL LDKEGALSPP GDLASLFRAA YDAGVRIHVD GWGGGPNAYR DVSAQIDAFV RQNPDFLPIF GAGNGGPVQG SLTAEANSKN ALVIGASESV RPAFSPDAND AHRQADFSSR GPAGDGRLRP DLLAPGSALV STRSRLVDSN FPANQQYTVM GGTSQAAAVA GGAAAVLRQY LRGNGFPDPP AALMKAALVN AAWTPQEGPT AAFPGILDLA GTILALEEGT MHLADGNRGL GEGQEATFQF QVTDSGAPLR ATLAWTDPAP APGATTALVN DLDLTVIAPD GRTYAGNDFA GEGQLDRRNN LEKVYIKNPT PGTYTIKVKA AAVRKNALPG NPVPAQDFAL AYGQPLERGV VAGATADGRV TLSDGRTITA PSGGIKNCVD GAIAPADAAY ILPGADAYLG PRTLYIVARR WQAAGVQALA TSGGALFLEI NSQARTGGYY FNTAGSLALN GRPVGVADLQ PGFDLAATIN PSTGTLWQVR AGYQEKEGFL AYVNLAKREL RILGDDRSYS FSPRVAVTFA DSLVEAAPAD LPYGAADRAD PSTLVPGMAV RLVLDPETGQ VQYIAVKREL ALGTIAGVKG DTLTLATGTD YTLFPGAPVR RDGREAGVAD LQPGDWVILN LMPGSHQIIA LTAYSNVTYG RVLYLSGDRH SLYLMDCTNQ FRSYNLDDST RVFRWGLPVS AATLSPGDWV RLTAVPGETT AWRVDAVSPA GEVDRVLAGV DGEKGLLRTA DGSTYTLTSR TLVTLNGYRV AAADLPAWLP VSLTLLEGPE RPILARVAAS SFPGSQPPRL EVSVLQSAGE VVLKGTTSAD RLYLQHDNGA QEVVPVGTGG SFQWRVPTGE KNVLLIALDR TTSGVTGQKL DLTSIQDEGF WDTRGHWAEN DINKMAARGL VAGYEDGSFR PDNPVTRVEL IAFLVRLAGW RVPAGSQPDF TDRQIIPDWA RATVAVALER GLVSGYPDGS FRPGQVVSRV EAAAFFTRYL EIAGKLPSGS APGASGPATS SSGTTAASNS ASAGTAGPGS RSAGALSSSQ PPPFTDWDSV PVWGREAVAR AYAAGLMGGM APGVFAPLSP LTRAQAAAIM ARML
|
| |
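
The sequence fields above are printed in space-separated blocks of ten. As a minimal sketch, the helpers below strip that formatting, compute a GC fraction, and translate codons so the gene sequence can be compared with the protein sequence. The codon table is built from the standard NCBI amino-acid string, which translation table 11 shares for all 64 codons (the tables differ in permitted start codons). The function names are illustrative and not part of any database tooling.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last two blocks of the Gene sequence field.
head = clean_sequence("ATGGGGAAAA GTCTACATAA TACCAGAAGC")
tail = clean_sequence("GCGCGGATGC TGTAG")

print(translate(head))  # MGKSLHNTRS, the first block of the protein sequence
print(translate(tail))  # ARML*, the last four residues plus the stop codon
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field. The gene lies on the minus strand, but the sequence is shown in coding orientation (it begins with ATG), so no reverse complement is applied here.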