Gene Moth_2364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2364 
Symbol 
ID3832544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2484301 
End bp2488005 
Gene Length3705 bp 
Protein Length1234 aa 
Translation table11 
GC content68% 
IMG OID637830283 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_431189 
Protein GI83591180 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00562436 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000368528 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGAAAA GTCTACATAA TACCAGAAGC GCTACCCGGA TTCTCATCTT GATCCTTTTG 
CTCTTAATTC CAATCCTGGC GGGCAGGCCG CCGGCCCTGG GGATAAGGGG CTTTTTAAAC
GATCGCGCCG CGGACATCAT CGGCGCCCGG CCCCTGGCGG CGCCGGGCCT GGTGGTCCCG
GAGGGTCTCA CCGGCAAGGG GGAGATCGTC GGCCTGGCCG ACAGCGGCCT GGACGCAGGC
AGCGTGGACG ATATCCACCC CGACCTGCAC AGTACCCCTG GCCAGATGCC CAAGGTAGTC
ATGCTGAAGA GCTGGGCCGG CCGGGAGAAG GCCGACGACC CCATCGGCCA CGGTACCCAC
CTGGCCGGCA TCATTGCCGG CAGCGGCGCC GCCTCAGGGG GGCAGTACCG GGGCATCGCC
CCAGGGGCCA GTATCTATTT CCAGGGCCTC CTGGATAAAG AGGGGGCACT GTCCCCTCCG
GGAGACCTGG CCAGCCTCTT CCGGGCGGCC TATGATGCCG GCGTCCGCAT CCACGTCGAC
GGCTGGGGCG GCGGCCCCAA CGCCTACCGG GATGTGAGCG CCCAGATCGA CGCCTTTGTC
CGCCAGAACC CCGACTTCCT CCCCATCTTT GGCGCCGGCA ACGGAGGCCC GGTCCAGGGC
TCCCTCACCG CCGAGGCAAA CAGCAAAAAC GCCCTGGTGA TCGGGGCGAG CGAATCGGTA
CGGCCGGCCT TCAGTCCCGA CGCCAACGAC GCCCACCGCC AGGCCGATTT CTCCAGCCGC
GGCCCGGCCG GGGACGGCCG CCTCCGGCCC GACCTCCTGG CTCCGGGTTC AGCCCTGGTT
TCTACCCGCT CCCGCCTGGT GGATTCCAAT TTTCCCGCCA ACCAGCAGTA CACCGTCATG
GGCGGCACCA GCCAGGCGGC GGCCGTGGCC GGCGGCGCGG CCGCCGTCCT GCGCCAGTAC
CTGAGGGGTA ACGGCTTTCC CGACCCACCG GCGGCCCTCA TGAAGGCCGC CCTGGTCAAC
GCTGCCTGGA CCCCACAGGA AGGTCCCACG GCCGCCTTTC CCGGTATCCT CGACCTGGCC
GGCACCATCC TGGCCTTGGA GGAAGGGACC ATGCACCTGG CGGACGGCAA CCGGGGCTTG
GGCGAGGGGC AGGAGGCGAC CTTCCAGTTC CAGGTGACGG ACAGCGGCGC TCCCCTCCGG
GCCACCCTGG CCTGGACCGA CCCGGCCCCG GCCCCGGGAG CTACCACCGC CTTGGTCAAC
GACCTGGATC TCACGGTCAT TGCCCCCGAC GGCAGAACGT ACGCCGGTAA CGACTTCGCC
GGGGAAGGCC AGCTTGATCG GCGTAACAAC TTGGAAAAAG TTTACATAAA GAATCCGACC
CCCGGCACAT ACACCATCAA AGTAAAGGCA GCGGCCGTCA GGAAAAACGC CCTTCCCGGC
AACCCCGTCC CCGCCCAGGA CTTCGCCCTG GCCTACGGCC AGCCCCTGGA GCGGGGAGTG
GTGGCCGGGG CCACCGCCGA TGGCCGCGTA ACCTTGAGCG ACGGCCGGAC CATAACCGCC
CCGTCGGGGG GGATAAAAAA CTGCGTCGAC GGTGCCATAG CCCCGGCCGA CGCTGCCTAT
ATCCTCCCCG GTGCCGACGC CTACCTGGGG CCCCGCACCC TGTACATAGT GGCCCGGCGC
TGGCAGGCCG CTGGCGTTCA GGCCCTGGCC ACCAGCGGGG GGGCCCTCTT CCTGGAGATA
AACAGCCAGG CCCGTACCGG CGGCTATTAC TTCAATACCG CGGGCAGCCT GGCCCTCAAC
GGCCGGCCGG TGGGGGTGGC CGACCTGCAG CCCGGCTTTG ACCTGGCGGC CACCATCAAC
CCCTCCACCG GGACCCTGTG GCAGGTCCGG GCCGGTTACC AGGAAAAGGA GGGCTTCCTG
GCCTACGTCA ACCTGGCCAA AAGGGAGCTC CGGATCCTGG GGGATGACAG GTCCTATTCC
TTTAGCCCCC GGGTCGCCGT CACCTTCGCC GACAGCCTGG TGGAGGCCGC CCCGGCCGAC
CTGCCATACG GTGCTGCCGA CCGGGCCGAC CCCTCCACCC TGGTGCCCGG CATGGCTGTG
CGCCTGGTCC TGGACCCGGA GACGGGCCAG GTCCAGTATA TCGCCGTCAA AAGGGAACTG
GCCCTGGGGA CGATCGCCGG GGTGAAGGGC GATACCCTCA CCCTGGCCAC GGGAACTGAC
TATACCCTCT TTCCCGGCGC GCCGGTCCGG CGGGACGGCC GGGAGGCCGG CGTGGCGGAC
CTCCAGCCCG GCGACTGGGT CATCCTGAAC CTGATGCCCG GCAGCCACCA GATTATAGCC
CTGACGGCCT ACAGCAACGT AACCTACGGC CGCGTCCTTT ACCTCAGCGG CGACCGTCAC
AGTCTCTACC TCATGGACTG CACCAACCAG TTCCGGAGCT ATAACCTGGA TGACAGCACC
AGGGTCTTCC GCTGGGGGCT GCCCGTCAGC GCCGCCACCC TGAGCCCCGG AGACTGGGTG
CGCCTGACGG CGGTGCCGGG GGAGACGACG GCCTGGCGGG TGGACGCCGT CTCCCCTGCC
GGTGAAGTGG ATAGAGTCCT GGCCGGGGTC GATGGGGAAA AGGGTCTCCT GCGTACCGCC
GACGGCAGCA CCTATACCCT CACCAGTCGT ACCCTGGTGA CTCTGAACGG CTACCGGGTG
GCGGCCGCTG ACCTGCCGGC CTGGCTGCCC GTCAGCCTTA CCCTCCTGGA AGGACCGGAA
AGGCCGATCC TGGCCCGGGT GGCGGCCAGC AGCTTCCCAG GCAGCCAGCC GCCCCGCCTG
GAGGTTTCCG TTCTGCAATC AGCAGGTGAG GTGGTCTTGA AAGGGACTAC CAGCGCCGAC
CGTCTCTACC TGCAGCACGA TAACGGGGCG CAGGAGGTCG TGCCGGTAGG AACCGGCGGC
AGCTTTCAGT GGCGGGTGCC CACCGGGGAA AAGAACGTCC TCCTGATCGC CCTGGATCGA
ACCACCAGCG GGGTGACCGG CCAGAAGCTC GACCTTACCA GCATTCAGGA TGAGGGCTTC
TGGGATACCC GCGGCCACTG GGCCGAGAAT GATATTAACA AAATGGCCGC CAGGGGCCTG
GTCGCCGGGT ATGAAGACGG CAGCTTCCGG CCCGACAACC CCGTCACCCG CGTGGAACTC
ATCGCCTTCC TGGTACGCCT GGCCGGCTGG CGGGTGCCCG CGGGCAGTCA GCCGGACTTT
ACCGACCGCC AGATCATCCC TGACTGGGCC CGGGCGACGG TAGCCGTGGC CTTAGAGCGC
GGCCTGGTGA GCGGGTATCC TGACGGCAGC TTCCGGCCGG GCCAGGTAGT CAGCCGGGTC
GAAGCCGCCG CCTTCTTTAC CCGCTATCTG GAGATCGCGG GCAAGCTACC CTCCGGCAGC
GCGCCGGGAG CTTCAGGCCC GGCCACCTCC TCGTCGGGCA CAACGGCGGC AAGTAATAGT
GCCTCCGCGG GTACGGCGGG CCCGGGGAGC CGGTCGGCAG GCGCTTTATC GTCAAGCCAG
CCCCCGCCCT TTACCGACTG GGACTCCGTC CCGGTGTGGG GCCGGGAGGC CGTGGCCCGC
GCCTACGCCG CCGGCCTCAT GGGCGGCATG GCCCCGGGCG TCTTCGCCCC CCTTTCCCCC
CTCACCCGCG CCCAGGCCGC CGCCATTATG GCGCGGATGC TGTAG
 
Protein sequence
MGKSLHNTRS ATRILILILL LLIPILAGRP PALGIRGFLN DRAADIIGAR PLAAPGLVVP 
EGLTGKGEIV GLADSGLDAG SVDDIHPDLH STPGQMPKVV MLKSWAGREK ADDPIGHGTH
LAGIIAGSGA ASGGQYRGIA PGASIYFQGL LDKEGALSPP GDLASLFRAA YDAGVRIHVD
GWGGGPNAYR DVSAQIDAFV RQNPDFLPIF GAGNGGPVQG SLTAEANSKN ALVIGASESV
RPAFSPDAND AHRQADFSSR GPAGDGRLRP DLLAPGSALV STRSRLVDSN FPANQQYTVM
GGTSQAAAVA GGAAAVLRQY LRGNGFPDPP AALMKAALVN AAWTPQEGPT AAFPGILDLA
GTILALEEGT MHLADGNRGL GEGQEATFQF QVTDSGAPLR ATLAWTDPAP APGATTALVN
DLDLTVIAPD GRTYAGNDFA GEGQLDRRNN LEKVYIKNPT PGTYTIKVKA AAVRKNALPG
NPVPAQDFAL AYGQPLERGV VAGATADGRV TLSDGRTITA PSGGIKNCVD GAIAPADAAY
ILPGADAYLG PRTLYIVARR WQAAGVQALA TSGGALFLEI NSQARTGGYY FNTAGSLALN
GRPVGVADLQ PGFDLAATIN PSTGTLWQVR AGYQEKEGFL AYVNLAKREL RILGDDRSYS
FSPRVAVTFA DSLVEAAPAD LPYGAADRAD PSTLVPGMAV RLVLDPETGQ VQYIAVKREL
ALGTIAGVKG DTLTLATGTD YTLFPGAPVR RDGREAGVAD LQPGDWVILN LMPGSHQIIA
LTAYSNVTYG RVLYLSGDRH SLYLMDCTNQ FRSYNLDDST RVFRWGLPVS AATLSPGDWV
RLTAVPGETT AWRVDAVSPA GEVDRVLAGV DGEKGLLRTA DGSTYTLTSR TLVTLNGYRV
AAADLPAWLP VSLTLLEGPE RPILARVAAS SFPGSQPPRL EVSVLQSAGE VVLKGTTSAD
RLYLQHDNGA QEVVPVGTGG SFQWRVPTGE KNVLLIALDR TTSGVTGQKL DLTSIQDEGF
WDTRGHWAEN DINKMAARGL VAGYEDGSFR PDNPVTRVEL IAFLVRLAGW RVPAGSQPDF
TDRQIIPDWA RATVAVALER GLVSGYPDGS FRPGQVVSRV EAAAFFTRYL EIAGKLPSGS
APGASGPATS SSGTTAASNS ASAGTAGPGS RSAGALSSSQ PPPFTDWDSV PVWGREAVAR
AYAAGLMGGM APGVFAPLSP LTRAQAAAIM ARML