Gene Moth_1712 details

Gene Information

Locus tag: Moth_1712
Symbol:
ID: 3833162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1751702
End bp: 1754290
Gene Length: 2589 bp
Protein Length: 862 aa
Translation table: 11
GC content: 64%
IMG OID: 637829637
Product: peptidase U32
Protein accession: YP_430557
Protein GI: 83590548
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
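
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1751702
end_bp = 1754290
gene_length_bp = 2589
protein_length_aa = 862

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final stop codon is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 2589 0 862
```

Under these assumptions the span matches the Gene Length field and the codon count minus the stop codon matches the Protein Length field.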


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000390003
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.623925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAC CAGAACTCAT GGCCCCGGCC GGGAACCAGG AAGCCCTGAA GGCGGCCATC 
GCCAACGGCG CCGACGCCGT TTACCTGGGC GGCCGGCAGT TTAACGCCCG GGCAGGCGCT
GATAATTTTG ATCGGGATGG GATCCTGGCG GCCCTGGATT ATGCCCATGA AAGGGGCTGC
CGCGTTTATG TCACCGTAAA CATTCTCCTG GCTGACCGGG AACTCCCTGC GGCCATGGAT
TACCTCTATT TTTTGGGAGC GGCGCGAGTG GATGGTGTCA TCGTCCAGGA CCTGGGACTG
GCCCATCTCG CCCGGCGGCT CCTGCCGGAA CTACCCCTCA TCGGTAGTAC CCAGATGACG
GTGACCAATG CCGCTGGAGT CAAATACCTG GAGCAACTGG GCTTCAAGCG GGTGGTCCTT
GCCCGGGAGC TGTCCCTGGA CGACATCAGG GCCATCAGGG AGCAGGTAGA GCTGGAACTG
GAAGCCTTCG TCCACGGGGC TCTTTGCTTT TCTTACTCGG GTCAGTGCTT GTTAAGCAGC
ATGATTGGGG GCCGCAGCGG CAATCGTGGC CGCTGCGCCC AGCCCTGTCG CCTGGCCTAC
ACCCTGGTGG ACGAGGCCGG CCATCCCCTG GAAGCGAAAC CGGAACACCT GTTAAGCACC
CGCGACCTCT ATACCCTGGA CCGGATTCCG GATTTGCTGG CTGCCGGGGT CACGGCCTTC
AAGATCGAAG GCCGCCTGCG GCGGCCGGAG TATGTAGCCG TGGTGACCAG GGCCTACCGG
CGGGTCATTG ATCGCTACCT GGCCGATCCC CGGGGATTTG CCGTTTCCCC GGCAGAACGG
GCGGAAGTGG CCCAGATCTT TAACCGCGAT TTTACGCCGG GTTACCTGGA CGGCGACCCG
GGGGTCGAAC TCATGGGGTA CGGCCGACCC AGTAACCGGG GTCTTTACCT GGGCCGGGCC
GGCCGGCGCC AGGGGGAGCG CTGGCTGGTA CGCCTGGAGG CCCCCTTGCG CCGGGGGGAC
GGCCTGGATG TCTGGGTCAG CCGGGGCGGC CATCAAGGGA TAGTGGTTCA CCATATCTGG
CAGGAGGGCC GGGAGGTGCC CCAGGCACCG CCAGGGACCA CCGTGGCCCT GGAGCTGCCC
CCTGCCACCC GGCCGGGGGA CCGGATCTTT AAAACCAGCG ATGTGGAGCT CCTGGAGGAG
GTCCGGCGTA CCTATACCTC GCCCCGGGAA GAGCCCCGGG TGCCGCTGAC CATGGCAGTG
CGGGGGCGAC CGGGGGACCC CCTGGAACTG GAGGTTGTCG ACCCCGGCGG CAACCGTGTC
CAGGCCCGGA CCGCTGTAGC CGCGGCAGTG GCTAAACGCC ATCCCCTGGA TATGGCCACT
CTAACGGCCC AGCTGGGCCG CCTGGGCAAC ACACCCTACC GGTTGGACCG GCTGGTGGCG
CACTTGGAAG GACCGGTCAT GGTACCTTTA AGTGAATTAA ACCGGTTGCG CCGGGAGGCC
ATTGAGGAAC TCCGGCAGAA GCGCCTATCC TCCTGGCCGC AACGGGTACC GTCTCCGGAG
TCCTTCCGTA CCGGCCTGGA AGTTTGCCTA ACGCCCCGGG GACGGGTGCA AGCACCAACC
ACGAAAACGA CAATAACAGG GTATACGGGG TTACCAGGGC CGGTAGCAGG AAACGGCTAT
CACCGGCCCC GCCTGGCTGT AGCCGTAGGC GACGGTGAAG GAGCCCGGGT CGCCCTGGCC
GCAGGTGCCG GGAGGGTGTA CCTGGCCGGG GAAATCTGGC AGGGGAAAGA AACCCTGGAT
ACAGGCGACC TGCGGGAACT GGTGACCCTG GCCGGGGAGA AGGGGGCAGA AGTCATTCCC
GCCCTGCCGC GCCTGTGGCA CGAAAAGGAG GCCGGCAGGG TAAAGAAGCG CCTGGAACAA
TTTATGGAAG CCGGGGCCAG ATTGATAATG GTGGCCAATC CGGGCGGCCT GGAGCTATTA
CAAGAATATC ACCTGGCGGG ATGGGGCGAT TATCCTTTAA ATGTATTTAA CGTCACCGCA
GTAGAGGCTT TGGCCGCTGC CGGTTTACAA GGTGTGACCC TGTCGCCGGA GTTAAATCTG
GAACAGCTGC GGGAGTTTAA GTCCCGGGCA CCGGGCCTGC CCCTGGAAGG CATCGTCCAC
GGGTCCCTGC CCCTGATAGT CTCGGCCCAC TGCGTCCTGG GAGCGCGGCT GGGGGGCAAA
AAACCGGGGC AGGTTTGCAC GGCTCCCTGC CGCCGGGGTC GCTATGGACT GAAGGATCGC
CTGGGGCTGG TGTTCCCGGT AGCTACCGAC CGGCAGTGTC GCTTTTATTT ATATAACCCC
AAAGAAATGT GCCTCGTGGA CCATCTGGCG GCCATCGCCG GCCTGGGTCT GGCCTGGATT
CGCATTGAGG CCCGGGAAAA GCCTCCCGGC TATATCCGCC GGGTGACGGC CCTCTACCGG
GAGGCCCTGG CCGCCCTGGG GACCAGGGAA GAAAGCAGGG TTCTGGGGGC GGCCGCCCGG
GAGGCGGAAG CCCTGGCCCC GGCGGGCATT ACCCGTGGCC ACTATTTCCG GGGAGTTATT
GATGTTTAA
 
Protein sequence
MNKPELMAPA GNQEALKAAI ANGADAVYLG GRQFNARAGA DNFDRDGILA ALDYAHERGC 
RVYVTVNILL ADRELPAAMD YLYFLGAARV DGVIVQDLGL AHLARRLLPE LPLIGSTQMT
VTNAAGVKYL EQLGFKRVVL ARELSLDDIR AIREQVELEL EAFVHGALCF SYSGQCLLSS
MIGGRSGNRG RCAQPCRLAY TLVDEAGHPL EAKPEHLLST RDLYTLDRIP DLLAAGVTAF
KIEGRLRRPE YVAVVTRAYR RVIDRYLADP RGFAVSPAER AEVAQIFNRD FTPGYLDGDP
GVELMGYGRP SNRGLYLGRA GRRQGERWLV RLEAPLRRGD GLDVWVSRGG HQGIVVHHIW
QEGREVPQAP PGTTVALELP PATRPGDRIF KTSDVELLEE VRRTYTSPRE EPRVPLTMAV
RGRPGDPLEL EVVDPGGNRV QARTAVAAAV AKRHPLDMAT LTAQLGRLGN TPYRLDRLVA
HLEGPVMVPL SELNRLRREA IEELRQKRLS SWPQRVPSPE SFRTGLEVCL TPRGRVQAPT
TKTTITGYTG LPGPVAGNGY HRPRLAVAVG DGEGARVALA AGAGRVYLAG EIWQGKETLD
TGDLRELVTL AGEKGAEVIP ALPRLWHEKE AGRVKKRLEQ FMEAGARLIM VANPGGLELL
QEYHLAGWGD YPLNVFNVTA VEALAAAGLQ GVTLSPELNL EQLREFKSRA PGLPLEGIVH
GSLPLIVSAH CVLGARLGGK KPGQVCTAPC RRGRYGLKDR LGLVFPVATD RQCRFYLYNP
KEMCLVDHLA AIAGLGLAWI RIEAREKPPG YIRRVTALYR EALAALGTRE ESRVLGAAAR
EAEALAPAGI TRGHYFRGVI DV
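
The relationship between the two sequences can be illustrated by translating part of the gene sequence and comparing the result with the protein sequence. This is a minimal sketch, not the database's own method: it uses the standard codon assignments, which NCBI translation table 11 shares (table 11 differs from the standard table in its permitted start codons). The `translate` helper and the variable names are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# codon-to-amino-acid mapping ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above (10-base groups kept as shown).
first_row = "ATGAATAAAC CAGAACTCAT GGCCCCGGCC GGGAACCAGG AAGCCCTGAA GGCGGCCATC"
last_row = "GATGTTTAA"

print(translate(first_row))  # prints: MNKPELMAPAGNQEALKAAI
print(translate(last_row))   # prints: DV
```

The first row yields the first 20 residues of the protein sequence, and the last row yields its final two residues followed by the TAA stop codon. The last row can be translated on its own because the preceding rows total 2580 bases, a multiple of three, so it begins in frame.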