Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1712 |
Symbol | |
ID | 3833162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1751702 |
End bp | 1754290 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829637 |
Product | peptidase U32 |
Protein accession | YP_430557 |
Protein GI | 83590548 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000390003 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.623925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC CAGAACTCAT GGCCCCGGCC GGGAACCAGG AAGCCCTGAA GGCGGCCATC GCCAACGGCG CCGACGCCGT TTACCTGGGC GGCCGGCAGT TTAACGCCCG GGCAGGCGCT GATAATTTTG ATCGGGATGG GATCCTGGCG GCCCTGGATT ATGCCCATGA AAGGGGCTGC CGCGTTTATG TCACCGTAAA CATTCTCCTG GCTGACCGGG AACTCCCTGC GGCCATGGAT TACCTCTATT TTTTGGGAGC GGCGCGAGTG GATGGTGTCA TCGTCCAGGA CCTGGGACTG GCCCATCTCG CCCGGCGGCT CCTGCCGGAA CTACCCCTCA TCGGTAGTAC CCAGATGACG GTGACCAATG CCGCTGGAGT CAAATACCTG GAGCAACTGG GCTTCAAGCG GGTGGTCCTT GCCCGGGAGC TGTCCCTGGA CGACATCAGG GCCATCAGGG AGCAGGTAGA GCTGGAACTG GAAGCCTTCG TCCACGGGGC TCTTTGCTTT TCTTACTCGG GTCAGTGCTT GTTAAGCAGC ATGATTGGGG GCCGCAGCGG CAATCGTGGC CGCTGCGCCC AGCCCTGTCG CCTGGCCTAC ACCCTGGTGG ACGAGGCCGG CCATCCCCTG GAAGCGAAAC CGGAACACCT GTTAAGCACC CGCGACCTCT ATACCCTGGA CCGGATTCCG GATTTGCTGG CTGCCGGGGT CACGGCCTTC AAGATCGAAG GCCGCCTGCG GCGGCCGGAG TATGTAGCCG TGGTGACCAG GGCCTACCGG CGGGTCATTG ATCGCTACCT GGCCGATCCC CGGGGATTTG CCGTTTCCCC GGCAGAACGG GCGGAAGTGG CCCAGATCTT TAACCGCGAT TTTACGCCGG GTTACCTGGA CGGCGACCCG GGGGTCGAAC TCATGGGGTA CGGCCGACCC AGTAACCGGG GTCTTTACCT GGGCCGGGCC GGCCGGCGCC AGGGGGAGCG CTGGCTGGTA CGCCTGGAGG CCCCCTTGCG CCGGGGGGAC GGCCTGGATG TCTGGGTCAG CCGGGGCGGC CATCAAGGGA TAGTGGTTCA CCATATCTGG CAGGAGGGCC GGGAGGTGCC CCAGGCACCG CCAGGGACCA CCGTGGCCCT GGAGCTGCCC CCTGCCACCC GGCCGGGGGA CCGGATCTTT AAAACCAGCG ATGTGGAGCT CCTGGAGGAG GTCCGGCGTA CCTATACCTC GCCCCGGGAA GAGCCCCGGG TGCCGCTGAC CATGGCAGTG CGGGGGCGAC CGGGGGACCC CCTGGAACTG GAGGTTGTCG ACCCCGGCGG CAACCGTGTC CAGGCCCGGA CCGCTGTAGC CGCGGCAGTG GCTAAACGCC ATCCCCTGGA TATGGCCACT CTAACGGCCC AGCTGGGCCG CCTGGGCAAC ACACCCTACC GGTTGGACCG GCTGGTGGCG CACTTGGAAG GACCGGTCAT GGTACCTTTA AGTGAATTAA ACCGGTTGCG CCGGGAGGCC ATTGAGGAAC TCCGGCAGAA GCGCCTATCC TCCTGGCCGC AACGGGTACC GTCTCCGGAG TCCTTCCGTA CCGGCCTGGA AGTTTGCCTA ACGCCCCGGG GACGGGTGCA AGCACCAACC ACGAAAACGA CAATAACAGG GTATACGGGG TTACCAGGGC CGGTAGCAGG AAACGGCTAT CACCGGCCCC GCCTGGCTGT AGCCGTAGGC GACGGTGAAG GAGCCCGGGT CGCCCTGGCC GCAGGTGCCG GGAGGGTGTA CCTGGCCGGG GAAATCTGGC AGGGGAAAGA AACCCTGGAT ACAGGCGACC TGCGGGAACT GGTGACCCTG GCCGGGGAGA AGGGGGCAGA AGTCATTCCC GCCCTGCCGC GCCTGTGGCA CGAAAAGGAG GCCGGCAGGG TAAAGAAGCG CCTGGAACAA TTTATGGAAG CCGGGGCCAG ATTGATAATG GTGGCCAATC CGGGCGGCCT GGAGCTATTA CAAGAATATC ACCTGGCGGG ATGGGGCGAT TATCCTTTAA ATGTATTTAA CGTCACCGCA GTAGAGGCTT TGGCCGCTGC CGGTTTACAA GGTGTGACCC TGTCGCCGGA GTTAAATCTG GAACAGCTGC GGGAGTTTAA GTCCCGGGCA CCGGGCCTGC CCCTGGAAGG CATCGTCCAC GGGTCCCTGC CCCTGATAGT CTCGGCCCAC TGCGTCCTGG GAGCGCGGCT GGGGGGCAAA AAACCGGGGC AGGTTTGCAC GGCTCCCTGC CGCCGGGGTC GCTATGGACT GAAGGATCGC CTGGGGCTGG TGTTCCCGGT AGCTACCGAC CGGCAGTGTC GCTTTTATTT ATATAACCCC AAAGAAATGT GCCTCGTGGA CCATCTGGCG GCCATCGCCG GCCTGGGTCT GGCCTGGATT CGCATTGAGG CCCGGGAAAA GCCTCCCGGC TATATCCGCC GGGTGACGGC CCTCTACCGG GAGGCCCTGG CCGCCCTGGG GACCAGGGAA GAAAGCAGGG TTCTGGGGGC GGCCGCCCGG GAGGCGGAAG CCCTGGCCCC GGCGGGCATT ACCCGTGGCC ACTATTTCCG GGGAGTTATT GATGTTTAA
|
Protein sequence | MNKPELMAPA GNQEALKAAI ANGADAVYLG GRQFNARAGA DNFDRDGILA ALDYAHERGC RVYVTVNILL ADRELPAAMD YLYFLGAARV DGVIVQDLGL AHLARRLLPE LPLIGSTQMT VTNAAGVKYL EQLGFKRVVL ARELSLDDIR AIREQVELEL EAFVHGALCF SYSGQCLLSS MIGGRSGNRG RCAQPCRLAY TLVDEAGHPL EAKPEHLLST RDLYTLDRIP DLLAAGVTAF KIEGRLRRPE YVAVVTRAYR RVIDRYLADP RGFAVSPAER AEVAQIFNRD FTPGYLDGDP GVELMGYGRP SNRGLYLGRA GRRQGERWLV RLEAPLRRGD GLDVWVSRGG HQGIVVHHIW QEGREVPQAP PGTTVALELP PATRPGDRIF KTSDVELLEE VRRTYTSPRE EPRVPLTMAV RGRPGDPLEL EVVDPGGNRV QARTAVAAAV AKRHPLDMAT LTAQLGRLGN TPYRLDRLVA HLEGPVMVPL SELNRLRREA IEELRQKRLS SWPQRVPSPE SFRTGLEVCL TPRGRVQAPT TKTTITGYTG LPGPVAGNGY HRPRLAVAVG DGEGARVALA AGAGRVYLAG EIWQGKETLD TGDLRELVTL AGEKGAEVIP ALPRLWHEKE AGRVKKRLEQ FMEAGARLIM VANPGGLELL QEYHLAGWGD YPLNVFNVTA VEALAAAGLQ GVTLSPELNL EQLREFKSRA PGLPLEGIVH GSLPLIVSAH CVLGARLGGK KPGQVCTAPC RRGRYGLKDR LGLVFPVATD RQCRFYLYNP KEMCLVDHLA AIAGLGLAWI RIEAREKPPG YIRRVTALYR EALAALGTRE ESRVLGAAAR EAEALAPAGI TRGHYFRGVI DV
|
| |