Gene Information |
Locus tag | Cthe_2086 |
Symbol | |
ID | 4810946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2478446 |
End bp | 2480965 |
Gene Length | 2520 bp |
Protein Length | 839 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107493 |
Product | peptidase U32 |
Protein accession | YP_001038486 |
Protein GI | 125974576 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
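The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2478446
END_BP = 2480965


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 2520 bp
print(protein_length(length_bp), "aa")  # 839 aa
```

Under these assumptions both values agree with the Gene Length and Protein Length fields in the table.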
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGAG ATTTTAAACT GGAATTGCTT GCCCCCTGCG GAGACTGGGA AGCATTTATG GCAGCCGTCG AAAACGGTGC GGACGCGGTA TACGTGGGGG GAAAGTTGTT TAATGCAAGG CAGTATGCTT CAAACTTTGA TGAGGAAAAA ATCAAAGAGG TAATACACTA TGCCCATGTG CGGGATGTAA ATGTATATCA GACCATGAAC ATCCTGATAA GCGACAGTGA GATGAGAGAG GCGCTCAAAG CATTGGAGCG GTCGTACCTT GCAGGTATTG ACGGGGTGAT AGTCCAGGAT ATCGGACTGG CGAGTTTGAT AAGAAAGCTG TATCCGGATC TTGCACTTCA CGCAAGCACA CAGATGACAG TATACAATTT GCAGGGCGTA AAGCTGCTCG AGGAACTGGG ATTTAAAAGG GTTGTGCTTG CAAGGGAGTT GTCGCTGGAG GAAATACAAT ATATTACTGA AAATACTTCA CTGGAGGTGG AAGTGTTTGT TCATGGGGCG TTGTGTGTCT GCTATTCGGG ACAATGCCTT ATGAGCAGTA TTATTGGAGG AAGAAGCGGA AACCGCGGAA AATGTGCCCA GCCCTGCAGG CTTCCGTATC AGCTTCTGGA AGTTGGCGAA GGAAGCGGTC TGCCTCAAAG AAAAGCGAAC AGAGGGTATT TTATGAGCCC AAAAGACCTG TGCTCTGTTG ATATTTTGGA TAAAATTATA AAAAGTGGTG TAAAATCGCT TAAAATTGAA GGCAGAATGA AAAGCGCCGA GTATGTGGCC ACCGTGGTGA GGATTTACAG AAAATATCTT GACAGGCTGT TTGAGAGTAC GGACAGTCGT AATGAAGGTA TTGTGGAAAA GGATATGAAG GACCTTCTCC AGATATTCAA CCGGGGGGGC TTCTCAAAAG GATATCTGGA AGGAAAAACG GGAAAAGATA TGATGAGCTT TGAGAAGCCT AAAAACTGGG GAATATACGT GGGAAAAGTA ATGGCCTGTG ACAGGGCGCA AGGCAGCATA AAAATAAAAC TTGAGGAACC TTTAAGCCTT GGTGACGGGA TAGAGGTGTG GAACGGTGAG GATGAAAGCC CGGGAACAAT TGTAACGTCA ATCCGGGTAA ACGGCAAGGC AGTGACGGAA GCACTGCCGC AGCAGGTGGT TGAGGTAAGA AACGTCAAAG GCAGGATAAA CAAGGGAAAC AAAGTTTACA AGACGTCCGA CAAAAAGCTT AATGCTTCTG CCAGAGAATC TTTCACCGGA AAATTCAAAA AGAGAATTCC CATTGAAGGA AGGATTACTG TGGCGGGAGG CAAACCTCTG TCAATTATTG TGAAGGATTA TGAGGGAAAC AAAGTTGAAG TCAAGTCCTC ATACGTGCCT GAGAAAGCTC TGACAAGTCC CGTTACCGAA GAGAAAGTTT TGAAACAGGC GGCAAAAACC GGACAGACTC CTTTTGAATT TAAAGAATTG CTCGCCGATG TGGAAGACGG TTTGTCCGTA CCCGTAAGTG AAATCAACAA TATTCGGCGT CATGCACTAA ATCAGCTGGA GATAAAAAGA ACCGACAGAT ATCCCTTAAG AAAGCCGGGA AATTTGCAAG AAAAATTGGA GGATGTGATG CATTTCCCGG GAAATAGTCG AAACGGGGAG GAAAAAAATT TAAAAATTTC GGCATGTTTT TACAAAGACA TGGCCGGGCT TGAATATGAA AGCCTTGGAG TGGATCGCAT CTACCTTCCT TTCAGCATGT TTGTAAAGGA AAACAAAGAA AGGATTTTGA GCATTAAAGA AAATGCAGAG 
CTGTTTGTAT TTATTCCCCC GGTAACCAGG GGAAATTATG ACAAGCTGAT AAAATCCAGG CTTGATGATA TTGTAAATAT GGGAATTGAC GGAATTCTTG CGGGGAACCC CGGCACTGTG AAATATGCCG GAGCATACCC AAAAATCCGT ATTATGGGGG ACTTTTCTCT GAACATATTT AACAGTGTTT CAATAAAAAC TCTCAAGGAT ATGGGGCTTA ACGGGGCGAC TTTGTCCTGC GAGCTTAATT TGAATCAGAT AAGGGAGATG GGGAAGTTTC CGGATTTTGT GGAAGAAGTG CTGGTATACG GAAGAATACC CCTTATGATC AGTGAGTATT GTCCGGTTGG GAGCATAAAA GGCAATTTCG GCAAAAACTC CAGATGCAGC ATGCCTTGCA AAGACAAAGA TTTTTACCTT GTGGACAGAA TGAACATGAA ATTTCCCGTC CTGTGCGACA GGATTGACTG CAGAAGCATG ATTTTCAACG CAAAAGTATT GCTGCTTTCA GATACTGTTG ATAGAATTAA AACATTGGGT ATTGATATGG TACGGCTTAA TTTTACGGAT GAAAATCCAA AAGAAGTAAA AGACATAGTG AAAATGCACA GGGATCTTTT AAATAACGGT TCCGGGGCGT TAGACTCTTA TAAGCAGTTG ATTGATAAAA TAAAAAGCAG AGGCTTTACA AAAGGGCATT TCCCAAGGGG TGTCCAGTAA
|
Protein sequence | MTRDFKLELL APCGDWEAFM AAVENGADAV YVGGKLFNAR QYASNFDEEK IKEVIHYAHV RDVNVYQTMN ILISDSEMRE ALKALERSYL AGIDGVIVQD IGLASLIRKL YPDLALHAST QMTVYNLQGV KLLEELGFKR VVLARELSLE EIQYITENTS LEVEVFVHGA LCVCYSGQCL MSSIIGGRSG NRGKCAQPCR LPYQLLEVGE GSGLPQRKAN RGYFMSPKDL CSVDILDKII KSGVKSLKIE GRMKSAEYVA TVVRIYRKYL DRLFESTDSR NEGIVEKDMK DLLQIFNRGG FSKGYLEGKT GKDMMSFEKP KNWGIYVGKV MACDRAQGSI KIKLEEPLSL GDGIEVWNGE DESPGTIVTS IRVNGKAVTE ALPQQVVEVR NVKGRINKGN KVYKTSDKKL NASARESFTG KFKKRIPIEG RITVAGGKPL SIIVKDYEGN KVEVKSSYVP EKALTSPVTE EKVLKQAAKT GQTPFEFKEL LADVEDGLSV PVSEINNIRR HALNQLEIKR TDRYPLRKPG NLQEKLEDVM HFPGNSRNGE EKNLKISACF YKDMAGLEYE SLGVDRIYLP FSMFVKENKE RILSIKENAE LFVFIPPVTR GNYDKLIKSR LDDIVNMGID GILAGNPGTV KYAGAYPKIR IMGDFSLNIF NSVSIKTLKD MGLNGATLSC ELNLNQIREM GKFPDFVEEV LVYGRIPLMI SEYCPVGSIK GNFGKNSRCS MPCKDKDFYL VDRMNMKFPV LCDRIDCRSM IFNAKVLLLS DTVDRIKTLG IDMVRLNFTD ENPKEVKDIV KMHRDLLNNG SGALDSYKQL IDKIKSRGFT KGHFPRGVQ
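The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAA, although the gene lies on the minus strand of the replicon), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons, which table 11 permits, are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by
# translation table 1 (standard) and table 11 (bacterial).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence, copied from the record.
print(translate("ATGACAAGAG ATTTTAAACT GGAATTGCTT"))  # MTRDFKLELL
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to `translate` should, under the same assumptions, reproduce the full protein.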