Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1010 |
Symbol | |
ID | 4811304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1207340 |
End bp | 1208575 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106428 |
Product | peptidase U32 |
Protein accession | YP_001037435 |
Protein GI | 125973525 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTGAACTGCT TGCTCCCGCA GGCAATCTTG AAAAACTTAA AATGGCCGTT TTATATGGTG CGGACGCCGT TTACCTGGGC GGTGAGGAAT TCAGCCTCAG AGCTTATGCC GAGAATTTTA CATTGGATGA GCTGAAAGCA GGAGTGGAAT TTGCCCATAG CAAAGGAAAA AAAGTATATG TAACCATTAA TATATTCCCT CACAATGATG ATTTGAAGAA AATACCGGAA TATATAAAAG AAGTTGCAGG GATCGGAGTC GATGCCATAA TCCTCTCAGA CCCCGGCATT CTCTCCATTG TGAAAGAAAT AGCTCCGGAT ATGGAAATAC ATTTAAGCAC CCAGGCCAAC AATACTAATT TTATGAGTGC CAGATTTTGG CACAATCACG GTGTAAAACG GATAATACTT GCAAGAGAGC TTTCCCTTGA GGAAATCCGG GAAATAAGAG AAAAAACTCC TGACTCTCTG GAGCTTGAAG TTTTTGTCCA CGGTGCCATG TGCATATCCT ATTCCGGAAG GTGTCTTCTC AGCAATTACA TGGCCGGCAG GGATTCCAAC AGGGGACTGT GCGCACATCC ATGCAGGTGG AAATATTACT TGATGGAGGA AAAAAGACCC GGTGAATACT ACCCGGTATA TGAAAATGAA AGAGGCACAT TCATTTTCAA CTCCAGGGAC CTCTGTATGA TTGAGCACAT ACCGGAATTG GTGGAATCCG GAGTTTCCAG CTTTAAAATT GAAGGCCGCA TGAAAAGCTC TTTCTACGTC GCAACGGTCG TAAAAGCATA TCGCGAAGCA ATAGATGCCT ATTATGAGGA TAAAGACAAC TATAAATTCG ATCCCAGGCT TTTGGAGGAA GTCTGCAAAG TCAGTCACAG GGAATTCACC ACCGGCTTTT TCTTCAACAA GCCCGGCCCG AAAGACCAGA TTTACGCCAC CAGCTCATAT ATAAGAGAGT ATGACTTTGT AGGGGTTGTT CAAAAATATG ACAAAGCAAC AAAAATAGCA ACCGTAGAAC AAAGAAACCG CATGTACAAA GGTGAGGAAA TAGAAGTTGT AAATCCCAAA GGCAATTTTT TTGTTCAGAA AATTGAATGG ATGAAAAATG CCGACGGTGA AGACATAGAC GTTGCCCCCC ACCCTCAAAT GACGGTATAT ATGCCGATGA AAGAGGATGT GGAAGAATTT GCAATGCTCA GGCGAAAAAG CAGTCCAAAT AAATAA
|
Protein sequence | MKKVELLAPA GNLEKLKMAV LYGADAVYLG GEEFSLRAYA ENFTLDELKA GVEFAHSKGK KVYVTINIFP HNDDLKKIPE YIKEVAGIGV DAIILSDPGI LSIVKEIAPD MEIHLSTQAN NTNFMSARFW HNHGVKRIIL ARELSLEEIR EIREKTPDSL ELEVFVHGAM CISYSGRCLL SNYMAGRDSN RGLCAHPCRW KYYLMEEKRP GEYYPVYENE RGTFIFNSRD LCMIEHIPEL VESGVSSFKI EGRMKSSFYV ATVVKAYREA IDAYYEDKDN YKFDPRLLEE VCKVSHREFT TGFFFNKPGP KDQIYATSSY IREYDFVGVV QKYDKATKIA TVEQRNRMYK GEEIEVVNPK GNFFVQKIEW MKNADGEDID VAPHPQMTVY MPMKEDVEEF AMLRRKSSPN K
|
| |