Gene Information |
Locus tag | Ccel_1474 |
Symbol | |
ID | 7310243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1789155 |
End bp | 1790375 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608400 |
Product | peptidase U32 |
Protein accession | YP_002505808 |
Protein GI | 220928899 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
| |
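The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The record states neither convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1789155, 1790375)
print(length, protein_length(length))  # prints "1221 406"
```

Under those assumptions, the result agrees with the Gene Length (1221 bp) and Protein Length (406 aa) rows.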
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00296183 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TTGAGCTTCT GGCTCCGGCC GGAAATCTTG AAAAACTAAA AATGGCAATT ATGTATGGTG CAGACGCAGT ATATATTGGA GGACAAAAAT TCGGCTTAAG GGCGTCTGCT GATAATTTCT CACTTGAGGA CATAAAGGCG GGACTTGTAT TTGCACACGA TCGGGAGTGT AAGGTCTATG TGACCGTTAA CATAATTCCT CATAATGAAG ATCTTGTGGG ATTGCCCGAA TATATTAGAC AGCTTGATGA ACTGGGAGTG GATGCTCTTA TAGTATCTGA CCCCGGAATT TTTGATATTG TACGTGAAAA CGCTCCTGAT ATGGAAATAC ATGTGAGTAC TCAGGCTAAT AATACAAATT ATGCAAGTGC AATGTTCTGG TACAGGCATG GTGCCAAGCG AATTGTTACT GCTAGGGAGC TTTCTCTTAT AGAAATAAGT GAAATTAAGG AAAAAATCCC GGATGACTTG GATTTAGAGG CCTTTATCCA TGGTGCGATG TGTATTTCAT ATTCGGGAAG GTGCCTGCTC AGCAATTACA TGGCAGGAAG AGATTCAAAC AGAGGAGCCT GTTCCCATCC GTGCAGATGG AAGTACCATC TGGTTGAAGA AAAGCGTCCA GGAGAGTACT ACCCTGTTTA TGAGGATGAA AAGGGGACAT ATATATACAA TTCAAAGGAT TTATGCACCA TTGAGTATAT CCCCGAACTT GTGAAAGCAG GTATCTACAG TTTTAAAATA GAGGGCAGGA TGAAAAGCTC CTTTTACGTA GCCACCGTTG TAAGTGCCTA TAGACAGGCT ATCGATGCAT ATATGGCCGA TCCTGAAAAC TACAGGTTTG ACCCGGAATG GTTAAAAGAG TTGTCAAAGG CAAGTCACAG GGAGTATACA ACGGGATTTT ATTTTAATAA AACTAGCGGT GCCGATCAGA TATACAATAC AAGCTCCTAT ATCAGAGAAT ATGACTTTGT GGGCGTGGTT CTGGAGTATG ACAAAGAAAC TGGTATTGCT AAGATTGAGC AGAGAAACCG TATGATTGTG GGAGATGAAA TAGAGGTGGT AAGCCCCCGT AAAGGCTATT TTACCCAAAC TATTAAAGAG ATGAAGAATG AGGATGGAGA AAGCATCAGA ACTGCACCTC ATGCACAGAT GATCGTATAT ATGCCTATGG ATCAGGAAGT GGGGCCTTAT AGTATTTTGA GAAGGAAGTA G
Protein sequence | MKKVELLAPA GNLEKLKMAI MYGADAVYIG GQKFGLRASA DNFSLEDIKA GLVFAHDREC KVYVTVNIIP HNEDLVGLPE YIRQLDELGV DALIVSDPGI FDIVRENAPD MEIHVSTQAN NTNYASAMFW YRHGAKRIVT ARELSLIEIS EIKEKIPDDL DLEAFIHGAM CISYSGRCLL SNYMAGRDSN RGACSHPCRW KYHLVEEKRP GEYYPVYEDE KGTYIYNSKD LCTIEYIPEL VKAGIYSFKI EGRMKSSFYV ATVVSAYRQA IDAYMADPEN YRFDPEWLKE LSKASHREYT TGFYFNKTSG ADQIYNTSSY IREYDFVGVV LEYDKETGIA KIEQRNRMIV GDEIEVVSPR KGYFTQTIKE MKNEDGESIR TAPHAQMIVY MPMDQEVGPY SILRRK
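The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below strips the whitespace and computes a GC percentage as the share of G and C bases in the whole sequence. That definition is an assumption, since the record does not say how its GC content figure was derived, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases, assuming GC% = (G + C) / total length."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The first two blocks of the gene sequence, used here only as sample input.
sample = "ATGAAAAAGG TTGAGCTTCT"
print(len(clean_sequence(sample)), gc_percent(sample))
```

Passing the full gene sequence string to `gc_percent` gives a value that can be compared with the GC content row.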