Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0852 |
Symbol | |
ID | 4810470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1026968 |
End bp | 1027813 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106268 |
Product | type 4 prepilin peptidase 1. Aspartic peptidase. MEROPS family A24A |
Protein accession | YP_001037279 |
Protein GI | 125973369 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000579083 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGACTT CATTAAATCT TGGACTGTTG TTTGAAACGG GATTTACGGT TTTTTGTTAC ATTTCAGTGG CCTTGCTTGG GCTTCTGGTC GGTTCATTTT TAAATGTCTG TATCTACAGA ATACCCAACG ATGAATCTGT CGTCAGGCCT CGATCCCACT GCATGAAATG CGGACATACG CTTGGTGCGT TGGATTTAGT TCCTGTTTTC AGTTATCTGT TTTTAAAAGG CAGATGCAGA TACTGCGGCG AAAAGATTTC TCCAAGATAT GCTTTGGTTG AACTTTTAAC GTCCGTTGTT TATTTGCTTT TGTTCTGGAA GTATGGGTTG TCGGTGGATT TTTTGGCATC TGCTTATCTT ATGTCAGTAC TTATAGCCGT ATTCTTTATT GATTTGGATC ATATGATTAT TCCCAACAAG CTTGTAGTAG CGGCTCTGGT TGGAGGAGTT CTGCCTTTTG TTTATAATAT TTTCAGGCCC ATGGATATTT ATGTGGATCG CAAATGGTGG AATCCTCTGT TGGGTGCGTT TATAGGATTT GGCTTTTTGC TTCTGGTGGC AATTGTAGGT TATTTGGTTT ATAAAACGGA TGAGGCAATG GGCGGCGGCG ATGTCAAGCT GTTTGCTCCG ATAGGCCTTT TCCTGGGCTG GAAAATGACT ATAGTGGCAC TTTTTATATC CTTTGTGTCA GCCGGAATTG TAAGTATTGT ATTACTTTTG CTTAAGAAGA AGGAGAGAAG GAGTACGTTT GTTTTCGGTC CTTTTATTGT AATGGGTACT TTTTTCACTT ATCTTTTTGG CTGGGAGCTA TTGGAATGGT ACCTGAGCAC TTTGCTTCAT GTATGA
|
Protein sequence | MGTSLNLGLL FETGFTVFCY ISVALLGLLV GSFLNVCIYR IPNDESVVRP RSHCMKCGHT LGALDLVPVF SYLFLKGRCR YCGEKISPRY ALVELLTSVV YLLLFWKYGL SVDFLASAYL MSVLIAVFFI DLDHMIIPNK LVVAALVGGV LPFVYNIFRP MDIYVDRKWW NPLLGAFIGF GFLLLVAIVG YLVYKTDEAM GGGDVKLFAP IGLFLGWKMT IVALFISFVS AGIVSIVLLL LKKKERRSTF VFGPFIVMGT FFTYLFGWEL LEWYLSTLLH V
|
| |