Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2878 |
Symbol | |
ID | 4809085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3403324 |
End bp | 3404217 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640108297 |
Product | peptidase C39, bacteriocin processing |
Protein accession | YP_001039269 |
Protein GI | 125975359 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0648351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTTA ACCGTGCAAT ATTTCCGGCT GTAATAATCG GCATTATACT TTTTATTATA GGGCAAAGAA CCTCAAGGGC GGTGAGCGGA AAACCCGTCA GGATTCTTTT GGGTTTTGTG TTCATCCTCC TTTGTATTCC AAACTTTCTG TTTGGGGCAT ATTACATGCA TTTTATAAAT GAACCTGTAT GGTATATTAA ATTTAGGGCG ATTGACAATA TTGAGTTGTT ATCATTCCTG ACAGGTCTTT TCTTTGGATT TGTAACTTTC AAAGATGAAA GCTTTGGGAA AATTAAGTTA AAAGTTTTTA ATAAATATAT ATTCGTTATC TGCATGCTTT TAATTATAGT ACCTTTTATT AAACCGATTA TAAGACCTGT AAGCCGGAAT TCTCAGTTTC AAGACCGATG GAAAGATGGG GTGTGTTTGC AATCTACAGG TTCAACTTGC GGTCCTGCAG CATTAGCGAC AATTTTTAAT TATTATGGAA TATCAAAAAG TGAAGAAGAG ATTGCAAGGG CAGCCTTTTC CAGTTCAAGC GGTACGGAGA ATTGGTATTT AATCAGGTAT GCGGAAAAGA ACGGACTCGA AACTGAAGTT TCATACAAAA ACAGTTTGAA CGATGTACCG GTACCGTCAA TAATCGGAAC TTATATTAAC GGCAGGATAG GCCACTTTAT AACGATTTTA GGCAAAGAAG GAGACCATTT TGTTGTCGGA GATTCTTTAA AAGGCAGATT ACTGTTAACC GAAAAGGAGT TTAATAAATA CTATACTTTT TCAAATTTTG TTTTGCACAT AAAAAAGCCG GAGGATTTAT ATTCTTCAAA GATGAAGCCC GGGAATGGAG AGACAGAAGG CAACTTCGAT ACGGAAAATA CGGATGATAC TTAA
|
Protein sequence | MYFNRAIFPA VIIGIILFII GQRTSRAVSG KPVRILLGFV FILLCIPNFL FGAYYMHFIN EPVWYIKFRA IDNIELLSFL TGLFFGFVTF KDESFGKIKL KVFNKYIFVI CMLLIIVPFI KPIIRPVSRN SQFQDRWKDG VCLQSTGSTC GPAALATIFN YYGISKSEEE IARAAFSSSS GTENWYLIRY AEKNGLETEV SYKNSLNDVP VPSIIGTYIN GRIGHFITIL GKEGDHFVVG DSLKGRLLLT EKEFNKYYTF SNFVLHIKKP EDLYSSKMKP GNGETEGNFD TENTDDT
|
| |