Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0592 |
Symbol | |
ID | 4206241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 706497 |
End bp | 709172 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642565152 |
Product | cell wall binding repeat-containing protein/mannosyl-glycoprotein endo-beta-N-acetylglucosamidase |
Protein accession | YP_697919 |
Protein GI | 110803679 |
COG category | [R] General function prediction only |
COG ID | [COG5263] FOG: Glucan-binding domain (YG repeat) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0288009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGAA AAACAGCTAC TATGTTAGCT ATATTTATAA CACTTTCAAC AAGTAATGTA TTTGCTATAG ATATTAAAAT TAATAATAAT TCTAAAAATA AACAAGAGAA AAGTGTTGAG AGTTACAATC AAAAAGCTTC AGAAAATAAT AAGGATAAAA TTAATGAAAA AAATGTTAAT GTTGATGAGG CGATAAAAGA AAAGGATACA AATATAGATA ATGACATAAT GAATGGTAAA GAGTCTGAGG AAAAAAAAGA TAATAATAAT CAAGACATAT ATGTAGAACC AAAAAATAAT TTAGATACAG AAAATACAAA CAATAATGAT AATTCGAAAG ATTGGAAAAA AGAAGTTGAA AAAAATAGAG ATGAAAATAA GATTAAAGCA GCTGATGAAG ATAATAAGGT TGAAAATAAG GATGATTTGG TTGGGAGTAT TTATGAAAAG AAAAGTATTA ATCTAAAGCC TAAATTTCAT CAATTTTTTA GTATAAATAA TAGTGAGAAC GAGCTAAAAT ATGAGCAAAT AAGAGATTCA AATGGACTTG AATCATTAGT ACGTTTACAT CCTAAAACAG ATAAGAAGTA TGAGATTGCT TTAGCTAAAG ATGATGGAAC CTTTGTTTAT ATAGGAAGTT ATGATGATTA TAATATTGCA AAAAAATCAA CAGATCATAG TAATTATAAT GTTCAGAGTT TTTCTGAAAC TGGTGGCATT CCTATTATTT TAAATTCAGA AGGAAAAGTA ATATATGCTG AAAAAGCATT AGGCAGGATA GTTGGGATTA AAGGACAAAG TACTATTAAT ATTTGTTCAG ATTCAGGATT AAAAAATGCG TTTACTTATA TAGCTGCTTC ATATGTTGAT GATGTTCCAA TAATAAGATA TGATGGAAAT GCTGTAGAGA TTGTTATTAA TGGATATAGG GGGTGGATAA GCAGAGAGGC TATAGATATT GTTCCTTTAA ATCAAGTAAA AAATCCAAGC AATTATGTTA ATAGATATGG TGAGTTAAGT CATTTTATTA GTGATAATTT AATGAGTAGT GAAGAATATT ATCGTTATCC TTTAGGTAAA TCACCAAGTT ATTTAAAAGA ATGGAAAAAA TATTATAATT ATGATGGTAA ATATTTTTAC ACAAGTTTAG AATTATTAAT AGATGATTTA CAAAATAATA CTCATAAAAA TGCTGTGAAT GTTAATGAGC CTTATTACAA CTATTACTAT TATTTACCCA TGAGAAGCAA GACAAGCTTT ACTGTTGATC AAATAAATAC GTATATTAGT AATAACTCAG CTTACAATAG TAAACTTAGA GATACAGGCG AATATTTTAT TGAGGCTCAA AACAAATATG GAGTAAATGC GTTAATTATG TTAGGAATAG CTATAAATGA GTCAGCATGG GGAACTTCAT ATTATGCAAG AAATAAAAAT AATTTATTTG GAATAGGAGC TGTTGATTCA AATCCAGATG ATGCATTTTA TTTTGAAAGT GTAGAGCAAT GTATAAATGA ATTTGCAAAA TATCAAATGT CTAGAGGATA TTCAGATCCA TATAATTGGT CTTATAATGG AGCGGCTCTT GGAAATAAAG GATTTGGAGC GAATATTCAA TATGCATCAG ATCCATTTTG GGGAGAAAAA GCCTCAAGTA ATTTCTATAA ATTAGATTAT CATGTTTCTG GAAATGGATT AGCAGGATTA ACTGAGTATG ATAAATATCA ACTAGGATTA TATATAGGCA ATAGTAAAGT AACAGATAAA TTTGGAAATT TATTATATAA TATAGGAAGT ATGGAGAGTA GTAAGGTCGG GAAAATTGGA ACAGCTGTAA TAATAAACGA TTTAAATAAA AAGGATATAA ATAATAGTTT AAACTATACT ATACAACCAG ATAGAACTAT TCCTATAAGC AATGGAAAAA TAAGTGGGCT TTATGATTGG AAAAATGGAT ATACTCCAAT TGATGGTGTG AAGCTAATAA ATCAAGGGAA AAATGTGACT AAGCCATCTG GTTGGATTAA TAAAGATGGT AAATTTTATT ATCAGCACTC AGATGGAACT TTAGCAAAAG GTTGGTTAGA TTTAAGTGGA TATTGGTATT ATTTAGATAA TACCACTGGA GAAATGAAAA CTGGACTTCA AGAAATAAAT GGATATAAAT ATTATCTTGA TGAAAGTGGT TATATGAGAA CTGGTTGGGT TAACTATAAA GATGAATATA GATTTTTTGG TGAAGATGGT ACTATGAAAA CAGGATGGAT AAATGATGGT TGGACAGATT ATTATTTAAA ACCGGATGGA ACAATCTATA AAGGATGGTT AGATGATGGC TTAAATAAAT ATTATATGGA TGAAAATGGT CAGATGAGAA AAGGTTGGGT CAAGTATAAT GGAGAATATT ATTTCTTTGG ACCAGATGGG GCAAGGAGAA CTGGATGGAT AAATGATGGA TATGCTTATT ATTTCTTAAA AAAAGATGGA ACTATTCATA CTGGTTGGCT AAAGGAAAAT GGTCAAAAAT ACTATTTAGG TTCTGAAGGC GATATGAAAT TAGGTTGGTA TAATATTGAT AATAATTGGT ATTATTTTGA TAATTCAGGA GCTATGATGA CAAATACTAT AATAGATGGT TGGGAAATAG ATGGTAATGG AATTGCAAGA AAATAA
|
Protein sequence | MIRKTATMLA IFITLSTSNV FAIDIKINNN SKNKQEKSVE SYNQKASENN KDKINEKNVN VDEAIKEKDT NIDNDIMNGK ESEEKKDNNN QDIYVEPKNN LDTENTNNND NSKDWKKEVE KNRDENKIKA ADEDNKVENK DDLVGSIYEK KSINLKPKFH QFFSINNSEN ELKYEQIRDS NGLESLVRLH PKTDKKYEIA LAKDDGTFVY IGSYDDYNIA KKSTDHSNYN VQSFSETGGI PIILNSEGKV IYAEKALGRI VGIKGQSTIN ICSDSGLKNA FTYIAASYVD DVPIIRYDGN AVEIVINGYR GWISREAIDI VPLNQVKNPS NYVNRYGELS HFISDNLMSS EEYYRYPLGK SPSYLKEWKK YYNYDGKYFY TSLELLIDDL QNNTHKNAVN VNEPYYNYYY YLPMRSKTSF TVDQINTYIS NNSAYNSKLR DTGEYFIEAQ NKYGVNALIM LGIAINESAW GTSYYARNKN NLFGIGAVDS NPDDAFYFES VEQCINEFAK YQMSRGYSDP YNWSYNGAAL GNKGFGANIQ YASDPFWGEK ASSNFYKLDY HVSGNGLAGL TEYDKYQLGL YIGNSKVTDK FGNLLYNIGS MESSKVGKIG TAVIINDLNK KDINNSLNYT IQPDRTIPIS NGKISGLYDW KNGYTPIDGV KLINQGKNVT KPSGWINKDG KFYYQHSDGT LAKGWLDLSG YWYYLDNTTG EMKTGLQEIN GYKYYLDESG YMRTGWVNYK DEYRFFGEDG TMKTGWINDG WTDYYLKPDG TIYKGWLDDG LNKYYMDENG QMRKGWVKYN GEYYFFGPDG ARRTGWINDG YAYYFLKKDG TIHTGWLKEN GQKYYLGSEG DMKLGWYNID NNWYYFDNSG AMMTNTIIDG WEIDGNGIAR K
|
| |