Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0593 |
Symbol | |
ID | 4205927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 709292 |
End bp | 710950 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642565153 |
Product | cell wall binding repeat-containing protein |
Protein accession | YP_697920 |
Protein GI | 110803987 |
COG category | [R] General function prediction only |
COG ID | [COG5263] FOG: Glucan-binding domain (YG repeat) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000201841 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA TATTTATTAC AAGTTTGACT AGCCTTATTA TTGTTAGTCT AATTCTAACA ACAGGAACTA TAAGTGTACA AGCTGCAGAA AAAAATGAAA ATAAAATTAA TCAAAATCTT AGTATTAAGA ATGGAAATGC TGATTTTATA TCATCTGTTG ATATAGAAAG ATTAGATAAT AGCAAGATTG CTATAAAAAT AAATACTAAC AGCAATTATA ATGGAAATTC TTACAATATA TCAGTTAGAG GACCTGGAAG TATAAATAAT ATATCTACGA ATGTTCCATA TCATGAGTTG GATTTAACTA ATTATGGTAC ATATAATATA TATATTACAG TAACTGATAT ATATGGTATT AGTGATAGTT ATTTTAAGAA CTATCACTAT TCATCACCTA TACAAGCAAG ATTTTTTGAA ACAGTTGACA AAGCGACTAT AGGAGATGAG ATTAAATTTA ATGTTATTTC TTCAGGTGGA AATGGAAGTC ATAAATACAA ATTTTATACT AAAGATGGTG TAATAAAAGA ATATTCATAT AATTCTAGCT TAGTTACTAA ATTTAATGAA TACGGAGAAA AAGAATTGTT CTGTGATATA AAAGATGAAG ATGACACAGT AAAAACTATT TCTCATAAGA TAAAAGTTAT AGCAAATGAA CCAGCCTGGA AAAATGAAAA TAATAAATGG TATTATGTTA ATGATAAAGG TGAATTTATT AAAGGATGGC TTAATTTAAA TAATGTTTGG TATTATTTAG ATGGTGAAAC TGGAGAAATG AAAACAGGAC TTCAGGATAT AGGTGGATAT AGATATTATT TTGATGAAAG TGGTTATATG AAAACTGGAT GGATTAATTA TAATGGAGAG TATAGGTTTT TTGGTTCTGA TGGAGCAATG AGAACAGGAT GGGTAAATGA TGGGTGGACA GATTACTATT TAAAATCAGA TGGAACAATT TATAAAGGGT GGTTAGATGA TGGATTAAAT AAATATTATA TGGATGAAAA TGGCCAAATG AGAAAAGGGT GGATTAACTA TAACGGAGAA TATTATTTCT TTGGACCTGA TGGAGCAATG AGAACAGGAT GGATAAATGA TGGTTGGACA GATTATTATT TAAAACCAGA TGGAACAATC TTTAAAGGTT GGTTAGATGA TGGATTAAAT AAATATTATA TGGATGAAAA TGGCCAAATG AGAAAAGGCT GGGTCAAACA TAACGGAGAA TATTATTTCT TTGGACCTGA TGGAGCAATG AGAACAGGAT GGATAAATGA TGGATATGCG TATTATTTCT TAAATAATAA TGGTACAGTA AAAAAAGGAT GGTTTGATGA AAATGGCATA AGATATTATT TAGGGTCAGA TGGAGCTATG AGAACTGGTT GGCAAGTAAT AGGTGATAAT TGGTATCATT TTAATAATTC TGGAGCAATG AGTAGAAGTA CAAGTATAGA TGGGTGGAAA ATAGATAAAG AAGGAATAGC AACACCTATT AAAATTAATT CAACTGTATA TGTAACACCA AATGGAACAA GTTATCATTA TAGTAGAGAT TGTACAACAT TAAAAAGAAG TCATCAAATA TTAAGTATGA GTCTAGATGA GGCTAAAGCT AGTGGCAAAA ATGATCCTTG TAATGTATGT GTTAAATAA
|
Protein sequence | MKRIFITSLT SLIIVSLILT TGTISVQAAE KNENKINQNL SIKNGNADFI SSVDIERLDN SKIAIKINTN SNYNGNSYNI SVRGPGSINN ISTNVPYHEL DLTNYGTYNI YITVTDIYGI SDSYFKNYHY SSPIQARFFE TVDKATIGDE IKFNVISSGG NGSHKYKFYT KDGVIKEYSY NSSLVTKFNE YGEKELFCDI KDEDDTVKTI SHKIKVIANE PAWKNENNKW YYVNDKGEFI KGWLNLNNVW YYLDGETGEM KTGLQDIGGY RYYFDESGYM KTGWINYNGE YRFFGSDGAM RTGWVNDGWT DYYLKSDGTI YKGWLDDGLN KYYMDENGQM RKGWINYNGE YYFFGPDGAM RTGWINDGWT DYYLKPDGTI FKGWLDDGLN KYYMDENGQM RKGWVKHNGE YYFFGPDGAM RTGWINDGYA YYFLNNNGTV KKGWFDENGI RYYLGSDGAM RTGWQVIGDN WYHFNNSGAM SRSTSIDGWK IDKEGIATPI KINSTVYVTP NGTSYHYSRD CTTLKRSHQI LSMSLDEAKA SGKNDPCNVC VK
|
| |