Gene Information |
Locus tag | CPR_0390 |
Symbol | |
ID | 4204073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 475454 |
End bp | 478411 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642564947 |
Product | polysaccharide lyase family protein 8 |
Protein accession | YP_697719 |
Protein GI | 110801656 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
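The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, though they are consistent with the values reported in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 475454
end_bp = 478411

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends
# with a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported gene length of 2958 bp and protein length of 985 aa.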
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.918566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CTAGAAAAAA AATCAATAGT CTTATATCAA TGGCAATTGC AAGTTGTATG GTTATGGGAG TTTCATATGA GAATGTTTTG GCACTTGAAA ATAAGAGTCA AAATAATTCT GAAAAGTTAG TGAAATATGA CAATGAAACT GAGTATATTA AAAATATTAG ACTAAGATGG AAGGAGGATT TAGTAGGTAA TTCTTCTCTA GATACAAGTA ATGCTACAAT ATCTAAGAAG ATAATAAGTT ATGTAAATAA TACAGATAAA TTAGTAGCAA AACTTAATAT GGATCCTAAG GCTCAATGGC TTTGGGAGGA TTTAAAAGAT TACAAACAAA ATCCAGCCAG AATAACATCT ATGTTCAATA ACTTAGTAAC AATGACTATG GCATATAGCT TACCTAATAA TAAGTATTAC AAAAATGAAG ATTTGAAAAA TAAGATAATA TATTCCTTAG ACTGGATTAA TAAAAATGCT TATAATGAGA ATATTGACCA GTATGGAAAC TGGTGGGATT GGATGATTGG AATACCAGCA AGATTAAATA ATATTGTTGT TTTAATGTAT GATGATTTAA CTGAAGAACA AGTTAAAAAT TACATGAATG CAATACAAAA GTTCTTACCT AGTATTGAGC CAGGAAGTAA ATATCATACA GGAGCAAATT TAGCAGATGT ATGTATGAAT AAGTTATTGC AAGGTGTCAA TGAGAATGAT CCAGAAAAAA TTAAGGAAGC ATCAGAGGAT ATAATTGGAG TTTTTGATTA TGTAACTAGC GGAGATGGAT TCTATAAGGA TGGTTCATAC CTTCAACATG GAATGGTAGC ATACACAGGT TCATATGGAA ATGTTCTTAT TGAGAAAATA TCTAATATAA TGTTTCTATT AGAAAAAACT CCATGGTCAA TAAAATCTGA AAGTAAAGAT AACGTTTATA AGTGGATTTT TGATAGTTTC AATCCAATTA TATATAAAGG ATACGTTATG GACATGGTTA GAGGAAGAGC AATATCAAGA TATAATGCTA ATGGATACTT ACAAGCATCT GGAATTATTG AAGGTATGAT TAAAATTGGA ATGATTTCTG ATGGAGATAA GGCTAGTGAG ATAAATTCTT TAGTTAAAAA ATGGGCTACA GAAGCTAAGA GTGTATTAGA TTTTGGAACA AGATTTAAGT CAATTAATGT AATAGATGAA TTCTATGGAA TTATGAATAA TGACAATATA AAACCTTTAG AAGAAGGTAA TAAGCATTAT GCATTAAATA GTATGGACAA GACTGTTCAT AAAAGAGAGA ATTTCGCTTT AGGTATATCA AGAAGTTCAA GTAGAATTAG TAAATATGAA TTCATGAACA AGGAAAATTT AACACCATGG TTCCAAGGGG ATGGAATGAC TTATTTATTC AATAATGATT TAAATCAATT CTCAGGAAAT TTTTGGGCTA CAGTAGATCC ATATAGAATG CCAGGTACAA CTGTAGACAC TAGAAAAAGA GAACCAAAAG AAATATTACC AGGGTTAGAT CCAGGAGCAT CACAACAAAA TGAAATTTAT TATGAATTAG GAAAGAGTAA TTGGTCTGGT GGAAGTAAGT TAGGAGCTTA CGGCGTAGCT GGAATGGAAA TAGATAATAA GTACGATTCC TTAAAAGCTA AGAAATCTTG GTTTATGTTT GATGATGAAA TAGTTGCCTT AGGTTCAGGA ATAACTAATC CAGAAGATTT TGAAACTGAA ACAATAGTTG AAAATAGGAA GATAAAAAGT GATGGATCAA ATAAATTTAT AGTAGATGGA 
AAAGAAAGAG TAAGTAAATT AAAAGAAAAA GATAAAGTTG ATAATGCAAA ATGGGCTTAC TTAGAAGGAA ATGTAAGTGG ATCAAATATA GGATATTATT TCCCAGAGGG ATCAAATATT AATTTAATAA AAGATGAAAG AGAAGGTAAT TGGATTAATG TAAACTCTTC TAAACCAGAA GCAGATAAGG TGGTTAAAGA TAATTACTTA ACTATGTATA TAGATCATGG AAAAGCTATA AAGAATCAAA AATATAGTTA CGTATTACTA CCAAATAAGA CTGAGGATAA GGTAAAAGAA TATTCTGAGA ATCCAAATGT TGAAATTATT CAAAATGATG ATGTAGCTCA TAGTGTTAAG CATAAAAAAT TAAATATTGA AGCAGCTAAC TTCTGGAAAG ATGGAAAAAA TACTGCTGGA AATATAACAT CAACAGGAAA ATCATCTATA ATAATAAAAG AAAATAAGGA TAATACCTTA AGCATAGCTG TGTCAGATCC AACTTTCTTA GAAAAAAAAC TTTCTGTAGA AATAAATAAA CCAGCAATGG AAGTAATAAA ATCAGATGAA AGAATATCAA ATATAAATTT AGAAAATGGA AAAATAAAAT TTGATGTAAA TACAGAAAAT CTTTCAGGGT CACCTTTAGA GCTTCTTGTA AAATTAGGTA AAAAAAATAA TGGAGACAAT GAAAATAATA ATGAAATTAA AAATGAAGCT CCTGTAATAG AAGGACAAGA TGCTAATTTA TTTGTAGGAG ATAAGTGGGA TAAATCTCTT CACAAGCTTA AGGCAACAGA TAAGGAAGAT GGAGATTTAA CTAAAAATAT TAAGATTAAA GATAATCAAA TTCCTTTAAA TGATCAATTT GAAGTTACAA AGCCTGGAAC ATATCCAGTT ACTTTTGAAG TAAGTGATAA TAATGGGAAA AAAGCAGAGA AAAAGCTTAA TGTTTTAGTT AAAGAAAAGG AAGAAAATAA GCCAGAAAAT AAACCGGAAA ATCAAGAGAA TAAACCAAAT ATTAAACCAG AGGATCAAGA AAATAATAAT ACTGAGAAGC TACCTAACAC TGGAGGAGCA AGTAGTCTAA GTCTTGCAGC AATAGGTGTT CTTCTAGCTA CTGTTGGAAC AATGTTTACT AAGAAAAGAA AAAAATAA
|
Protein sequence | MKKSRKKINS LISMAIASCM VMGVSYENVL ALENKSQNNS EKLVKYDNET EYIKNIRLRW KEDLVGNSSL DTSNATISKK IISYVNNTDK LVAKLNMDPK AQWLWEDLKD YKQNPARITS MFNNLVTMTM AYSLPNNKYY KNEDLKNKII YSLDWINKNA YNENIDQYGN WWDWMIGIPA RLNNIVVLMY DDLTEEQVKN YMNAIQKFLP SIEPGSKYHT GANLADVCMN KLLQGVNEND PEKIKEASED IIGVFDYVTS GDGFYKDGSY LQHGMVAYTG SYGNVLIEKI SNIMFLLEKT PWSIKSESKD NVYKWIFDSF NPIIYKGYVM DMVRGRAISR YNANGYLQAS GIIEGMIKIG MISDGDKASE INSLVKKWAT EAKSVLDFGT RFKSINVIDE FYGIMNNDNI KPLEEGNKHY ALNSMDKTVH KRENFALGIS RSSSRISKYE FMNKENLTPW FQGDGMTYLF NNDLNQFSGN FWATVDPYRM PGTTVDTRKR EPKEILPGLD PGASQQNEIY YELGKSNWSG GSKLGAYGVA GMEIDNKYDS LKAKKSWFMF DDEIVALGSG ITNPEDFETE TIVENRKIKS DGSNKFIVDG KERVSKLKEK DKVDNAKWAY LEGNVSGSNI GYYFPEGSNI NLIKDEREGN WINVNSSKPE ADKVVKDNYL TMYIDHGKAI KNQKYSYVLL PNKTEDKVKE YSENPNVEII QNDDVAHSVK HKKLNIEAAN FWKDGKNTAG NITSTGKSSI IIKENKDNTL SIAVSDPTFL EKKLSVEINK PAMEVIKSDE RISNINLENG KIKFDVNTEN LSGSPLELLV KLGKKNNGDN ENNNEIKNEA PVIEGQDANL FVGDKWDKSL HKLKATDKED GDLTKNIKIK DNQIPLNDQF EVTKPGTYPV TFEVSDNNGK KAEKKLNVLV KEKEENKPEN KPENQENKPN IKPEDQENNN TEKLPNTGGA SSLSLAAIGV LLATVGTMFT KKRKK
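As a rough consistency check between the two sequences, the first 30 bases of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. The sketch below is a minimal hand-rolled translator, not the database's own method. It assumes the amino acid assignments of translation table 11 match those of the standard genetic code (to my knowledge the two tables differ only in their permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table built in TCAG order. Assumption: translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces used to group the sequence into blocks of ten.
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the
# protein sequence, copied from the record above.
gene_prefix = "ATGAAAAAAT CTAGAAAAAA AATCAATAGT"
protein_prefix = "MKKSRKKINS"

print(translate(gene_prefix) == protein_prefix)
```

The same function could be applied to the full gene sequence to compare against the whole 985-residue protein, again under the stated assumption about table 11.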