Gene CPR_0390 details


Gene Information

Locus tag: CPR_0390
Symbol:
ID: 4204073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 475454
End bp: 478411
Gene length: 2958 bp
Protein length: 985 aa
Translation table: 11
GC content: 29%
IMG OID: 642564947
Product: polysaccharide lyase family protein 8
Protein accession: YP_697719
Protein GI: 110801656
COG category:
COG ID:
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
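
The start, end, gene length and protein length fields in this record are mutually consistent. The following is a minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a coding sequence that ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record; function names are illustrative only.
START_BP = 475454
END_BP = 478411


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 2958 985
```

Both values match the gene length (2958 bp) and protein length (985 aa) listed in the record.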


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.918566
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT CTAGAAAAAA AATCAATAGT CTTATATCAA TGGCAATTGC AAGTTGTATG 
GTTATGGGAG TTTCATATGA GAATGTTTTG GCACTTGAAA ATAAGAGTCA AAATAATTCT
GAAAAGTTAG TGAAATATGA CAATGAAACT GAGTATATTA AAAATATTAG ACTAAGATGG
AAGGAGGATT TAGTAGGTAA TTCTTCTCTA GATACAAGTA ATGCTACAAT ATCTAAGAAG
ATAATAAGTT ATGTAAATAA TACAGATAAA TTAGTAGCAA AACTTAATAT GGATCCTAAG
GCTCAATGGC TTTGGGAGGA TTTAAAAGAT TACAAACAAA ATCCAGCCAG AATAACATCT
ATGTTCAATA ACTTAGTAAC AATGACTATG GCATATAGCT TACCTAATAA TAAGTATTAC
AAAAATGAAG ATTTGAAAAA TAAGATAATA TATTCCTTAG ACTGGATTAA TAAAAATGCT
TATAATGAGA ATATTGACCA GTATGGAAAC TGGTGGGATT GGATGATTGG AATACCAGCA
AGATTAAATA ATATTGTTGT TTTAATGTAT GATGATTTAA CTGAAGAACA AGTTAAAAAT
TACATGAATG CAATACAAAA GTTCTTACCT AGTATTGAGC CAGGAAGTAA ATATCATACA
GGAGCAAATT TAGCAGATGT ATGTATGAAT AAGTTATTGC AAGGTGTCAA TGAGAATGAT
CCAGAAAAAA TTAAGGAAGC ATCAGAGGAT ATAATTGGAG TTTTTGATTA TGTAACTAGC
GGAGATGGAT TCTATAAGGA TGGTTCATAC CTTCAACATG GAATGGTAGC ATACACAGGT
TCATATGGAA ATGTTCTTAT TGAGAAAATA TCTAATATAA TGTTTCTATT AGAAAAAACT
CCATGGTCAA TAAAATCTGA AAGTAAAGAT AACGTTTATA AGTGGATTTT TGATAGTTTC
AATCCAATTA TATATAAAGG ATACGTTATG GACATGGTTA GAGGAAGAGC AATATCAAGA
TATAATGCTA ATGGATACTT ACAAGCATCT GGAATTATTG AAGGTATGAT TAAAATTGGA
ATGATTTCTG ATGGAGATAA GGCTAGTGAG ATAAATTCTT TAGTTAAAAA ATGGGCTACA
GAAGCTAAGA GTGTATTAGA TTTTGGAACA AGATTTAAGT CAATTAATGT AATAGATGAA
TTCTATGGAA TTATGAATAA TGACAATATA AAACCTTTAG AAGAAGGTAA TAAGCATTAT
GCATTAAATA GTATGGACAA GACTGTTCAT AAAAGAGAGA ATTTCGCTTT AGGTATATCA
AGAAGTTCAA GTAGAATTAG TAAATATGAA TTCATGAACA AGGAAAATTT AACACCATGG
TTCCAAGGGG ATGGAATGAC TTATTTATTC AATAATGATT TAAATCAATT CTCAGGAAAT
TTTTGGGCTA CAGTAGATCC ATATAGAATG CCAGGTACAA CTGTAGACAC TAGAAAAAGA
GAACCAAAAG AAATATTACC AGGGTTAGAT CCAGGAGCAT CACAACAAAA TGAAATTTAT
TATGAATTAG GAAAGAGTAA TTGGTCTGGT GGAAGTAAGT TAGGAGCTTA CGGCGTAGCT
GGAATGGAAA TAGATAATAA GTACGATTCC TTAAAAGCTA AGAAATCTTG GTTTATGTTT
GATGATGAAA TAGTTGCCTT AGGTTCAGGA ATAACTAATC CAGAAGATTT TGAAACTGAA
ACAATAGTTG AAAATAGGAA GATAAAAAGT GATGGATCAA ATAAATTTAT AGTAGATGGA
AAAGAAAGAG TAAGTAAATT AAAAGAAAAA GATAAAGTTG ATAATGCAAA ATGGGCTTAC
TTAGAAGGAA ATGTAAGTGG ATCAAATATA GGATATTATT TCCCAGAGGG ATCAAATATT
AATTTAATAA AAGATGAAAG AGAAGGTAAT TGGATTAATG TAAACTCTTC TAAACCAGAA
GCAGATAAGG TGGTTAAAGA TAATTACTTA ACTATGTATA TAGATCATGG AAAAGCTATA
AAGAATCAAA AATATAGTTA CGTATTACTA CCAAATAAGA CTGAGGATAA GGTAAAAGAA
TATTCTGAGA ATCCAAATGT TGAAATTATT CAAAATGATG ATGTAGCTCA TAGTGTTAAG
CATAAAAAAT TAAATATTGA AGCAGCTAAC TTCTGGAAAG ATGGAAAAAA TACTGCTGGA
AATATAACAT CAACAGGAAA ATCATCTATA ATAATAAAAG AAAATAAGGA TAATACCTTA
AGCATAGCTG TGTCAGATCC AACTTTCTTA GAAAAAAAAC TTTCTGTAGA AATAAATAAA
CCAGCAATGG AAGTAATAAA ATCAGATGAA AGAATATCAA ATATAAATTT AGAAAATGGA
AAAATAAAAT TTGATGTAAA TACAGAAAAT CTTTCAGGGT CACCTTTAGA GCTTCTTGTA
AAATTAGGTA AAAAAAATAA TGGAGACAAT GAAAATAATA ATGAAATTAA AAATGAAGCT
CCTGTAATAG AAGGACAAGA TGCTAATTTA TTTGTAGGAG ATAAGTGGGA TAAATCTCTT
CACAAGCTTA AGGCAACAGA TAAGGAAGAT GGAGATTTAA CTAAAAATAT TAAGATTAAA
GATAATCAAA TTCCTTTAAA TGATCAATTT GAAGTTACAA AGCCTGGAAC ATATCCAGTT
ACTTTTGAAG TAAGTGATAA TAATGGGAAA AAAGCAGAGA AAAAGCTTAA TGTTTTAGTT
AAAGAAAAGG AAGAAAATAA GCCAGAAAAT AAACCGGAAA ATCAAGAGAA TAAACCAAAT
ATTAAACCAG AGGATCAAGA AAATAATAAT ACTGAGAAGC TACCTAACAC TGGAGGAGCA
AGTAGTCTAA GTCTTGCAGC AATAGGTGTT CTTCTAGCTA CTGTTGGAAC AATGTTTACT
AAGAAAAGAA AAAAATAA
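
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. It assumes the sequence contains only A, C, G and T, and the helper names are hypothetical. Only the first displayed line is used here as input; pasting the full sequence works the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGAAAAAAT CTAGAAAAAA AATCAATAGT CTTATATCAA TGGCAATTGC AAGTTGTATG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this fragment only, not the whole gene
```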
 
Protein sequence
MKKSRKKINS LISMAIASCM VMGVSYENVL ALENKSQNNS EKLVKYDNET EYIKNIRLRW 
KEDLVGNSSL DTSNATISKK IISYVNNTDK LVAKLNMDPK AQWLWEDLKD YKQNPARITS
MFNNLVTMTM AYSLPNNKYY KNEDLKNKII YSLDWINKNA YNENIDQYGN WWDWMIGIPA
RLNNIVVLMY DDLTEEQVKN YMNAIQKFLP SIEPGSKYHT GANLADVCMN KLLQGVNEND
PEKIKEASED IIGVFDYVTS GDGFYKDGSY LQHGMVAYTG SYGNVLIEKI SNIMFLLEKT
PWSIKSESKD NVYKWIFDSF NPIIYKGYVM DMVRGRAISR YNANGYLQAS GIIEGMIKIG
MISDGDKASE INSLVKKWAT EAKSVLDFGT RFKSINVIDE FYGIMNNDNI KPLEEGNKHY
ALNSMDKTVH KRENFALGIS RSSSRISKYE FMNKENLTPW FQGDGMTYLF NNDLNQFSGN
FWATVDPYRM PGTTVDTRKR EPKEILPGLD PGASQQNEIY YELGKSNWSG GSKLGAYGVA
GMEIDNKYDS LKAKKSWFMF DDEIVALGSG ITNPEDFETE TIVENRKIKS DGSNKFIVDG
KERVSKLKEK DKVDNAKWAY LEGNVSGSNI GYYFPEGSNI NLIKDEREGN WINVNSSKPE
ADKVVKDNYL TMYIDHGKAI KNQKYSYVLL PNKTEDKVKE YSENPNVEII QNDDVAHSVK
HKKLNIEAAN FWKDGKNTAG NITSTGKSSI IIKENKDNTL SIAVSDPTFL EKKLSVEINK
PAMEVIKSDE RISNINLENG KIKFDVNTEN LSGSPLELLV KLGKKNNGDN ENNNEIKNEA
PVIEGQDANL FVGDKWDKSL HKLKATDKED GDLTKNIKIK DNQIPLNDQF EVTKPGTYPV
TFEVSDNNGK KAEKKLNVLV KEKEENKPEN KPENQENKPN IKPEDQENNN TEKLPNTGGA
SSLSLAAIGV LLATVGTMFT KKRKK
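
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a hedged illustration, the sketch below translates the first 30 bases of the gene and compares the result with the first ten residues of the protein. It assumes that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons are permitted as starts. The codon table is built from the conventional TCAG ordering, and the function name `translate` is hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
gene_start = "ATGAAAAAAT CTAGAAAAAA AATCAATAGT"
print(translate(gene_start))  # MKKSRKKINS
```

The output matches the first block of the protein sequence, `MKKSRKKINS`.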