Gene CPR_1558 details


Gene Information

Locus tag: CPR_1558
Symbol:
ID: 4206251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1747187
End bp: 1750381
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 29%
IMG OID: 642566109
Product: pullulanase precursor
Protein accession: YP_698874
Protein GI: 110803650
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases
TIGRFAM ID: [TIGR02104] pullulanase, type I
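
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 1747187
end_bp = 1750381

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3195 1064
```

Under these assumptions the computed values agree with the listed Gene Length (3195 bp) and Protein Length (1064 aa).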


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.430377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGATT CTGTAAAAAG AAAACTTGTT TGTTATGATG ATAATGAATA TTACATAGAG 
ATTAGTCCTT TAACTAATAC CATAAGAGCA ACCTTATGTT GTAAGGGGGC AATTGAGGCT
TATTTAGTTG GAGATTTTAA TAATTGGGAA AAAAGCGAAA ATTTTAAGCT TAATTGGGGA
TTGGATACTA ATGATGGTAG AGTAAAAATG ATTAAGGATA TTACCTTTCC TTGTGGACTA
AAGAAGGGAG AGTATAAGTA TGGATATATA ATTATTACTC TAGATGGAAA AGAAATATAT
GTAGATAACT TAAATGAAGA GAAAAATGAT TTTTATTTCC TTTGGGAGCC CTTTGAAGAA
TCCTTAGAAA TAAAAAGTTC AAGAGATTTT ATATCTACTA GATACCCAGT TGAATTAATA
GCTGTTAAAA ATTCATTTTA TGGAAATATA ACCATTGAAG AGGAGGTTCT TTCAATAGAG
AATTCTTTAA AGGGAGTAAC TTTAGAAAAT GGTTTTTTAA AGATTAATGA GGAAGTAGCC
TCTGGCACTA AAATTATAGT AAAGGCCTAT GATTCTTTAA ATGACATGAC AGCTTATAAG
GAAATTGAGG TTAGAGAAGA CGGGGAAAAA GGGACTTTTG TACAATTTTT AAAGAATGAT
GGATTATATT ATGGAGAAAA TTTCAGTTGG AATATTTGGG GCTTTGGAGA AAATAGCTCA
GGTAAAGAGT TTAATTTAAA TATTAAAACA GATTTAGGTG TAGGAGCTTT TGTAGATGAA
GAGAAATTTA TAGTAAGAAG GAGAACTTGG GGATATGACT GGGTAAATGA TTGGGATGAG
CAGACATATA CTTTTTATTT AGGTAAGGAA GATAGAAATT TATTTTGCAT ATATGAAAAT
AATAAGATAT TAACTTCTTT AAAGACTGCT ATAGAAGAAA CTACTCCTAA GATACAAGTT
GCCTTAATGG ATCATAAAAA TACTATAAAA GCATATCTTT CCCATAAGCC TTTAATTGGA
GTTAAGTATG GATTGTATAT AAACGGAATA AAGGCTAAGG GAGTTTCAAC CCTAGTTAGA
GAAAATGAAA AAGAAGTTTT AATAACTAAT CTTCCAAGTG ATATAGATCC ATCAGATTTA
TTAGAAGTAA GAGCTTCTAG TATGTTTTCA AGTTGCAAAG TAATAGTGAG AGATTATTTA
AATGATTATT ATTATGGAGG AAATGACCTA GGGGTAAGAT TTAGTGAAGA AAATATAAGC
TTAAGGTTGT GGGCACCTAC TGCTAAAAAA GTAGAATTAT TAATATATGA AGATTATAAA
TCTCTAAGGG AAAATCCTTT AAGAAAATAT GATATGAATA GAGAAAAAGA AAATGGAACT
CATTTAATTA AAGTACCAAG AAAAGAAAAT GAGGGTAAAT ATTATTTATA TAGATTATAC
TTTAATGGTT TAAGCAAAAA AGGTAAACAC GTAAATAAGA TAACCTATGC TGTAGACCCT
TATGCTGTTT CTGTAGGAGT AAATGGAGAA AAGGGGGCTT TAGTAGATTT ATTTAATAAA
GAGTGTGTGC CAGAAGGATG GAATAAATAT AATAAACCTA AGTTAATAAA TAAAGAAGAT
TCAATAATTT ATGAGATGCA TATTAGAGAT TTTACCATAA ATGAAAATTC AGGAGTTTCT
GAAAGCTTAA GAGGAAAATT TTTAGGGACT GTTGAAGAGG GAACTTTTTA TATTGATAAA
GAAAGTGGAA ACAAGGTTAA AACAGGATTA GATCATTTAA AGGAATTAGG GATAACCCAT
GTTCATTTGC TTCCAGTCTT TGATTTTGCC TCTGTTAATG AAGAAATAAC TAAGGATGAA
AACAATAGAA ACTGGGGATA TGATCCTAAA AACTTTAATG CCATAGAAGG AAGTTATTCA
ACAGACCCAT ATACACCTTC AAGAAGAATA ATAGAGTTTA GAGAAATGAT AAAAAAGTTT
CATGATAATG GAATAAGAGT TATATTAGAT ATGGTTTATA ATCATATGTA TGAAACTAGT
AATATGGATA ATATAGTTCC TTTATATTAC TTTAGAAGTG ATAAACTAGG AAAATATACA
AATGGATCTG GCTGTGGAAA TGAAATGGCC TCTGAAAAGC CTATGGTTAG GAAATTTATT
TTAGATTCCA TATTACATTG GATTAAAAAT TATCATATAG ATGGATTAAG ATTTGACTTA
ATGGAGCTTA TAGATTTAGA TACTATGAAG GAAATAGTAA AAAGAAGTGA AGAGATTGAT
GAAAAAATAC TTATTTATGG GGAGCCTTGG AAAGGTGGAG ATTCTCCTTT AAGTAATGGA
ACTTACAAGG GAAGTCAAAA AGGTCTTGGA TTTTCCATAT TTAATGATGA CTTTAGAAAT
GCTCTAAGAG GAAATAATGA CCCATCTAAT GGATTTATAA ATGGAGAACA GCACAATAAA
AATAAGGTTT GGCAAGTTAT AGAAGGAATT AAGGGATCCA TAAATTCAAT AACCTATAAA
CCTATGGAAA GTATAAATTA TTTAGAAGCA CATGATAACT ATACCCTATG GGATCAAATA
GTAAAAAGTC AAAATCATAG TGTGGAAAAG GGACATTATA GAGATTTTAA TGAAGAAAAT
ATATTAGATA ATCTTTATGT TAAGGAAGAT TTACTAGGAG CTTCTATATT ATTTACTTCT
CAAGGAATTC CCTTTATTCA ATCTGGAGCT GAATTTTTAA GATCTAAAGA TGGAGATCAT
AATAGCTATA AGAGTCCAGA CAGTATAAAT GCTTTAAATT GGGCAGAGAA AGAAAAGTAT
ATTGATGTTT TTAATTATTA TAAAGATTTA ATAAGTTTAA GAAAAAATCA CAGTGCTTTT
AGAATGAAAA ATCCAGAGGA TATAATAGGA AATTTAGAAG TTTATTTTTA TGACAATAAT
GATACTAGCG GAGTAATAAT AGCTCACTAC AAAAATAATG CTAATGAAGA TTTATGGAAA
GATATAGTTG TCATATATAA TGGAACTACC ATTGATGATT ATAATGTAAT TTCTTCTATG
CCTAAATCAA GTAATGGCTT TTGGAATATT GCAGTTAAAA GTTGGGCTTT AAATCAGTTT
GGTATAGAAA GAGTGAGTGA GGATGAAATT CCCAAAATAA AATCTCATTC TATGATGATA
CTTTATGATG AATAA
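
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input. Applying the same functions to the full sequence should allow a comparison with the GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGTTTGATT CTGTAAAAAG AAAACTTGTT TGTTATGATG ATAATGAATA TTACATAGAG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(f"{gc_fraction(seq):.1%}")   # GC content of this sample line only
```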
 
Protein sequence
MFDSVKRKLV CYDDNEYYIE ISPLTNTIRA TLCCKGAIEA YLVGDFNNWE KSENFKLNWG 
LDTNDGRVKM IKDITFPCGL KKGEYKYGYI IITLDGKEIY VDNLNEEKND FYFLWEPFEE
SLEIKSSRDF ISTRYPVELI AVKNSFYGNI TIEEEVLSIE NSLKGVTLEN GFLKINEEVA
SGTKIIVKAY DSLNDMTAYK EIEVREDGEK GTFVQFLKND GLYYGENFSW NIWGFGENSS
GKEFNLNIKT DLGVGAFVDE EKFIVRRRTW GYDWVNDWDE QTYTFYLGKE DRNLFCIYEN
NKILTSLKTA IEETTPKIQV ALMDHKNTIK AYLSHKPLIG VKYGLYINGI KAKGVSTLVR
ENEKEVLITN LPSDIDPSDL LEVRASSMFS SCKVIVRDYL NDYYYGGNDL GVRFSEENIS
LRLWAPTAKK VELLIYEDYK SLRENPLRKY DMNREKENGT HLIKVPRKEN EGKYYLYRLY
FNGLSKKGKH VNKITYAVDP YAVSVGVNGE KGALVDLFNK ECVPEGWNKY NKPKLINKED
SIIYEMHIRD FTINENSGVS ESLRGKFLGT VEEGTFYIDK ESGNKVKTGL DHLKELGITH
VHLLPVFDFA SVNEEITKDE NNRNWGYDPK NFNAIEGSYS TDPYTPSRRI IEFREMIKKF
HDNGIRVILD MVYNHMYETS NMDNIVPLYY FRSDKLGKYT NGSGCGNEMA SEKPMVRKFI
LDSILHWIKN YHIDGLRFDL MELIDLDTMK EIVKRSEEID EKILIYGEPW KGGDSPLSNG
TYKGSQKGLG FSIFNDDFRN ALRGNNDPSN GFINGEQHNK NKVWQVIEGI KGSINSITYK
PMESINYLEA HDNYTLWDQI VKSQNHSVEK GHYRDFNEEN ILDNLYVKED LLGASILFTS
QGIPFIQSGA EFLRSKDGDH NSYKSPDSIN ALNWAEKEKY IDVFNYYKDL ISLRKNHSAF
RMKNPEDIIG NLEVYFYDNN DTSGVIIAHY KNNANEDLWK DIVVIYNGTT IDDYNVISSM
PKSSNGFWNI AVKSWALNQF GIERVSEDEI PKIKSHSMMI LYDE
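
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the amino acid assignments used by NCBI translation table 11 (which match the standard code for amino acids and differ only in permitted start codons), and the `translate` helper is a hypothetical name. It is applied to the first 30 bases and the last 15 bases of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence, as printed.
print(translate("ATGTTTGATT CTGTAAAAAG AAAACTTGTT"))  # MFDSVKRKLV
print(translate("CTTTATGATG AATAA"))                  # LYDE
```

Both results match the start (MFDSVKRKLV) and end (LYDE) of the protein sequence shown above.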