Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1522 |
Symbol | |
ID | 4204237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1708082 |
End bp | 1710049 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642566075 |
Product | pullulanase precursor |
Protein accession | YP_698840 |
Protein GI | 110802996 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02104] pullulanase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0360874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAA ATTTTAATTC TAAAGAGTTT AATGATAAGT ATTATTATGA TGGAGAACTT GGAGCTATTT ATAATAAAGA GAAGACTATT TTTAGAGTAT GGTCACCAGA AGCTAAGAGT TTAGAATTGT TACTTTATAG AGATGGAAAT ACAGAGAGTT TAATAGAAAG AGATTTGCTT CATAAAAAAA GCAATGGACT TTGGGAATTA GTAAAGAAGG GAGATTTAAA TGGAGTTTAT TATAAGTACC ATATTGTAGG AGAAGATTTT GAGCATGATT TTGTTGATCC TTACTGCAAA GCCTTAGGAG TTAATGGGTA TAGAGCTATG GTTATTGACT TGAAGAAAAC TAATCCTAAG GGATGGGAAA ATGAAAAGAA ACCATCACTT GAAAATCCCT TAGATTCAAT TTTATATGAA ATGCATTTAA GAGATTTTTC AATTGATGAG AGTTCAGGGG TTTCTTTAGA AAATAGAGGA AAGTTTTTAG GAATAGTTGA GGAAAATACA AAAGTTCCTG GTAAAGAAGT AAAAACAACC TTAGATCATT TAAAGGAATT AGGAATAACC CATGTACATC TTTTACCTTC TTTTGATTTT GGAACAGTTG ATGAGGAGAG GTTAGATGAA GAGCAGTATA ATTGGGGATA TGACCCTGTA AACTATAATG TACCAGAAGG ATCATATTCT AAAAATCCAT ATAAAGGTGA AGTTAGAATT AAAGAGTTTA AGGAAATGGT ACTTAAACTT CATAAAGCAG GAATAAGAGT TGTCATGGAT GTTGTATATA ATCATACTTA TTCAGGAGAA AACTCTAATT TAAATTTATC TTATCCAGGA TATTATCATA GACAAGATGA TTTTGGAAAT TTTTCTAATG GTTCAGGATG TGGTAATGAA TTAGCATCAG AAAGACTTAT GGTAAGAAAA TATATGGTTG ATTCATTAAA ATATTGGGCA AGAGAATATC ACATAGATGG TTTTAGATTT GATTTAATGG CATTACATGA CATTGAAACC TTAAAGGGAA TAAGAGAAGA ATTAAATAAA ATAGATCCTT CAATATTAAT ATATGGAGAA GGGTGGAATG GAGGGGAATC TCCACTGCCT AAAGAAGAAG CTTGTTTTAA ATGTAATATA GAGAAATTTG ACAAATTACA AATAGCAGCT TTTAGTGATG ATATGAGAGA TGGAATAAAA GGACATGTAT CACATCTTAA AGATGGAGGC TTCGTAAATG GAGGAGAAGA CTTTGAAGAG AGTATAAAGT TTGGAATAGT AGCTTCTACT TATCATGAAG GTGTAGATTA TAATAAAGTA AATTATTCAG ATTCTCCTTG GGCAAATGAA CCTTATCAGA CAGTAAATTA TTGTTCTGCC CATGATAATA ATACATTACA TGATAAACTT AAAATTGTAT GTGAAAATGC TTCTGAAGAA GAGATAATAG AAATGAATAA GTTAAGTGCT GCTATTTTCT TAACATCTCA AGGAATACCA TTTATTCATT CAGGAGAGGA GCTTTTAAGA ACTAAGATTA ATGAAAAAGG AGAATTTATT GAAAATAGTT ATAATTCTAA TGACTTTGTA AATAAGATTG ACTGGACTAG AAAAGTAAAG TATATGGATT TATTTAAATA CTATAAGGGA TTAATTGAAT TAAGAAAGGA ATATCCTTTA TTTAGGTTAG AAAGTAACAA AGAAATAAGA GAAAAAATTT CATTTATAGA GAGTAAGTTA GGAATAAATG AAAAAGGTAT AGTTGCTTAT AGATTAAAAG ATAATAATAA TGAATTCATA GTAATTTTTA ATAGTAATAA TAATGAAGTT AAAATAAACC TTCCTAAGGG CCTTTGGGGA GTTATGGTTA ATAATAAATT TTCAGGAAAA GAAATAAGGG ATGAAGCTAG AGATTTTTAT GATATAATAA GAAAGTCAGC ATATGTTTTT AAAAAAATAA GTAATTAA
|
Protein sequence | MRRNFNSKEF NDKYYYDGEL GAIYNKEKTI FRVWSPEAKS LELLLYRDGN TESLIERDLL HKKSNGLWEL VKKGDLNGVY YKYHIVGEDF EHDFVDPYCK ALGVNGYRAM VIDLKKTNPK GWENEKKPSL ENPLDSILYE MHLRDFSIDE SSGVSLENRG KFLGIVEENT KVPGKEVKTT LDHLKELGIT HVHLLPSFDF GTVDEERLDE EQYNWGYDPV NYNVPEGSYS KNPYKGEVRI KEFKEMVLKL HKAGIRVVMD VVYNHTYSGE NSNLNLSYPG YYHRQDDFGN FSNGSGCGNE LASERLMVRK YMVDSLKYWA REYHIDGFRF DLMALHDIET LKGIREELNK IDPSILIYGE GWNGGESPLP KEEACFKCNI EKFDKLQIAA FSDDMRDGIK GHVSHLKDGG FVNGGEDFEE SIKFGIVAST YHEGVDYNKV NYSDSPWANE PYQTVNYCSA HDNNTLHDKL KIVCENASEE EIIEMNKLSA AIFLTSQGIP FIHSGEELLR TKINEKGEFI ENSYNSNDFV NKIDWTRKVK YMDLFKYYKG LIELRKEYPL FRLESNKEIR EKISFIESKL GINEKGIVAY RLKDNNNEFI VIFNSNNNEV KINLPKGLWG VMVNNKFSGK EIRDEARDFY DIIRKSAYVF KKISN
|
| |