Gene CPF_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1803 
SymbolpulA 
ID4201252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2035132 
End bp2037099 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content29% 
IMG OID638082673 
Productpullulanase, type I 
Protein accessionYP_696237 
Protein GI110799982 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02104] pullulanase, type I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.248563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAA AGTTTAATTC TAAAGAGTTT AATGATAAGT ATTATTATGA TGGAGAACTT 
GGAGCTATTT ATAATAAAGA GGAGACTATT TTTAGGGTAT GGTCACCAGA AGCTAAGAGT
TTAGAATTAT TACTTTATAG AGATGGGAAT AAAGAGAGTT TAATAGAAAG AGATTTGCTT
CATAAAAAAA GTAATGGACT TTGGGAATTA GTAAAAAAGG GAGATTTAAA TGGAGTTTAT
TATAAGTGCC ATATTGTAGG AGAAGATTTT GAGCATGATT TTGTTGATCC TTACTGCAAA
GCCTTAGGAG TTAATGGGTA TAGAGCTATG GTTATTGACT TGAAGAAAAC TAATCCTAAG
GGATGGGAAA ATGAAAAGAA ACCATCACTT GAAAATCCCT TAGATTCAAT TTTATATGAA
ATGCATATAA GAGATTTTTC GATTGATAAG AGTTCAGGGG TTTCTTTAGA AAATAGAGGA
AAGTTTTTAG GAATAGTTGA AGAAAATACA AAAGTTCCTG GTACAGAAGT AAAAACAACC
TTAGATCATT TAAAGGAATT AGGAATAACC CATGTACATC TTTTACCTTC TTTTGATTTT
GGAACAGTTG ATGAGGAGAG ATTAGATGAA GAGCAGTATA ATTGGGGATA TGACCCTGTA
AATTATAATG TACCAGAAGG ATCATATTCT AAAAATCCAT ATAAAGGTGA AGTTAGAATT
AAAGAGTTTA AGGAAATGGT ACTTAAACTT CATAGAGCAG GAATAAGAGT TATAATGGAT
GTTGTGTATA ATCACACTTA TTCAGGAGAA AACTCTAATT TAAATTTATC TTACCCAGGA
TATTATCATA GACAAGATGA CTTTGGGAAT TTTTCTAATG GTTCTGGATG TGGTAATGAA
TTAGCATCAG AAAGACTTAT GGTAAGAAAA TATATGGTTG ATTCATTAAA GTATTGGGCA
AAAGAATATC ACATAGATGG ATTTAGATTT GATTTAATGG CATTACATGA TATTGAAACC
TTAAAGGAAA TAAGAGAAGA ATTAAATAAA ATAGATCCTT CAATATTAAT ATATGGAGAA
GGATGGAATG GAGGAGACTC TCCACTGCCT AAAGAAGAAG CTTGTTTTAA ATGTAATATA
GGTAAATTTG ATAAATTACA AATAGCAGCT TTTAGTGATG ATATGAGAGA TGGGATAAAA
GGATATGTAG CACATCTTAA AGAAGGAGGC TTCGTAAATG GAGGAGAAGA CTTTGAAGAG
AGTATAAAGT TTGGAATAGT AGCTTCTACT TATCATGAAG GAGTAGATTA TAATAAAGTA
AATTATTCAG ATTCTCCTTG GGCAAATGAA CCTTATCAGA CAGTAAATTA TTGCTCAGCC
CATGATAATA ATACATTACA CGATAAGCTT AAAATTGTAT GTGAAAATGC TTCTGAAGAA
GAGATAATAG AAATGAATAA GTTAAGTGCT GCTATTTTCT TAACATCTCA AGGAATACCC
TTTATTCATT CAGGAGAGGA GTTTTTAAGA ACTAAGACTA ATGAAAAAGG AGAATTTATT
GAAAATAGTT ATAATTCTAA TGACTTTGTA AACAAGATTG ATTGGACTAG AAAAGTAAAG
TATATGGATT TATTTAAATA CTATAAGGGA TTAATTGAAT TAAGAAAGGA ATATCCTTTA
TTTAGGTTAG AAAGTAACAA AGAAATAAGA GAAAAAATTT CATTTATAGA GAGTGAGTTA
GGAATAAAGA AAAAGGGTAT AGTTGCTTAT AGATTAAAAG ATCATAATAA TGAATTCATA
GTAGTTTTTA ATAGTAATAA TAATGAAGTT AAAATAAACC TTCCTAAGGG TCTTTGGGGA
GTTATGGTTA ATAATAAATT TTCAGGAAAA GAAATAAGGG ATGAAGCTAG AGATTTTTAT
GATATAATAA GAAAGTCAGC ATGTGTTCTT AAAAAATTAA GTGATTAA
 
Protein sequence
MRRKFNSKEF NDKYYYDGEL GAIYNKEETI FRVWSPEAKS LELLLYRDGN KESLIERDLL 
HKKSNGLWEL VKKGDLNGVY YKCHIVGEDF EHDFVDPYCK ALGVNGYRAM VIDLKKTNPK
GWENEKKPSL ENPLDSILYE MHIRDFSIDK SSGVSLENRG KFLGIVEENT KVPGTEVKTT
LDHLKELGIT HVHLLPSFDF GTVDEERLDE EQYNWGYDPV NYNVPEGSYS KNPYKGEVRI
KEFKEMVLKL HRAGIRVIMD VVYNHTYSGE NSNLNLSYPG YYHRQDDFGN FSNGSGCGNE
LASERLMVRK YMVDSLKYWA KEYHIDGFRF DLMALHDIET LKEIREELNK IDPSILIYGE
GWNGGDSPLP KEEACFKCNI GKFDKLQIAA FSDDMRDGIK GYVAHLKEGG FVNGGEDFEE
SIKFGIVAST YHEGVDYNKV NYSDSPWANE PYQTVNYCSA HDNNTLHDKL KIVCENASEE
EIIEMNKLSA AIFLTSQGIP FIHSGEEFLR TKTNEKGEFI ENSYNSNDFV NKIDWTRKVK
YMDLFKYYKG LIELRKEYPL FRLESNKEIR EKISFIESEL GIKKKGIVAY RLKDHNNEFI
VVFNSNNNEV KINLPKGLWG VMVNNKFSGK EIRDEARDFY DIIRKSACVL KKLSD