Gene CPF_1839 details

Gene Information

Locus tag: CPF_1839
Symbol:
ID: 4202241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2073663
End bp: 2076857
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 29%
IMG OID: 638082709
Product: putative pullulanase
Protein accession: YP_696273
Protein GI: 110800805
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases
TIGRFAM ID: [TIGR02104] pullulanase, type I
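
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS ending in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(2073663, 2076857)
print(gene_len)                  # 3195
print(protein_length(gene_len))  # 1064
```

Both values agree with the Gene Length (3195 bp) and Protein Length (1064 aa) fields of this record.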


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGATT CTGTAAAAAG AAAACTTGTT TGTTATGATG ATAATGAATA CTACATAGAG 
ATTAGTCCTT TAACTAATAC CATAAGAGCA ACCTTATGTT GTAAGGGGGC AATTGAGGCT
TATTTAGTTG GAGATTTTAA CAATTGGGAA AAAAGCGAAA ATTTTAAGCT TAATTGGGGC
TTGGATACTA ATGATGGTAG AGTAAAAATG ATTAAAGATA TTACCTTTCC TTGTGGACTA
AAGAAGGGTG AATATAAGTA TGGATATATA ATTATTACTC TAGATGGAAA AGAAATATAT
GTAGATAACT TAAATGAAGA GAAAAATGAT TTTTATTTCC TTTGGGAGCC CTTTGAAGAA
TCCTTAGAAA TAAAAAGTTC AAGAGATTTT ATATCTACTA GATACCCAGT TGAATTAATA
GCTGTTAAAA ATTCATTTTA TGGAAATATA ACCATTGAAG ATGCGGTTCT TTCCATAGAG
AACTCTTTAA AGGGAGTAAT TTTAGAAAAT GGTTTTTTAA AGATTAATGA GGAAGTAGCC
CCTGGCACTA AAATTATAGT AAAGGCCTAT GATTCTTTAA ATGACATGAC AGCTTATAAG
GAAATTGAGG TTAGAGAAGA AGGGGAAAAA GGGACTTTTG TACAATTTTT AAAGAATGAT
GGATTATATT ATGGAGAAAA TTTCAGCTGG AATATTTGGG GCTTTGGAAA AAATAGCTCA
GGTAAAGAAT TTAATTTAAA TATTAAAACA GATTTAGGTG TAGGAACTTT TGTAGATGAA
GAGAAATTTA TAGTAAGAAA GAGAACTTGG GGATGTAACT GGGTAAATGA TTGGAATGAG
CAGACATATA CTTTTTATTT AGGTAAGGAA GATAGAAATT TATTTTGCAT ATATGAAAAT
AATAAGATAT TAACTTCTTT AAAGACTGCC ATAGAAGAAA CTACCCCTAA AATACAAGTT
GCCTTAATGG ATCATAAAAA TACTATAAAA GCATATCTTT CTCATAAGCC TTTAATTGGA
GTTAAGTATG GATTGTATAT AAACGGAATA AAGGCTAAGG GAGTTTCAAC CCTAGTTAGA
GAAAATGAAA AAGAAGTTTT AATAACTAAT CTTCCAAGTG ATATAGATCC ATCAGATTTA
TTAGAAGTAA GAGCTTCTAG TATGTTTTCA AGTTGCAAAG TAATAGTAAG AGATTATTTA
AATGATTATT ATTATGGAGG AAATGACCTA GGTGTAAGAT TTAGTGAAGA AAATATAAGC
TTAAGGTTGT GGGCACCTAC TGCTAAAAAA GTAGAATTGT TAATATATGA AGATTATAAA
TCTCTAAGGG AAAATCCTTT AAGAAAATAT GATATGAATA GGGAAAAAGA AAATGGAACT
CATTTAATTA AAGTGCAAAG AAAAGAAAAT GAGGGTAAAT ATTATTTATA CAGATTATAT
TTTAATGATT TAAGCAAAGA AGGTAAACAT GTAAATAAGA TAACCTATGC TGTAGATCCC
TATGCTGTTT CAGTAGGAGT AAATGGAGAA AAGGGAGCTT TAGTAGATTT ATTTAGTAAA
GAGTGTGTAC CAGAAGGATG GAATAAGTAT AATAAACCTA AGTTAATAAA TAAAGAAGAT
TCAATAATTT ATGAGATGCA CATTAGAGAT TTTACCATAA ATGAAAATTC AGGAGTTTCT
GAAAGCTTAA GAGGAAAATT TTTAGGGGCA GTTGAAGAGG GAACTTTCTA TATTAATAAA
GAAAATGGAA ATAAGGTTAA GACAGGATTA GATCATTTAA AGGAATTAGG GATAACCCAT
GTTCACTTGC TTCCAGTCTT TGATTTTTCT TCTGTTAATG AAGAAATAAC CAAGGATGAA
AATAATAGGA ATTGGGGATA TGATCCTAAA AACTTTAATG CCATAGAAGG AAGTTATTCA
ACAGACCCAT ATACACCTTC AAGAAGAATA ATAGAGTTTA GAGAGATGAT AAAAAAGTTT
CATGATAATG GAATAAGAGT TGTATTAGAC ATGGTTTATA ATCATATGTA TGAAACTAGT
AATATGGATA ATATAGTTCC TTTATATTAC TTTAGAAGTG ATAAATTAGG AAAATATACA
AATGGATCTG GCTGTGGAAA TGAAATGGCC TCTGAAAAGC CTATGGTTAG GAAATTTATT
TTAGATTCAA TATTACATTG GATTAAAAAT TATCATATAG ATGGATTAAG ATTTGACTTA
ATGGAACTTA TAGACTTAGA TACAATGAAG GAAATAGTGA AGAGAAGTGA AGAGATTGAT
GAAAAAATAC TTATTTATGG GGAGCCTTGG AAAGGTGGAG ATTCTCCTTT AAGTAATGGA
ACTTACAAGG GAAGTCAAAA AGGTCTTGGA TTTTCCATAT TTAATGATGA TTTTAGAAAT
GCTCTAAGAG GAAATAATGA CCCATCTAAT GGATTTATAA ATGGAGAACA GCACAATAAA
AATAAAGCTT GGCAAGTTAT AGAAGGAATT AAGGGGTCCA TAAATTCAAT AACCTATAAA
CCTATGGAAA GTATAAATTA TTTAGAATCA CATGATAACT ATACCCTATG GGATCAAATA
GTAAAAAGTC AAAATCATAG TGTGGAAAAG GGACATTATA GAGATTTTAA TGAAGAAAAT
ATATTAGATA ATTTTTATGT TAAGGAAGAT TTGCTTGGAG CTTCTATATT ATTTACTTCT
CAAGGAATTC CCTTTATTCA ATCTGGAGCT GAAATTTTAA GATCTAAAGA TGGAGATCAT
AATAGCTATA AGAGTCCAGA CAGTATAAAT GCTTTAAATT GGGCAGAGAA AGAAAAGTAT
ATTGATGTTT TCAATTATTA TAAAGACTTA ATAAGTTTAA GGAAAAATCA CAGTGCTTTT
AGAATGAAAA ACCCAGAGGA TATAATAGAA AATTTAGAAG TTTATTTTTA TGACAATAAT
GATACTAGCG GAGTAATAAT AGCTCACTAT AAAAATAATG CTAATGAAGA TTTATGGAAA
GATATAGTGG TAATATATAA TGGAACTACC ATTGATGATT ATAATGTAAT TTCTTCTATG
CCTAAATCAA GTAATGGCTT TTGGAATATT GCAGTTAAAA ATGGGGGTGT AAATCAGTTT
GGTATAGAAA GAGTAAGTGA GGATGAAATT CCTAAAATAA AATCTCATTC TATGATGATC
CTTTATGATG AATAA
 
Protein sequence
MFDSVKRKLV CYDDNEYYIE ISPLTNTIRA TLCCKGAIEA YLVGDFNNWE KSENFKLNWG 
LDTNDGRVKM IKDITFPCGL KKGEYKYGYI IITLDGKEIY VDNLNEEKND FYFLWEPFEE
SLEIKSSRDF ISTRYPVELI AVKNSFYGNI TIEDAVLSIE NSLKGVILEN GFLKINEEVA
PGTKIIVKAY DSLNDMTAYK EIEVREEGEK GTFVQFLKND GLYYGENFSW NIWGFGKNSS
GKEFNLNIKT DLGVGTFVDE EKFIVRKRTW GCNWVNDWNE QTYTFYLGKE DRNLFCIYEN
NKILTSLKTA IEETTPKIQV ALMDHKNTIK AYLSHKPLIG VKYGLYINGI KAKGVSTLVR
ENEKEVLITN LPSDIDPSDL LEVRASSMFS SCKVIVRDYL NDYYYGGNDL GVRFSEENIS
LRLWAPTAKK VELLIYEDYK SLRENPLRKY DMNREKENGT HLIKVQRKEN EGKYYLYRLY
FNDLSKEGKH VNKITYAVDP YAVSVGVNGE KGALVDLFSK ECVPEGWNKY NKPKLINKED
SIIYEMHIRD FTINENSGVS ESLRGKFLGA VEEGTFYINK ENGNKVKTGL DHLKELGITH
VHLLPVFDFS SVNEEITKDE NNRNWGYDPK NFNAIEGSYS TDPYTPSRRI IEFREMIKKF
HDNGIRVVLD MVYNHMYETS NMDNIVPLYY FRSDKLGKYT NGSGCGNEMA SEKPMVRKFI
LDSILHWIKN YHIDGLRFDL MELIDLDTMK EIVKRSEEID EKILIYGEPW KGGDSPLSNG
TYKGSQKGLG FSIFNDDFRN ALRGNNDPSN GFINGEQHNK NKAWQVIEGI KGSINSITYK
PMESINYLES HDNYTLWDQI VKSQNHSVEK GHYRDFNEEN ILDNFYVKED LLGASILFTS
QGIPFIQSGA EILRSKDGDH NSYKSPDSIN ALNWAEKEKY IDVFNYYKDL ISLRKNHSAF
RMKNPEDIIE NLEVYFYDNN DTSGVIIAHY KNNANEDLWK DIVVIYNGTT IDDYNVISSM
PKSSNGFWNI AVKNGGVNQF GIERVSEDEI PKIKSHSMMI LYDE
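
The sequences above are printed in blocks of ten letters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be joined, translated, and checked for GC content. It assumes that internal codons under translation table 11 are read the same way as in the standard genetic code, and all function names are hypothetical helpers written for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, "*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTTTGATT CTGTAAAAAG AAAACTTGTT TGTTATGATG ATAATGAATA CTACATAGAG"
print(translate(first_line))  # MFDSVKRKLVCYDDNEYYIE
```

The translated fragment matches the first two blocks of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field of this record.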