Gene CPF_1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1301 
Symbol 
ID4202370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1474245 
End bp1480142 
Gene Length5898 bp 
Protein Length1965 aa 
Translation table11 
GC content31% 
IMG OID638082182 
Productglycosyl hydrolase family 31 protein 
Protein accessionYP_695747 
Protein GI110800450 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.449133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AGATGATTAT TAAAAAATTA AGTGTTTTAA CTTTAGGTAC TATTTTTACA 
TCTAATATTC TTTTACCTTC AACATTAATT TATGCTTTTC CAACAGAAGG GATAAATCAT
AATATCGAAA AAGAGGAGAA AAAAGAAATA AAGGAAAGTA ATTATTCTCA AATTGGAGGA
GTAGAAAATT TTACTCAAAA TGAAAACGAT GTATTATTGG ATTTAAGTAC AGGAGAAAAG
ATTAGAATAT CATTCTTGAA AGAGGATGTT TTAAGAATTT ATATGGATCC TACAGGAGAA
TTTCAAGAGG AGCCAACTCC AAATAGTAAG GACCATATTA CTAAAATAAT TAATAAGACT
GAGGAAGAAT ATGAAAATCC TACTCCAATA GTGGAAGATG GAGAAATAAT AAAAATATCT
ACAAGTGCTG TTCAGTTAAG AATAGAAAAA TCAACTGGAA AAATGGAGCT TTTTAATAAA
TTAAATAATA AAACTGTATG GAAGGAAGCT GAACCTTTAA AGCATAGAGT AGACGGTACT
ATTCAGACTC TTGAGTCAAA TGAAGATGAA TATTTCTATG GTGGAGGAAT GCAAAATGGT
AGGTTCTCTC ATAAGGGTAA AAAGATAAAT ATTAAGAATG AAAATAATTG GGTAGATGGT
GGTGTTGCAA GTCCAAATCC ATTTTATTTT TCAACTAATG GGTATGGAGT TATGAGACAT
ACATTTAAAC CTGGTGAGTA TGATTTTGAA GTTTCTGCAT CAGGTAAAGT TACAACAAAA
CATGAAGAGA ATCGTTTTGA TGCTTATTAT TTTATTGATG AAAAACCAAC AAATATAATA
GATAAGTTCA CAGAACTAAC AGGAGAACCA GTTTTGTTAC CTGAGTATGG ATTTTATTTA
GGACATGCAA ACTGTTATTC TCGTGACTGG ATAAATGATG AAACAGGACA AGAGTCTCAA
ACACAAAAGC CTGGTTTTGA TAGACAAGAG AGTTTAATGG TTGATGCTAA AAAAGTAGTT
GATGACCATG TTTCTAATGA TACACCACTT GGATGGTTTT TACCTAATGA TGGTTATGGT
TGTGGATATG GACGCGAAGA AAGTATAGAT GGAAACATAG CTAATTTAAA AGAATTTGTT
GATTATGCAA GAGGCTTTGG AATTCAAACA GGACTTTGGA CACAGAGTTC TTTAAAACCT
ACAGGAAATC AAGAGGCTTA TCTTGAGCGT GATATAGATA AGGAAGTTGG GGTTGCTGGC
ACAAATGGAG TTAAAACAGA TGTGGCATGG GTTGGAGCAG GATATTCATT TGCACTTAAC
TCAGTTAGAC AAGCTGCTGA AGGAATAATA AATAATTCAA AAGATAAGGC AAGGCCATTT
ATAGTAAGTT TAGATGGATG GGCTGGAACT CAGCGTTATG CAAGTATTTG GTCAGGAGAT
CAATATGGTG GAGAATGGGA ATATATAAGA TTTCATATAC CAACATATAT AGGGGCTGGT
CTTTCAGGTC AACCTAATGT AGGATCTGAT ATGGATGGCA TATTTGGAGG AAGTAAATTA
GTACAAACTA GAGATTTTCA ATGGAAGGCA TTTACTCCAG TACAAATAGA TATGGATGGA
TGGGGAGCAA ATGCTAAATA TCCATATGTA TTTGGAGAGC CTTATACTTC AATAAATAGA
ATGTATTTAA AATTAAAAGC AGAAATGATG CCATATAATT ATAGTATAGC TAATGAAGCT
ACTAATAATG GAGTTCCTAT GATAAGAGCT ATGATGCTTG AATATCCAGA AGAATATACT
TATGGAACTG ATACACAGTA TCAATATATG TGGGGACCTA ATATGTTAGT TGCTCCTATA
TATCAAAATA CAGATGGTGA TTCAGAAGGT AATGATATAA GAAATAATAT TTATCTTCCA
GATGAAGAGC AAATTTGGAT TGATTATTTT ACTGGAAAGC AATATAGGGG TGGTGGAGTA
TTAAATAATT TTGAAGCTCC TTTATGGAAA TTGCCTATCT TTGTTAAGAA TGGAGCAATA
ATTCCAATGA CTAGTGAAAA TAATAATCCA GAAGAAAGAG ATGATTCCCA TCGTATTTAT
GAAGTATATC CAAGTGGAGA TACAAGTTTT GAGGTTTATG AGGATGATGG TTTAACAACT
GATTATAAGG AAGGAAAATC AGCTAAAACT ATGGTAACTT CTAGTGCTCC AAAGACTGGA
AAAGGAACTG CTGTTATTAA TGTAGGATTA TTAGAAGGAG ACTATAATGG AATAGTTTTA
GATAGATCAA CAGAGTTTAT AGTTAATGTA AGTGAAAAAC CAAGCAATTT AGGTGTTACT
TTAGGTGGTA ATGATGTTCA GCTAACAGAA GCGCAAAGTT TAGAAGAATT TGAAAAGGGA
AGCAATATGT ATTTCTATGA TGAAACTCCA AATCTTAATA AATATGCTAC TGAAGGTTCA
GAATTTGCAA AGGTTGAAAT TACTTCAACA CCTAAGCTTC GTGTAAAAGT TGATAAAACA
AATGTAAAAG AAAATGAAGT TAAATTAACC ATTGATGGAT TTAATAATAC TCAAGATATT
GATAAAAATG AAGTGAATGA AAGTTTGGGA GTTCCAGGTA ATTTTAGGGC ATCAGAAGAA
GATATAACTC CTGAATCTAT AAAATTAACT TGGGATGAAG TTGAAGGTGC TACAACTTAT
GATGTTGAAA TTGATGGAAC TATATTTAAA AATATCAAAA ATACTGAGTA TTTAGATACA
GGATTGAATT ATGATACTGA ATATAGTTAT CGTGTAAGAA GTGTAAATAA GGATGGACAT
TCTAATTGGA GTGAATTGAT AAAGGTAAAA ACTGATTTAG ATCCATACAG AAATGTGCCT
AAGGATATGA AAACAGAGTG GAAATGGGGA CAATATAGCA GTGATGAACC AAGTAAGGCT
GTTGATGGAG ACGATTCATC ACAATTCCAC TCACAAGATA GTGCTATAGA TAAACCATTT
ATAATTGATA TGCAAAAGGC ATATACAATA GAGAAATTAG AATTATTATT TAGAAAAAAT
GGTAATGGTT CTGTTAAGAG AGCAGAAATA TATAGTAGCT TAGATGGAGT AACTTATGAA
AAAGTATTTA GTAATGCTGA AGGTTCAGAT ATAGCACCAT GGGCTACTGA TGGAGAAGTT
AAAACTATAA ACTTTAATAA GCCAATAAAG GTTCGTTACT TTAAGATTGT AACTAAGGAA
TCTATAGGAA ACTTCTTGGC TATGAGAGAG TTTAGACCTT ATAAGGTTGA TGGAACTAAT
GGACAGATTG TAGGAGATTG GAATAATAGT GGTTCAATAG AAGAGGGAGA TTTAGTATTC
CTAGAAAACT ATGCTGGATT AACTACTGCT GACTCTGATT GGGGCTATGT ATCAATGGCA
GATTTAAATA ACAATGGTTT GATTGATGCT TACGATATAT CTTATGTTTC TAGTAAATTG
GAAGGTGGAG TAAAACCTTC AGAAGGATTA GAACTTCAAG GAGATATGAT GCTTGTACCA
AGTAAATCTG AAATTAAATC TGGTGAAACA TTTACTATTG ATTTAGTAGG AACAGGTCTT
TCAGATATTA ATGCATTTAG TGCAGAAATT CCTTTAGATT CTACTAAGTA TGAGTATATT
AAGACTGAAG GAACTGTTTC CACTTCTGGA ATGAAAAATC TTTCAAAGGC AAGGGTTCAT
ACAGATAATA CTCAACATGT TTATGTGAAC TTTACTAATA TAGGAGATAA TGTAAAAGTA
AATGGAACAG ATACAATTGC TAGAATAACA TTAAAAGCTA AACAAAATAT TACCTGGGAT
ATGGAGATTA GCAATGCTTT ATTAGTAGAT AGCAAGTTAA ATTCTAAGTC AGCTATTGCT
AAAATTATTG ATTTAGAAAG TGAATTACCA TCTGGAAGAC CAAATAGTAG TAAGGTTTCA
AAAGAAAACA TTACTGTAAG TGGTGACTCA AGTCAATTAC AATCAGGAAT GGGATTAGAT
AAATTAATAG ATGGAACAAC AAGTAGTGAT GATAGTTCGC GTATGGATTT AAAATGGATA
TTTACTTCTG ATCAACAAGA TAAGGGAACT CTTCCTTTTG AAATGACCTT TGAATTTAAT
GAACCAAAAA CTTTAGAAAA TTTTACTATA TACAATAGAA TGAATTCAAA TGGAACCATA
AATATTGCAG CCATGAAAAA GGTTAAGGCT GTTGGATATT TAAATGGAGA AGAATTTGAT
TTAGGAGAAA AAGCTAATAT TACATCGGCT ACAACTGTTT ATGAATTAGG TGGAAAAGAA
TTTGATAAGA TTGTAATTAC TGCTTTAGAT TCTCATAAAG ATAAGAATAC TTTAGCTATA
AATGAAATTG AATTTTATGA GAAGAGTGAA GTGGAAACAA CAGGAATAAG TTTCGCAGAA
AATACTCCAG AATCAGTATA TTTAAATAGA ATTACTCCTA TTTTTGCAGA AGTTACACCT
GATAATGCAA ATAACTTAAA CTATAGATTA GCCTCAGAAA ACCCTGATAT TTTGCAAATT
TTACGTGTAG ATAGAGAGGA TAAAGCAACT TATTATTTAC GTGGATTAAA TCCTGGAGTA
GCAAAATTAG TTGCAACTAC TGCTGAGGGA AATCATAAAG TTGAAAAAGA AATAGTTGTT
TTAGATGGAA TTGATAAAAC TTTATTAACT AAAGCCATAG AAGAAGCTAA ATCCTATGAA
TCTTTAAGTG AGATATATAC TATTGAATCT TATGAAGCTT TACTTGAAGC TATTAAACAT
GCTGAAGATG TATTAGAAAA TAGTTCAAGT GAAAAAGAAA TTGGAGATGC AATAATAAAT
TTAAGAAGCA AAATTTCAAA GTTAGAAGAA CGTGAAACTG TAGAGGAGGA TAAGATAGAT
TCTAGCAAGT TAGAAGCTAT TTATGCAACT AGTGAAGCAG ATAGAGATTA TAAAGAGAAT
GCTGTAGATG GAGATGAAAA TACCATTTGG CATTCAGCTT ATCAAGCTGC AGATAAACTA
CCAGTATCTA TAACAATTAA GTTAGATAAA GCTTATGATC TTAATCAAAT AGACTATTTA
CCAAGACAAA ACAGTAGAAA TGGTCATGTT ACTGAGTATA AGATTGAAAC AAGTTTAGAT
AATGAAAATT GGACTGAGGT AAGAACAGGG AACCTAGAAG TTAATGAAGC TGGTAATGCT
TTGGCTAATA GAGGATATAA TCCTATAAGA TTTAATACTA TTAATGCTCA ATATTTAAGA
TTTACAGCCC TTAAAACTTT AGGAGATACA AATAATAAGT ATGCAAGTGC AGCTGAATTA
GTATTCTACG GAAAAGAGGG GAAAGTAAGT GCTGAATCTA TTACATTAGA AAAGACAGAG
TTAAAATTAA ATGTTAATGA ATCAGAGCAA CTAAAAGCTG TATTAAATCC TATAGAAAGT
AATGATACTA TTACTTGGAC TTCAAGTGAT GAGAGCATTG CTAAAGTAGA TGAAAATGGA
GTAGTTACTG GAATAGGCAA AGGAGAAGCC TTAATCACTG CTACAATACC TAATGGAAAA
TCTGCTACTT CCAAGGTTAT AGTAGAAGAT AGCGTTTCTG AGGAAATAAT AGTTAGTCCA
GTAAGAGACT TTAAAGCTTC TCAGGTTAAT AAAAAAGATG TAACTGTAAC ATGGACTACT
CCAGAATCAA CTACTGGATT AGAAGGATAT ATACTCTATA GAGATGGAAA GAAAGTAGCT
CAGTTAGAAG CGGATGAAAC TTCATATATG TTTAATAAAT TAAACAGACA TACAATTTAT
AATTTTAAGA TTGCAGCTAA GTATTCAAAT GGAAAAATTT CAGAAAAGTC ATCAATAACT
ATTAGAACAG CAAGGTAG
 
Protein sequence
MKRKMIIKKL SVLTLGTIFT SNILLPSTLI YAFPTEGINH NIEKEEKKEI KESNYSQIGG 
VENFTQNEND VLLDLSTGEK IRISFLKEDV LRIYMDPTGE FQEEPTPNSK DHITKIINKT
EEEYENPTPI VEDGEIIKIS TSAVQLRIEK STGKMELFNK LNNKTVWKEA EPLKHRVDGT
IQTLESNEDE YFYGGGMQNG RFSHKGKKIN IKNENNWVDG GVASPNPFYF STNGYGVMRH
TFKPGEYDFE VSASGKVTTK HEENRFDAYY FIDEKPTNII DKFTELTGEP VLLPEYGFYL
GHANCYSRDW INDETGQESQ TQKPGFDRQE SLMVDAKKVV DDHVSNDTPL GWFLPNDGYG
CGYGREESID GNIANLKEFV DYARGFGIQT GLWTQSSLKP TGNQEAYLER DIDKEVGVAG
TNGVKTDVAW VGAGYSFALN SVRQAAEGII NNSKDKARPF IVSLDGWAGT QRYASIWSGD
QYGGEWEYIR FHIPTYIGAG LSGQPNVGSD MDGIFGGSKL VQTRDFQWKA FTPVQIDMDG
WGANAKYPYV FGEPYTSINR MYLKLKAEMM PYNYSIANEA TNNGVPMIRA MMLEYPEEYT
YGTDTQYQYM WGPNMLVAPI YQNTDGDSEG NDIRNNIYLP DEEQIWIDYF TGKQYRGGGV
LNNFEAPLWK LPIFVKNGAI IPMTSENNNP EERDDSHRIY EVYPSGDTSF EVYEDDGLTT
DYKEGKSAKT MVTSSAPKTG KGTAVINVGL LEGDYNGIVL DRSTEFIVNV SEKPSNLGVT
LGGNDVQLTE AQSLEEFEKG SNMYFYDETP NLNKYATEGS EFAKVEITST PKLRVKVDKT
NVKENEVKLT IDGFNNTQDI DKNEVNESLG VPGNFRASEE DITPESIKLT WDEVEGATTY
DVEIDGTIFK NIKNTEYLDT GLNYDTEYSY RVRSVNKDGH SNWSELIKVK TDLDPYRNVP
KDMKTEWKWG QYSSDEPSKA VDGDDSSQFH SQDSAIDKPF IIDMQKAYTI EKLELLFRKN
GNGSVKRAEI YSSLDGVTYE KVFSNAEGSD IAPWATDGEV KTINFNKPIK VRYFKIVTKE
SIGNFLAMRE FRPYKVDGTN GQIVGDWNNS GSIEEGDLVF LENYAGLTTA DSDWGYVSMA
DLNNNGLIDA YDISYVSSKL EGGVKPSEGL ELQGDMMLVP SKSEIKSGET FTIDLVGTGL
SDINAFSAEI PLDSTKYEYI KTEGTVSTSG MKNLSKARVH TDNTQHVYVN FTNIGDNVKV
NGTDTIARIT LKAKQNITWD MEISNALLVD SKLNSKSAIA KIIDLESELP SGRPNSSKVS
KENITVSGDS SQLQSGMGLD KLIDGTTSSD DSSRMDLKWI FTSDQQDKGT LPFEMTFEFN
EPKTLENFTI YNRMNSNGTI NIAAMKKVKA VGYLNGEEFD LGEKANITSA TTVYELGGKE
FDKIVITALD SHKDKNTLAI NEIEFYEKSE VETTGISFAE NTPESVYLNR ITPIFAEVTP
DNANNLNYRL ASENPDILQI LRVDREDKAT YYLRGLNPGV AKLVATTAEG NHKVEKEIVV
LDGIDKTLLT KAIEEAKSYE SLSEIYTIES YEALLEAIKH AEDVLENSSS EKEIGDAIIN
LRSKISKLEE RETVEEDKID SSKLEAIYAT SEADRDYKEN AVDGDENTIW HSAYQAADKL
PVSITIKLDK AYDLNQIDYL PRQNSRNGHV TEYKIETSLD NENWTEVRTG NLEVNEAGNA
LANRGYNPIR FNTINAQYLR FTALKTLGDT NNKYASAAEL VFYGKEGKVS AESITLEKTE
LKLNVNESEQ LKAVLNPIES NDTITWTSSD ESIAKVDENG VVTGIGKGEA LITATIPNGK
SATSKVIVED SVSEEIIVSP VRDFKASQVN KKDVTVTWTT PESTTGLEGY ILYRDGKKVA
QLEADETSYM FNKLNRHTIY NFKIAAKYSN GKISEKSSIT IRTAR