Gene CPF_1221 details

Gene Information

Locus tag: CPF_1221
Symbol: lacZ
ID: 4203932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1386055
End bp: 1390422
Gene Length: 4368 bp
Protein Length: 1455 aa
Translation table: 11
GC content: 29%
IMG OID: 638082102
Product: beta-galactosidase
Protein accession: YP_695667
Protein GI: 110800840
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
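
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1386055, 1390422)
print(length)                     # 4368
print(protein_length_aa(length))  # 1455
```

Both values match the Gene Length and Protein Length fields of this record.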


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAGCA AAAAGACTAT TGCTAATATA TTAAGTATTG CTTTATTTTC AAATATTGTA 
ATTAATAATT TTAGCATTAA CGAAGTTTTA GCAAATGTAA AAAAAGTAAA AATTGAAGAT
GGTGTTTTTT GGGCAAATAA TCCTGAAAAA TTTGAAGACA ATCGGGAAAA AGCACATGCT
ACTTTAATGC CTTTTAATAG TATTGAAGAA GCATTAAATA ATCCTAACTA CTCTGATTAT
TCTAATTCAG AGAATTATAT GTCTTTAAAT GGAGAATGGA AATTTAATTT AGTAGAAACT
TATGATAAAG ATATAAAAGA TTTTTATAAA ACCGATTTTG ATTCTAGTTC ATGGAATACT
ATACCTGTTC CATCTAGTTG GCAATTGCAT GGATATGATC AGCCAAGGTA TAATGATACA
GCTTATCCTT GGGAATATCA AGATAATATA CCAGAACCGC CTGATGTACC AACAGACTAT
AATCCAATTG GATATTATAA GAAAACATTT ACTATTCCTG AAGGATGGGA TAATAAAGAA
GTTTTTGTAT CATTTCAAGG AGTTGAATCT GCATATTACT TATATATAAA TGGAGAATAT
GTGGGTTATA GTGAGGATTC ATTTACTGGA CACGATTTTA ATATAGGAAA GTTTTTAAAA
GAGGGAGAAA ATGAAATATC TGTAAAAGTT CACAGATGGA GTGATGGAAG TTGGTTAGAA
TCACAAGATA TGATAAAGCT TAGTGGGATA TTTAGAGATG TGTTCTTATA TTCTACACCT
AAAGCTCATA TAAGAGATTA TACTTTAGTT ACAGATTTAG ATGATAAATA TAGAGATTCT
AATTTGAATG TTGAAATTGA TATTTCTAAT TATGGTATTA AAGCTGGAAA ATATAAAATA
AAAGGTATTT TATATGATGA AAATAAAAAA ATAGTAAATG AAGATATTAG TGAATTTGAA
TTAAATGATG AAGAAAATGT ATTAGTTTCT ATTAATACTA GAGTTGAAAA TCCTAAAAAA
TGGACAGCGG AAACTCCTAA TTTATATACT TATGTAATTG CTTTAGAAAA TGAAAATGGT
GAAGTGGTAG AAACTATAAG TAACAAGTTT GGATTTAGAA AAATAGAAAT TAAAAATAAT
CAAGTATGTA TAAATGGACA GCCTATATCA TTTAAAGGAG TGAATAGGCA TGAATTTTTA
CCTGATACTG GTAGAACACT TACGGAAGAA AGTATGATAG AAGATATAAA GCTTATGAAA
AAAAATAATA TAAATGCAGT AAGGTCATCT CATTATCCTA ATGATCCAAG ATGGTACGAT
TTATGTAATG AGTATGGACT ATATGTTATG GATGAAGCTA ATTTGGAAAC TCATGGAAGA
TTAGATGATA TTCCTCAGAG TAGACCAGAA TGGACCGAGG CGGTTATAGA TAGACAGAGA
TCGATGTTAG AGAGATCAAA AAATGAAACA AGTATAATAA TGTGGTCATT AGGAAATGAG
TCAAGTGGAG GGGAAAATTT TGAAATAGCT GCAAAGTGGA TAAAGGAAAA TGATCCTACT
AGATTAGTTC ATTATGAAGC GGAGAGGACT GTAGGAGACG TATATAGTAG AATGTATAGA
ACTATAGAAG AAATGGAAGC CTATGCAAAT GATCCTGATA ATAAAAAGCC ATATATACAA
TGTGAATATG CTCATGGAAT GGGAAATAGT ATAGGTAATT TACAAAAATA TTGGGATGTC
TTTGATAAAT ACGATATAAT GCAAGGTGGA TTTATATGGG ATTGGGCTGA TCAAGCTATA
AGAATGAAAG ATAAAAACAC AGGTGAAGAA TTTCTAAGTT ATGGTGGCGA TTGGGGAGAT
AGTGAATTTA CGGATGGGAA TTTCTGTGCT AATGGCTTAG TTTCAGCTGA TAGAACAGTT
CAACCTGAAC TTCAAGAAGT AAAAAAGGTT TATCAAGAAA TAGAGATAGA GGATATTGAT
ATTTTAAATG GTAAAGTAAA GATAGTAAAT GAACATTTAT TTACTAATTT AAATAAATAC
AAAGGAAAAT GGGAGTTAAG AGCCGATGAT AATATATTAC AAAATGGGGA GTTAGATATT
TCTGTTGATC CATTAAGTAG CAAGGAATTT ACGATTCCAT TTAAAAAGCC AGAACTTAGT
CCAGGGGTTG AGTATTGGTT AAATATAAGT TTTGAATTAA AGGATGATGA GCCTTGGGCT
GAAAAAGGAT ATGTAATATC AAAAGAACAA TTTAAACTTC CTTTTGATAA TGAAATGGAA
AAAGGAATTG ATTTAAATTC AATGAATTCA ATAGAACTTA AAAATGATGA AAATAATGTG
CAAATTATAG GTGATGGATT TAAGGTTTCT TTTGATAAGA AACTTGGTGC TTTAGAATCA
TATAAAATAG ATAAAGAAGG GAATGAAATT GAGCTTATAG AAGAACCTAT AAGACCTAAT
TATTGGAGAG CACCTAATGA TAATGACAAA GGATTTGGTG CTGAAGAAAG ATTTGATACT
TGGAGATATG CAGGTGCAAA TGCTAAAGTA GAAAATCTAG AAGTTATAGA AGTAGGAGAT
AAGGCTGTAA AAGTAAATGT AGACTTTATA TTACCAACTA ATATAGAATC TAAACTTAAT
GTTGAATATA TTGTTTATGG AAATGGAGAA GTTTCTGTAA ATAATACACT TAATGCTTCT
AAAGGTCTTT CAGAAATACC AGAAATAGGA ATGATGCTTA AACTTCCTAA AGAATTTGAT
AGTATTACTT GGTATGGAAG AGGACCAGAA GAAAATTATA TAGATAGAAA TACTGGGTAT
GATATTGGTG TTTATAACAA GAATGTTAAA GATTTCTTCT TCCCATACTT AGAACCATCC
GAAACAGGAA ATAGAACAGA TACAAGATGG GTAACATTAA CTAATAATAA TGGAGTAGGA
TTAATGGCAT CTGGAATACC TAGTATAGAA TTTAATGCAT TACAGTATAC TCCAGAAGAA
CTTTCATCCG GTAAACGTCA TCCACATGAG TTAGCAAAAG AAGATAGTAT AGTTTTAAGA
ATTAATCATA GACAAATGGG TGTTGGAGGA GATAACAGCT GGGGAGCAAC TCCACATAGA
GAATTTATGA ATGAATCTGG TAAAATATAT AATTATTCTT TTAAAATTAA AGGTATCGAC
AAAAGTTCTT CTCCAATGGA AATAAGTAAA AAGAATTTAA AGGAAGATTT AATTAAGGAT
ATTAAAATTG ATGGTGTTAG TTTAAGGGGC TTTAATGAAA ATATTACAGA GTATAACATA
GATTACTTAG AGAAAACATT GGAAAAACCT CCAGTAATAG AAGTTGTAAA AGCTAATGAT
AATATAGATG TAGAGGTAGA AAATGTATAT ACAATACCAG GAAAAGCAAC CATTAAGGTT
AATCATAAGG ATGAGCTTTT AAATAAAATT TATACTAAAG AATATGTTAT AAATTTTGGA
ACACATAATG TGGAATATCT TTCAGATATA AGTTGGAAAA GCGCAACTTC TGGCATGTAT
GAACCTGTAA AAGATAGGTC TGTAGTTAAT AATCCATTAG TGTTAAAAAT TGATGGAGAA
ATAAAAACTT TTGATAAAGG TGTGGGAGTA AACTCAAACT CAGAAATAGT AGTTGATTTA
AAAGGAAAAG GTTATGAAAA ATTTGAAGCT TATGTTGGCA TGGATAGAGG AGTAAGTGGA
TATGGATCTT CTATAGTAGC TAGAGTTGAG GTTGATGGCA AGGAGGTATT CAATAGTGAG
AAAATATATA GTACTTCTAA TTGTCATAAG GTTGAAGTAG ATTTAAAAGG TGCTGAAAAA
TTAGCATTAT ATATTGATGA CTATGACAAT AATATTAAGT ATGATCATGG AAATTGGGCT
GATGCTAAAT TTATAAAAGC TGAGGCAAAT AAAGATACTT CCTTAAAAGC TTTAAAAGTA
AATGGAGAAA ATTTAAAAGG GTTTAACACA AATACTTTTG AATATGATGT GAACCTTTCA
AAGGATTCTC ATATACCTAA GATTGAAGCT ATTGCAATGA ATGAAAATTC AAAAGTGTTT
ATTGAAGATG CATTAAGTAT TCCTGGCACA AGTTATATTA AGGTTATTTC AGAGGCAAGC
ACTATGAAAA CTTATAAAAT AAACTTTGAA TTAAATGGAA ATTCTACAGA AGATGATGAG
AAGCCTGGAG ATGTAGAAAA ACCAAATGAT AATGACTTAG ATGAAAATAA TAAGCCAGAA
GATTCTGAGG GATTACCAAA TACAGGACAA AGTGTTATTA ACTTTGCTAT TATTGGAATT
ATTTCATCTT TAGCAGGAGT AAAACTATTA AGAAGAAAAA ATAAATAA
 
Protein sequence
MFSKKTIANI LSIALFSNIV INNFSINEVL ANVKKVKIED GVFWANNPEK FEDNREKAHA 
TLMPFNSIEE ALNNPNYSDY SNSENYMSLN GEWKFNLVET YDKDIKDFYK TDFDSSSWNT
IPVPSSWQLH GYDQPRYNDT AYPWEYQDNI PEPPDVPTDY NPIGYYKKTF TIPEGWDNKE
VFVSFQGVES AYYLYINGEY VGYSEDSFTG HDFNIGKFLK EGENEISVKV HRWSDGSWLE
SQDMIKLSGI FRDVFLYSTP KAHIRDYTLV TDLDDKYRDS NLNVEIDISN YGIKAGKYKI
KGILYDENKK IVNEDISEFE LNDEENVLVS INTRVENPKK WTAETPNLYT YVIALENENG
EVVETISNKF GFRKIEIKNN QVCINGQPIS FKGVNRHEFL PDTGRTLTEE SMIEDIKLMK
KNNINAVRSS HYPNDPRWYD LCNEYGLYVM DEANLETHGR LDDIPQSRPE WTEAVIDRQR
SMLERSKNET SIIMWSLGNE SSGGENFEIA AKWIKENDPT RLVHYEAERT VGDVYSRMYR
TIEEMEAYAN DPDNKKPYIQ CEYAHGMGNS IGNLQKYWDV FDKYDIMQGG FIWDWADQAI
RMKDKNTGEE FLSYGGDWGD SEFTDGNFCA NGLVSADRTV QPELQEVKKV YQEIEIEDID
ILNGKVKIVN EHLFTNLNKY KGKWELRADD NILQNGELDI SVDPLSSKEF TIPFKKPELS
PGVEYWLNIS FELKDDEPWA EKGYVISKEQ FKLPFDNEME KGIDLNSMNS IELKNDENNV
QIIGDGFKVS FDKKLGALES YKIDKEGNEI ELIEEPIRPN YWRAPNDNDK GFGAEERFDT
WRYAGANAKV ENLEVIEVGD KAVKVNVDFI LPTNIESKLN VEYIVYGNGE VSVNNTLNAS
KGLSEIPEIG MMLKLPKEFD SITWYGRGPE ENYIDRNTGY DIGVYNKNVK DFFFPYLEPS
ETGNRTDTRW VTLTNNNGVG LMASGIPSIE FNALQYTPEE LSSGKRHPHE LAKEDSIVLR
INHRQMGVGG DNSWGATPHR EFMNESGKIY NYSFKIKGID KSSSPMEISK KNLKEDLIKD
IKIDGVSLRG FNENITEYNI DYLEKTLEKP PVIEVVKAND NIDVEVENVY TIPGKATIKV
NHKDELLNKI YTKEYVINFG THNVEYLSDI SWKSATSGMY EPVKDRSVVN NPLVLKIDGE
IKTFDKGVGV NSNSEIVVDL KGKGYEKFEA YVGMDRGVSG YGSSIVARVE VDGKEVFNSE
KIYSTSNCHK VEVDLKGAEK LALYIDDYDN NIKYDHGNWA DAKFIKAEAN KDTSLKALKV
NGENLKGFNT NTFEYDVNLS KDSHIPKIEA IAMNENSKVF IEDALSIPGT SYIKVISEAS
TMKTYKINFE LNGNSTEDDE KPGDVEKPND NDLDENNKPE DSEGLPNTGQ SVINFAIIGI
ISSLAGVKLL RRKNK
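
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind summarized in the GC content field; the helper names are hypothetical, and the example only processes the first row of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGTTTAGCA AAAAGACTAT TGCTAATATA TTAAGTATTG CTTTATTTTC AAATATTGTA"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this row only, not of the whole gene
```

Applying the same two functions to the complete gene sequence would give a figure to compare against the reported GC content.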