Gene CPF_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1472 
Symbol 
ID4203492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1660597 
End bp1665363 
Gene Length4767 bp 
Protein Length1588 aa 
Translation table11 
GC content31% 
IMG OID638082350 
Productputative alpha-N-acetylgalactosaminidase 
Protein accessionYP_695915 
Protein GI110799557 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAAGTTA ATAAACTTTC AAAAAAATCA TTATCTTTAA CATTAGGGCT CAGTGTGGCT 
CTAACTAACA CAGCACCATT ATCAGTATTA GCAAATGGAT CCAAGGTAGA GGATGTTGTA
GCTAGTAAGA GCCAAGATAA AATCACTATT GGAAACGAGT ACATTTCTAG AGAGTTTTCT
ATTGAAAACG GGAAGGTATT AACAAGTGAA ATAGAAAATA GCAGAGCAAA TACTACTTTA
GTTCCACAAT TAGGTTCAGA AGATTTTATT ATAAATACAA TACAAGAAAA TTCTGATTTA
CCAGAAGTAG AAGCTAATTC TCCAAAGGAA GTTTTAGATA GAGCTAACTG GAATGCAACC
CTTACAGCTA ATAGTGGAAC AGCATATCCA GCTACAGATA TTGAAAAATT ATTCGATGGA
GATAAAAATA CTTATATAGA TAGTTATAAT ATAACTGGTT ATCCAACATC ATTAAAAATC
GATTTAGGAG AAGTTAAAAC AGTATCTTCA TTTTCTTACC AAAAAAGACC TGGATTTACT
GATGCTAATT ATGGCAAAAA TGGAACTATG GGACAGTACA AGCTTTATGT AAGTGAAGAT
GGAGAAAATT GGACAGAAGC TGGAGAGGGT GAGTTTACTA GAGAAGACTT TAACTTGCAT
CAAGAAGGTA ATCTTCACAA TGTAGGAGAT GTTGTTTATG GTAATTTTGA TAAAACTTAT
GAAAGTAGAT ATATAAGAAT AGATCAATTA TCAGATGCCT TAGGCAATAC ACAGGAGTTT
TCAGGAGCAG AAATAAATTT ATATTCAGAT AAATATGAAA AACCAGAGGA ACCAATAGCA
CCTAAAACTG CCATTAAATC AAGTGATTTA ACAATAGATA CTTCCTCTAC AAAGATAGAA
GATATTGAAA ATGGGAAAAA GCTTACTATT TCATATGAGC CATATGATTT TAATGGAACT
GAGTATGATA TAGATATGGT TACTGTTCTT GAAAATGATG ACCACTATAT GAGATCTTTC
CTTGAAATCA AAACTAATAA TGAAAATGCA AAGATAGATT ATATTGATTT AGATCACTTT
ATATTAGAAG ATGGCATAAA GGATACTTTA TGGTCCCATC CAGATTTAGA AGATGTAACT
TCCATATTAA TAGGAAAAAA TGAATTGATG CTAGGGCAGC CTATTTATGC CAATGGTATG
TTCTTTGGAT CAGAATTCCC AGCTACAGAT ACTGATGTAG TGAATGATGG AATGCAAATT
AGATATTATA GTGGTAAAAG TTTTGAAGAG TTAGAAAGGG ATAATCAATT AACTACAGAT
GGAAAGTTTG TATCATGGCA AAATGTTGTG GGAGCTGCTA AGGGAGTAGA TACAGATGTT
GTTCAAACTG ATTTCTTTGA ATATATATCA GAAATAGCTA CACCAACTGA TTTTAGAAAG
CAATATAATT CATGGTATGA TAATATGCTT ACAATAACTG ATGAAAGTAT TTCTAAATCT
TTCTATGGAA CAGAAAAAGG ATTAACAGAA AATGGAGTGG AACCAATTGA TTCTTATGTA
GTAGACGATG GATGGCATAA CTATAGAGAT CCAGAATTTA ACCCTAATAT TTCAAAAGAA
CAAGCTGGAA ATAGTATGAA TAGAACTGGA TTTTGGGAGT TTAATGATAA GTTTCCTAAT
GAGTTATATA CATCTACAGA GTTAACAAGT AAATTTCAAT CAAAATTTGG ATTATGGCTT
GGACCACAAG GAGGATATAA TTTCTATGGT GGTTTTGCAA GATACTTAGA AAAAATGGGA
ACTGGATATG CTCAAACTAA TAATGGGGTA AATGTTTGCG TTGGTTCAGA TAGGTATATT
AAGAATTTAA CAAGTCTTTT CTTGGATTAC CAAGAGAGAT TTGATATAGA TTATTGGAAA
CTAGATGGAT TTGCTCTTAG ACCTTGTACA AGTGAAAATC ATGATCACAT GACTGGTGGA
CACAACAATA TGTATTATAC AACAGACCTA TGGGAAAAGT GGACAGATGC TTGGGAAACA
ATGAGAGCTA GTAGAGCAGA AGAAGGAAAG GACCTATTTA TAAATGCAAC TTGTTATGTA
AACTTAAGTC CATGGATTCT TCAATGGGTA AATACTGTAT GGATACAAAA TTCACAAGAT
ACAGGAGAAG CTGGTACAGG TTCAAGACAT CAACAAAAAA TAACTTATAG AGATGCTGTT
TATCATGATA TTTATAAATC AAACCAAATA CAATTCCCTG CAAAGAATAT ATATAACCAT
GAGCCAATAT ATGGTGTATC TGATGGAAGT TTTGCAACTA CAGAGGACTT TAGAGATTTC
TTATTTGCTA ATGCAGTTCG TGGAACAGCA TTCTGGGAAT TATATTACTC ACCATCAATA
ATGGATGATG AAAAGTGGAA AGTTAATGCA GATGTTTTAG ACTTTGTGGA AAATAATTTT
AATGTTTTAG AAAAGGCTAA GTTATTCGGA CATAGAGCTA CAGAGGGTGT TTATGGATAT
TCTGCATGGG ATGGCAATGA AGGTATAGTT TCATTTAGAA ATCCAACGGG AGAGACTAAA
GAGTATACTT TAGAGTTAAT TGATATAGTA GGAGTTCCAA AGTCAGTATC AAATCTTAAA
GGAAATCAAG TATTACCATA TAAAGTTGGT GATATAGGAT CAGTATCTTA TGGAGATAGT
ATAACTGTAA CTTTAGAGCC ATATGAAACT AGAATACTTC AATATGGAAA GGTAGATAAT
AAAGCTTCTG AAATAGTTTC TGCTAAGGTA ACTGGAGAAA ATGAAATAAC TATTAAGTAC
AATGAGCGTG TTGAAAATGG AGAAGGAGTT TATTCAGTTG AAGGAAATGA AATTATTGAA
TCTAAATTAT TAGATGATTA TAGAACAGTG GTTATAAGTA CTTCTAATAA ACTTCAAGGA
GAGGCTAAAT TAAATATTAA TGGAGAAAGA GATGCTCTTC AAAATCCACT TACAACAAGT
TTAACAATTC CTGTTTATAA TAATGGAAAA ATTGTTTCAG TACAAAGTGG AGAGGAATTA
ATAGGTGGAG AAAATATAAA TAAGAAGTAT AATGGAAATT CAGACACATA CTTCTTAGAA
ATGAATAAAG CTTATGAGGT TGATACAAAT GAAAGCTTTA AAGGAAAAAC TGATTTTGCA
ATATCTATGG CTATTAATAC TACATCATCT AATGTAAACT TATTAAGCCA AGGAGAGGAA
ATAAAGCTAT CAATTGATGA AGAAGGATAT GTAAACTTCA AAGTTAAAGA TTTAAATCTT
ACTTCAAAGT CAGAAGTAAC TACTGTTACA GAAAAGGCTC ATGGAACTTT TGGAACTAAT
GAATATGTAG AAACTTCAAC TGAATCAACT TTTGTTGGTA AAGTAAATGA TGGAAAATTA
CATCATATAT CTGCAGTTAG AGAGGTTAAC GGAGTTCTTA AAGTTTATAT TGATGGTGAA
TTAATGAATT CTGTATATGA TAAGTCATTA ATAAATCAAG AGATAAATGG TGGAACAATT
ATGGTTTCTG ATAATAACTT TACTGGAGTT ATAGGAGATA TTGAATTAAG AAATAGTTCA
ATATACTATG ATGAAGCAAA AGAACTTTAT AATTCTTATG AAGGTTCTGA CATAATTGAG
TATAGCAGAG AAAATTGGGC TACAAGTGCT TGTTCAGAAA TGAACCCACC AAGTGGCAAT
GATGGTCCTG CTTCATATGC TATTGATGGA AATGAATCTA CTTTATGGCA TACAAATTAT
GTTGGTGGAG ATAATCATAG TGGAAATCAT TGGCTTTCTG TAGATTTTGG TGAAGAATTA
GACTTTGATA CAGTTAATGT ATTATCTCGT GGAAAAGAAA TAAATGGTTC AATTAAAGGA
TATAAGTTAG AGGCTAATAT AAATGGAGAA TGGGCATTTG TTAAAGAAGG AGAATTCACA
GATGGAGTTA AGGAAAAAAT AGAGTTAGAT GAAGCTATAA AAGCAAGTGG AATTAAATTA
ACAGCCCTTT CTACATTTAA TGGACAAAAC TTTGCAGCTG TAAGAGAAAT AAGTGTTACT
AAGAAGGATA GGGAAGCAAC AGAGGATGAA ATAAATGAAT TAAAAGCTTT AGTTAAGGAA
ATAAATAAAG AGGATTATAC TAAGGCAACA GCTAATAGAT ATATTAAGGT AGCAGAAAAA
GTAAATGCAT TAGATAAGAT TAATTTATCA CAATTAGATA GATTAAGAAG TGATTTAAAT
AAAGCTTATG AAGGATTAGT AGAAGCTAGA GAATTAAATA AAACTTTAGT AGAAGCTGGT
AAATTAATTA AAGAAGATTA TACAATTGAA AGCTGGACAG TATTTGAAGA GGCATTAAAT
CATGCAAAAG ACTTAAACAA TGATATAGAA GCTACTAAGG AAGGGGTAGA TGAAGCTACA
TCAAGATTAA AAGAAGCTAT GAATTCTCTT GAGAAGGTTA ATAATGAAGG AGAAATAGTT
AATGGACCTG TGAATAATTT TGAGGCTTCA GAAATAGCTA AGAAGAACGT AACTGTTACT
TGGAGTGCTC CAGAATCTAC AAAGGGCTTA GAAGGATATG CTCTTTATAA AGATGGCAAA
AAGGTAGCTG AAATAGGTGC AGAAGAAACA TCATATAAAT TTAAAGGATT AAACAGACAT
ACAATCTATA ATTTCAAGAT TGCTGCTAAA TATTCTAATG GAGAACTTTC AAAGAAGGAA
AGTATAACTT TAAGAACTGC TAGATAA
 
Protein sequence
MQVNKLSKKS LSLTLGLSVA LTNTAPLSVL ANGSKVEDVV ASKSQDKITI GNEYISREFS 
IENGKVLTSE IENSRANTTL VPQLGSEDFI INTIQENSDL PEVEANSPKE VLDRANWNAT
LTANSGTAYP ATDIEKLFDG DKNTYIDSYN ITGYPTSLKI DLGEVKTVSS FSYQKRPGFT
DANYGKNGTM GQYKLYVSED GENWTEAGEG EFTREDFNLH QEGNLHNVGD VVYGNFDKTY
ESRYIRIDQL SDALGNTQEF SGAEINLYSD KYEKPEEPIA PKTAIKSSDL TIDTSSTKIE
DIENGKKLTI SYEPYDFNGT EYDIDMVTVL ENDDHYMRSF LEIKTNNENA KIDYIDLDHF
ILEDGIKDTL WSHPDLEDVT SILIGKNELM LGQPIYANGM FFGSEFPATD TDVVNDGMQI
RYYSGKSFEE LERDNQLTTD GKFVSWQNVV GAAKGVDTDV VQTDFFEYIS EIATPTDFRK
QYNSWYDNML TITDESISKS FYGTEKGLTE NGVEPIDSYV VDDGWHNYRD PEFNPNISKE
QAGNSMNRTG FWEFNDKFPN ELYTSTELTS KFQSKFGLWL GPQGGYNFYG GFARYLEKMG
TGYAQTNNGV NVCVGSDRYI KNLTSLFLDY QERFDIDYWK LDGFALRPCT SENHDHMTGG
HNNMYYTTDL WEKWTDAWET MRASRAEEGK DLFINATCYV NLSPWILQWV NTVWIQNSQD
TGEAGTGSRH QQKITYRDAV YHDIYKSNQI QFPAKNIYNH EPIYGVSDGS FATTEDFRDF
LFANAVRGTA FWELYYSPSI MDDEKWKVNA DVLDFVENNF NVLEKAKLFG HRATEGVYGY
SAWDGNEGIV SFRNPTGETK EYTLELIDIV GVPKSVSNLK GNQVLPYKVG DIGSVSYGDS
ITVTLEPYET RILQYGKVDN KASEIVSAKV TGENEITIKY NERVENGEGV YSVEGNEIIE
SKLLDDYRTV VISTSNKLQG EAKLNINGER DALQNPLTTS LTIPVYNNGK IVSVQSGEEL
IGGENINKKY NGNSDTYFLE MNKAYEVDTN ESFKGKTDFA ISMAINTTSS NVNLLSQGEE
IKLSIDEEGY VNFKVKDLNL TSKSEVTTVT EKAHGTFGTN EYVETSTEST FVGKVNDGKL
HHISAVREVN GVLKVYIDGE LMNSVYDKSL INQEINGGTI MVSDNNFTGV IGDIELRNSS
IYYDEAKELY NSYEGSDIIE YSRENWATSA CSEMNPPSGN DGPASYAIDG NESTLWHTNY
VGGDNHSGNH WLSVDFGEEL DFDTVNVLSR GKEINGSIKG YKLEANINGE WAFVKEGEFT
DGVKEKIELD EAIKASGIKL TALSTFNGQN FAAVREISVT KKDREATEDE INELKALVKE
INKEDYTKAT ANRYIKVAEK VNALDKINLS QLDRLRSDLN KAYEGLVEAR ELNKTLVEAG
KLIKEDYTIE SWTVFEEALN HAKDLNNDIE ATKEGVDEAT SRLKEAMNSL EKVNNEGEIV
NGPVNNFEAS EIAKKNVTVT WSAPESTKGL EGYALYKDGK KVAEIGAEET SYKFKGLNRH
TIYNFKIAAK YSNGELSKKE SITLRTAR