Gene Information |
Locus tag | CPF_1073 |
Symbol | |
ID | 4201929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1218551 |
End bp | 1224979 |
Gene Length | 6429 bp |
Protein Length | 2142 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638081954 |
Product | discoidin domain-containing protein |
Protein accession | YP_695519 |
Protein GI | 110801328 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
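The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1218551
END_BP = 1224979


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: total codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 6429
print(protein_length(length_bp))  # 2142
```

Both results match the Gene Length and Protein Length rows of the record.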
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.742063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAATATCTAA AATTCTTATT GCATCAATGA TAATGTCAAA TGCAAGTCCT ATTTTGAATG TTTATGCTAG TGAGGTAATA AAGGAAAAGA TAAGTTCTAT AGAAGAAAGC GTAATAAATC AAGCATCTAT AAATCAATTC AATTTAAATA GCTACGAGAA TTTTGATGCA TATAATCAAA AGTACAAAGT ACAGAGAAAT GAAATAGTTT CAATAACAAA TAATGGTGGA CAATATTCAT CAAGCTCTAT AGATAAAGCT ATAGATGGTG ATTTATCTAC TCACTGGGAA ACAGGAAGAC AAAATAGTTC TGATTTTACA AATGAAGTTG TAGTTGAATT TAATGATTTA GAATCAATAG ATAGAATAGC ATATGCAACA AGACAAGATG GAGCAAGAGG AAAGGGATAT CCAACAGAGT TTGATATATA TGCTTCACAA ACTGGAAATG ATGATGACTT TAAATTAGTT TCTCAAGGGA CAAGCAAGTC AACAGGAAAC CTTATGGAAT TTAAGTTTGA TACAGTACAA GCTAAGAAAA TTAAATTTGT CTTTAAAGTA GCAGATAGAG ACTGGGCAAG TGCTTCAGAG TTTTGGTTTT ATAAGGAAGA TAAAATCATG GATAAGGTGA ATTCTATATT CATAAATGAA GAGAAAAACA AGGTGAATCC AGAATTTGAC ACTATAGAAA AATTAGATGT TTTTGAAAAT GAGATTAAAG ATCATCCATT TTATGAGAGT TTTAGAGAAA TAATAGAAAA TGCTAAGTTA GTTTTACAAG GAAACAATGT AGTTTACAAG GATGCCTTAG TAAGTAAGTT TAAAGATTTT AATGATGAAT CATTAAAAAA ATATAATGAA CTATTTAAGA TAAATACATC ATCAATAACG ACTAATGGAG GAAATTATGC AGAGAATACA ATAGATAGAG CTATTGATGG CGATGTTAAT ACAAAATGGC ATTCTGGAAA GCAAAATAGT TCTGATTTTA CAAATGAAGT AATAATTACT TTAGATAAAT TAGAGACTTT AGATAGAGTA GTTTACACAA ACTTAAATAT GAGAGGATTT GCTGAGGCCT TTGATATATA TACATCAAAG ACAACTTCTG GAGATACTTT TGAAAAAGTA ACATCAGGAA GCAGTGAAGT AACAAAAGGA TCAATAGAAA TTAAGTTTAA TCCAACAGAG GCAAGAAGAG TTAAATTTGT ATTTAAAAAA GGATATGAAA ATTGGGCTTT AGCATCAGAA TTTACTTTCT ATAAAGAAGA TCAGTTAAGA GATAAAATGA GTAGATTATT CACAGATAGC ACTATGAGTC AAGTATCTGA AGAATTTAAT ACTTTAGAAA AAATAGAAAA GTTAGAATCA GAAGCTAAGG AGCATCCTTT TTATAATGAT TATAAAGAAG AATTAGAGAA TGCTAAGTTA ATTATTGAAA ATAAAGAGGT TCAATATACT GATGCTAAAG TATCTGACTT TTTAAATCCA GATAATGAAT TACTTAAAGC TTATGATGCT ATCTATAAGT TAGATAAAAG CAAAATAAAA TCAATAAAAA CCAATGGTGG ACAATATGCA TCAGAATCTA TAGATAAAGC TATAGATGGA GATTTCAATA CAAAATGGCA TTCAGGAAAG CAAAATACAG AAAACTTTAC AAATGAAGTT GAAATAGAAT TAGAAGAATT AACTACATTA GATAGAATAG TATATACAGC ACCTCGTGGT TCAAATAGAG GATTTGCTGA AGCTTTTGAT ATTTATGCAT CAAGAACAAC TAAGGGAGAT 
AATTATCAAA AGGTAACTAG TGGCAATGCT AATATAACTC AAAATTCTGT AGAAATAAAA TTTAATCCAA CAGAATTTAA GAGAATTAAA TTTGTATTCA AAAAAGGTTA TGAAAACTGG GCTTGTGCTA GTGAATTTGG ATTATATAAA GAAGATAAAA CATCAGATAA AGTTGATAAA CTATTTACAA ATGGATTAAT GAATGAATTA TCAGAAGACT TTAATACAGA AGAAAAATTA GCTTCATTAG AGGAAGAGGT AAAAAATCAT CCCTTAGTAA ATTTATATAA GGAAAAATTA GAACTTGCTA AAGAAGTATT AGCAGGAAAT ACAGGATCTA CTGTATTTGA ATTACAAAGT AGAGGAAACT CAATTAAAGA ATCTCAAAAA AGAAAGGTTT GGAATTTCCA AGACTGGCAA CCAACTGGCT ATGCAGTAAA ATCTGGACAA GTAATAACTG TTTATGTTGA TGTGGAAGAT GGAAAGCCAA CACCAAAATT AGTATTTAAA CAAATGGATA GTCAGCATAA CGGAGATGTT ACTATAAGTT TAAGCAAAGG TAAAAATGTA ATAACTATAC CAGAAAAACC TACAAATGAG TTAAGACCAG GAACAGCTAA GGCAGGTGTA CTTTATACAA GTAACCCATA CACTTCAGAA GAACAAGGAA GAAAGCCTAA AATAAGAATA GAGGGTGCTA TAAACTATCC AAACTATATA AAAGGCATTG ATAATGATGA AGAAGTTATG AATGATCTTG AAGAATATGT TGATTTATTA AAAAAGGATC CTCAATTACC AGATGTATTT GATGTATTTA GTGATAAAAC ATTAGTTAAC GTAACTGCAA CCTATGCTTT AAATTGGTAT AAAAATAATA ATAAATTACC AAGTGAAACT GCAAATAAAA GTGATGAAGT AATTAAAGAA ACTATGAAAT ATTGGGGATT TGATGAATCT AGTGAGGTTA ACTCTGACTT TAACTTTAGA TATATAAGTA TGCTTAAATG GTTAGACAAT GGAGGCTTTA TGAATGCTGG AAATGGAATA ACAGGATTTA ATAAAGCAGA GCAAGGTGGA GCGTTAGGCG TAGATACTGG TTGGGGATTT ATGCATGAAA TGGGTCATAA TTTTGACACT AATAATAGAA CTATAGTAGA GGTAACTAAT AATATGCTTC CACTTCATTT TGAGAGAATT AAAGGTGTAC CATCTAATAT AACTAGACAA AATTTATGGG AAAGAAACAT ATTACCAAAG GTGGCTTTAG ATGATTATTC TAATAATGAA TATTATCCAG AAAGTGATAA ATCTTTATTA TCTCATGTAG CCCCTTTATG GCAATTACAG TTATATGATA AAACTTTCTG GCCAAGATTT GAACAAGAAT TTAGAAGTAG AGATATAGGT GGTGGAAGTT GGGAAAATAA ACACAATGCA TGGGTTATGG CTGCATCAGA TGTGTTTAAG TTAGACTTAT CAGAGCATTT TGAAAGACAT GGAATGGATG TATGGAAAGA GACTAAGGAA TATACATCTA AATATCCTAA ACCATCTAAT AAATTATGGT ACGCAAATGA TAAAATGTAC TTAAATAAAG GTGGAGTATT TACAGAAAAT CTTAAGTTCG AGGCAGAAGC AAAAATAGTA AACGGAAATG ATGTATCAAT TTCTTTTGAT ATAGATAATG AAAATAAAAA CAATGTTATA GGATATGAAA TATCTAGGGA TGGCAAAACA ATAGGATTTA CATCAACAAA TAATTTTGTT GATCACGGAG CTAATATAGA TGAAAACCAT GAATACAGTA 
TAGTTGCTTA TGACAATGAA ATAAATCCTA GTAAGCCTTA TAATTTTAAA TTACATGCTC CAAGTATTAG TGCACAACAA AAGGTTTTAA TAGCGCTTAA TGAAGAGTTT AACCCATTGG ATTACGTAAA AGCTTATAAT TACGAAGGAA ATGATATAAG CAATAAAATT GAAGTAATTA AGAATACTGT TGATAACACT AAGAAAGGTG AATATGAAGT TGTTTATAAA GTCACTGATG AAGAGGAAAC TAAAGAAAAA GCTCTTAAAG TTGAAGTAGT TAGTACTTAT GACTATTTAT CAGATGAAGA GTGGGAATCA GTAGAAACTC AATGGGGAAC TCCAAGAAGA AATACAAATA TAAAAGGTAG AGTAAATGGA GAAATTAAAA CCTTTGATAA GGGCATAGGA ATACATGCTA ATGGAAAAGT TGTATATGAT TTAGAGGGAA AAGATTATGA CAGATTTGAA GCTTTATTAG GTGTTGATCA AACAATAGGA GCTAACGATA ATTCAAGTAT TTCTTTTAAG GTTATAGCAG ATGGAGAGAC TTTAGAAACA ACTAAGGTTT TAAAATATAA TGATAATATG GTAGAAATAA ATATTCCAGT TAAAGGAATT AACAAATTAG AGATACAAGT TAGTGATAGT GGAAATGGAA ATACATCAGA CCACGGTATA ATAGTAAACC CTAAGTTATC AACTAATAAT GTAAAGCCTA AAATAAAAAC TGAAGATGCA GTAATAAATG TTAAGGATAA TTTTAATATA TTAGATGGTG TTACTGCTAA TGATGTAGAA GATGGTGATT TAACATCAAG CATAAAAGTA AAATCAAGTG ATTTTGAAGC AAATAGAGGT GGAATATACA CTGTAGTTTA TGAAGTTACA GATAAAGACG GAAACACAGT TACTAAGGAA AGAAAAATAT ACACAATAAC TACAGCTGAA AGTTTAAGTG ATAAGGAATG GAAATCAGCA TCATCTGGAT GGAGAGATGT AAAAAAAGAC TTAAGTGTTG AAGATAATAG AATAACTTTA CTTGGTGAAA ATGGACAAGA AGTAGAATAT GATAAAGGTC TTGGAACTCA TGCAAACTCA GAGATAGTTT ATGATTTATC AGGTAAAAAT TATGGAATGT TTGAAACTTA TGTTGGTGTA GATAGAGAAA TGAGAAACTC TAATGAGCCA TCAGTAATAT TTGAAGTTTA CGTTGATGGA AAAAAAGTAT TTGATAGCGG TGTTATGAAT GTAAACACTG AAAGAAAGCA TGTTTTAATT CCAATAGCAG GAGCAAGTGA GCTTAAATTA GTCGCTAAAG ATGGTGGAAA TGGAAATGCC GGAGATCATG CAGACTGGGC AGATGCAAAA GTTTATACTA CAAGCGATAA ACCAATATTA ACTGGGGAAG AGGTAGCCTT AAATATAGGT GATTCATTTA ACCCATTACA AGGAATGGTT GCTAATGATC CTGAAGATGG AGACATAACA AAGAATATAA AAGTTATAAA TAATAATGTA AATACAAAAA GAGGAGGTAA CTATGTAGTA TCCTATGAAG TTACTGATTC TCATGGAAAT AAAACTACAC TAGATAGAAA AGTATCAGTT GTTAATGCTT ATGATTATAT AAGTGATAAG AACTGGAAAT CAGCTAATTC AGGTTGGAGA AGTGTACAAA AAGATAGAAG TGTTGAAAAT AATACCATAA CTTTACTTGG TGAAGATGGA CAAGAAGTGG AGTATAAGAA AGGTATTGGA ACTCATGCAA CTTCAACAAT AGTATATGAT TTATCTGAGG GAAATTATAA 
GTTCTTTGAA GCATATGTTG GTGTAGATAG AGAGATGAGA AATTCTGATG TATCATCTTT AAGTTTTGAA GTTTATGCTG ATGGAAAAAA AGTATTTGAT AGTGGTGTTA TGAATTCAGA GACTCCAAGA AAGCATGTTT TAATTCCAAT AGTTGGAGTA AGTGAACTTA AATTAGTAGC TAAAGATGGT GAAAATGGAA ATGGAGGAGA TCATGCAGAC TGGGCAGATG CTAAGTTATT ATATGCAGAC TCTAAGGATT TTACTGCTTT AGAAAGAATA GTTGAGGAAG CAAGAGGAGT AGATGAAAAT TTATATACAG AGGAAAGCTT TAATAAATTA CAAGTTGCTC TAGAAAAGGC TAATAAAGTA TTAGAAACTC CTAATCCAGA ACAAGAGGTA ATAGACTCTA CTATTATAGA GCTTAGAGAG GCTATGGATA ATTTAGAGGC AGCTATTGAT TTAACAGAGG AAGTTAATAT ACCAGATAAT GAATTAAAAA GGGCTATAAA AGATCAATTA AATATATCAA GTGATGTAAT AACAAGAGGA GATATGAATA AGTTAACAAA CTTATCAGCA GTTGGATATG GAATAGCAAA CTTAGAAGGT TTACAATATG CTGTAAATAT AGAAGACTTA AATCTTGACT GCAATGAAAT AAGAGATATA TCTAAAATAA AAGGTTTAAA GAAGCTTAAC AATGTAAGCA TAAAAGAACA ATATATAGTT ATAAGATCAC CAGAAGAAGT TGAAGGAAAA TATGTTATAA ATGAATCCTT TGTAGGAAAA GATGGAGAAA GATTAGCACC TAAAGAAATA AACATAAGAA GAAATACAGG TGGACAAAGT ATTGACATAT CAAATGTTGA TGTTGAAAGT TCTTTAAATA ATGGAAATTT AGAACTAGAT ACTAAGCTTT TTAAAGAAGG AGTTAGTGGC ATATCAGCAG TATATAAGGA TTTAGATGGG AAATATGTAG CTACGCTTTC AACTATAGTA AGTAGATAA
|
Protein sequence | MKKKISKILI ASMIMSNASP ILNVYASEVI KEKISSIEES VINQASINQF NLNSYENFDA YNQKYKVQRN EIVSITNNGG QYSSSSIDKA IDGDLSTHWE TGRQNSSDFT NEVVVEFNDL ESIDRIAYAT RQDGARGKGY PTEFDIYASQ TGNDDDFKLV SQGTSKSTGN LMEFKFDTVQ AKKIKFVFKV ADRDWASASE FWFYKEDKIM DKVNSIFINE EKNKVNPEFD TIEKLDVFEN EIKDHPFYES FREIIENAKL VLQGNNVVYK DALVSKFKDF NDESLKKYNE LFKINTSSIT TNGGNYAENT IDRAIDGDVN TKWHSGKQNS SDFTNEVIIT LDKLETLDRV VYTNLNMRGF AEAFDIYTSK TTSGDTFEKV TSGSSEVTKG SIEIKFNPTE ARRVKFVFKK GYENWALASE FTFYKEDQLR DKMSRLFTDS TMSQVSEEFN TLEKIEKLES EAKEHPFYND YKEELENAKL IIENKEVQYT DAKVSDFLNP DNELLKAYDA IYKLDKSKIK SIKTNGGQYA SESIDKAIDG DFNTKWHSGK QNTENFTNEV EIELEELTTL DRIVYTAPRG SNRGFAEAFD IYASRTTKGD NYQKVTSGNA NITQNSVEIK FNPTEFKRIK FVFKKGYENW ACASEFGLYK EDKTSDKVDK LFTNGLMNEL SEDFNTEEKL ASLEEEVKNH PLVNLYKEKL ELAKEVLAGN TGSTVFELQS RGNSIKESQK RKVWNFQDWQ PTGYAVKSGQ VITVYVDVED GKPTPKLVFK QMDSQHNGDV TISLSKGKNV ITIPEKPTNE LRPGTAKAGV LYTSNPYTSE EQGRKPKIRI EGAINYPNYI KGIDNDEEVM NDLEEYVDLL KKDPQLPDVF DVFSDKTLVN VTATYALNWY KNNNKLPSET ANKSDEVIKE TMKYWGFDES SEVNSDFNFR YISMLKWLDN GGFMNAGNGI TGFNKAEQGG ALGVDTGWGF MHEMGHNFDT NNRTIVEVTN NMLPLHFERI KGVPSNITRQ NLWERNILPK VALDDYSNNE YYPESDKSLL SHVAPLWQLQ LYDKTFWPRF EQEFRSRDIG GGSWENKHNA WVMAASDVFK LDLSEHFERH GMDVWKETKE YTSKYPKPSN KLWYANDKMY LNKGGVFTEN LKFEAEAKIV NGNDVSISFD IDNENKNNVI GYEISRDGKT IGFTSTNNFV DHGANIDENH EYSIVAYDNE INPSKPYNFK LHAPSISAQQ KVLIALNEEF NPLDYVKAYN YEGNDISNKI EVIKNTVDNT KKGEYEVVYK VTDEEETKEK ALKVEVVSTY DYLSDEEWES VETQWGTPRR NTNIKGRVNG EIKTFDKGIG IHANGKVVYD LEGKDYDRFE ALLGVDQTIG ANDNSSISFK VIADGETLET TKVLKYNDNM VEINIPVKGI NKLEIQVSDS GNGNTSDHGI IVNPKLSTNN VKPKIKTEDA VINVKDNFNI LDGVTANDVE DGDLTSSIKV KSSDFEANRG GIYTVVYEVT DKDGNTVTKE RKIYTITTAE SLSDKEWKSA SSGWRDVKKD LSVEDNRITL LGENGQEVEY DKGLGTHANS EIVYDLSGKN YGMFETYVGV DREMRNSNEP SVIFEVYVDG KKVFDSGVMN VNTERKHVLI PIAGASELKL VAKDGGNGNA GDHADWADAK VYTTSDKPIL TGEEVALNIG DSFNPLQGMV ANDPEDGDIT KNIKVINNNV NTKRGGNYVV SYEVTDSHGN KTTLDRKVSV VNAYDYISDK NWKSANSGWR SVQKDRSVEN NTITLLGEDG QEVEYKKGIG THATSTIVYD 
LSEGNYKFFE AYVGVDREMR NSDVSSLSFE VYADGKKVFD SGVMNSETPR KHVLIPIVGV SELKLVAKDG ENGNGGDHAD WADAKLLYAD SKDFTALERI VEEARGVDEN LYTEESFNKL QVALEKANKV LETPNPEQEV IDSTIIELRE AMDNLEAAID LTEEVNIPDN ELKRAIKDQL NISSDVITRG DMNKLTNLSA VGYGIANLEG LQYAVNIEDL NLDCNEIRDI SKIKGLKKLN NVSIKEQYIV IRSPEEVEGK YVINESFVGK DGERLAPKEI NIRRNTGGQS IDISNVDVES SLNNGNLELD TKLFKEGVSG ISAVYKDLDG KYVATLSTIV SR
|
| |
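The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, in plain Python, of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids and differs only in which start codons it allows; this record starts with ATG, so that difference does not matter here.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed identical for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
sample = clean_sequence("ATGAAGAAAA AAATATCTAA AATTCTTATT")
print(translate(sample))  # MKKKISKILI
```

Run on the full gene sequence, the same functions can be used to compare the translation with the Protein sequence field and the GC fraction with the GC content field.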