Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0166 |
Symbol | colA |
ID | 4201122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 197623 |
End bp | 200937 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 638081047 |
Product | collagenase |
Protein accession | YP_694630 |
Protein GI | 110798785 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA ACTTAAAAAG GGGAGAGCTA ACGAAACTAA AGTTAGTTGA AAGATGGTCA GCTACCTTTA CTTTAGCAGC ATTTATTTTA TTTAATAGCT CGTTTAAAGT ATTTGCAGCT GATAAAAAAA TAGAGAATAG TAATAATGGA CAGATTACTA GAGAGATTAA TGCTGATCAG ATTTCTAAAA CAGAATTAAA TAATGAGGTA GCTACAGACA ATAATAGACC ATTAGGCCCT AGTATTGCTC CATCAAGAGC AAGAAACAAC AAGATCTATA CATTCGATGA ACTTAACAGA ATGAATTATA GTGATCTAGT TGAATTAATA AAAACAATAA GTTATGAAAA CGTACCAGAC TTATTTAATT TTAATGATGG TTCATATACT TTCTTTAGTA ATAGAGATCG TGTACAAGCT ATAATATATG GTCTAGAGGA TAGTGGAAGA ACTTATACAG CAGATGATGA TAAGGGAATT CCAACTTTAG TTGAGTTTTT AAGAGCTGGA TATTATTTAG GATTTTACAA TAAACAATTA TCATACCTAA ATACACCACA GTTAAAAAAT GAGTGTTTAC CAGCTATGAA AGCGATTCAA TATAATAGTA ATTTTAGATT AGGAACAAAG GCGCAAGATG GAGTTGTTGA GGCTTTAGGA AGACTTATAG GTAATGCTTC AGCAGATCCA GAAGTTATTA ATAATTGCAT ATATGTCTTA AGTGATTTTA AAGATAATAT AGATAAGTAT GGTTCGAACT ATAGCAAGGG AAATGCAGTA TTCAACCTTA TGAAAGGTAT TGATTATTAC ACTAATTCAG TAATATACAA TACTAAGGGA TATGATGCTA AAAACACTGA GTTTTATAAT AGAATAGATC CATATATGGA AAGATTAGAA AGTTTATGTA CAATAGGTGA TAAGTTAAAT AATGATAATG CTTGGTTTGT AAATAATGCC TTATATTACA CAGGTAGAAT GGGTAAGTTT AGAGAAGACC CATCAATATC TCAAAGAGCT TTAGAAAGAG CTATGAAGGA GTATCCTTAT TTATCATATC AATATATTGA AGCTGCCAAT GATTTAGATT TAAATTTTGG TGGCAAAAAT TCATCAGGAA ATGATATAGA TTTCAATAAG ATAAAAGCAG ATGCAAGGGA AAAATATCTT CCAAAAACAT ATACTTTTGA TGATGGCAAA TTTGTAGTAA AAGCTGGTGA TAAAGTAACA GAAGAAAAGA TAAAAAGATT ATACTGGGCT TCAAAGGAAG TTAAGGCTCA ATTTATGAGA GTAGTTCAAA ATGATAAGGC TTTAGAAGAG GGAAATCCAG ATGATATTTT AACTGTTGTT ATTTATAACT CACCAGAAGA GTATAAGTTA AATCGTATAA TAAATGGATT TAGTACTGAT AATGGTGGTA TATATATTGA AAACATAGGA ACTTTCTTTA CTTATGAAAG AACACCAGAG GAAAGTATAT ATACATTAGA AGAATTATTC CGTCATGAAT TTACTCACTA TCTTCAAGGT AGATATGTAG TTCCTGGAAT GTGGGGACAA GGAGAATTCT ATCAAGAGGG AGTTTTAACT TGGTATGAAG AAGGAACAGC AGAGTTCTTT GCAGGTTCAA CTAGAACTGA TGGAATAAAA CCAAGAAAAT CAGTTACACA AGGATTAGCT TACGATAGAA ATAATAGGAT GTCTTTATAT GATGTATTAC ATGCTAAATA TGGCTCATGG GATTTCTATA ATTATGGATT TGCTTTATCA AACTACATGT ACAACAATAA CATTGGAATG TTTAATAAGA TGACAAATTA CATAAAGAAT AATGATGTAT CTGGTTATAA AGATTATATT GCATCAATGA GTAGTGATTA CGGATTAAAT GATAAATATC AAGACTATAT AGATTCTTTA TTAAATAATA TTGACAACTT AGATGTTCCT TTAGTTTCAG ATGAATATGT AAATGGACAT GAAGCTAAGG ATATAAATGA AATAACTAAT GATATAAAAG AAGTTTCAAA TATAAAAGAT CTTTCTAGTA ATGTTGAAAA GTCTCAATTC TTTACTACTT ACGATATGAG AGGAACATAT GTAGGGGAAA GAAGCCAAGG GGAAGAAAAT GACTGGAAAG ATATGAATTC TAAGTTAAAT GATATATTAA AAGAATTATC TAAAAAGAGC TGGAATGGGT ATAAAACTGT TACTGCATAC TTTGTAAACC ATAAAGTAGA TGAAAATGGT AACTATGTTT ATGATGTTGT ATTCCATGGA ATGAATACAG ATACAAATAC TGATGTTCAT GTAAATAAAG AGCCTAAGGC TGTTATAAAA TCTGATTCTT CAGTAATAGT TGAAGAAGAA ATAAACTTTG ATGGAACAGA GTCAAAAGAT GAAGATGGTG AAATTAAAGC TTATGAATGG GACTTTGGAG ATGGAGAAAA ATCTAATGAG GCTAAAGCTG CTCATAAATA TAATAAAACT GGAGAATATG AAGTAAAATT AACAGTTACA GATAATAACG GTGGAATAAA TACTGAAAGT AAAAAGATAA AAGTAGTAGA AGATAAACCT GTTGAAGTTA TAAATGAAAG CGAGCCTAAC AATGATTTTG AAAAGGCTAA CCAAATAGCT AAATCTAATA TGTTAGTTAA GGGTACTTTA TCAGAAGAGG ATTATTCAGA TAAATATTAT TTTGATGTAG CTAAAAAAGG CAATGTTAAA ATCACTCTTA ATAATTTAAA TTCAGTAGGA ATAACTTGGA CACTTTATAA AGAGGGAGAC CTAAACAATT ATGTTTTATA TGCAACTGGA AATGATGGAA CAGAATTAAA GGGTGAAAAG ACTTTAGAGC CTGGAAGATA CTACTTAAGT GTATATACTT ATGATAATCA ATCAGGAGCT TACACAGTAA ATGTAAAAGG AAAACTTAAA AATGAAGTTA AAGAAACAGA AAAGGATGCT ATAAAAGAAG TTGAAAATAA CAATGATTTT GATAAAGCTA TGAAGGTAGA TAGTAATAGC AAAATAGTTG GAACATTAAG CAATGATGAT CTTAAGGATA TTTATAGCAT AGATATACAA AATCCAAGTG ACTTAAACAT AGTAGTTGAA AACTTAGATA ATATAAAAAT GAACTGGTTA TTATATTCAG CTGATGATTT AAGTAACTAT GTGGATTACG CTAACGCAGA TGGAAATAAA TTAAGTAACA CTTGTAAGTT AAATCCAGGT AAATATTACT TATGTGTTTA TCAATTTGAA AACTCAGGTA CTGGAAATTA CACAATAAAC TTACAAAACA AATAA
|
Protein sequence | MKKNLKRGEL TKLKLVERWS ATFTLAAFIL FNSSFKVFAA DKKIENSNNG QITREINADQ ISKTELNNEV ATDNNRPLGP SIAPSRARNN KIYTFDELNR MNYSDLVELI KTISYENVPD LFNFNDGSYT FFSNRDRVQA IIYGLEDSGR TYTADDDKGI PTLVEFLRAG YYLGFYNKQL SYLNTPQLKN ECLPAMKAIQ YNSNFRLGTK AQDGVVEALG RLIGNASADP EVINNCIYVL SDFKDNIDKY GSNYSKGNAV FNLMKGIDYY TNSVIYNTKG YDAKNTEFYN RIDPYMERLE SLCTIGDKLN NDNAWFVNNA LYYTGRMGKF REDPSISQRA LERAMKEYPY LSYQYIEAAN DLDLNFGGKN SSGNDIDFNK IKADAREKYL PKTYTFDDGK FVVKAGDKVT EEKIKRLYWA SKEVKAQFMR VVQNDKALEE GNPDDILTVV IYNSPEEYKL NRIINGFSTD NGGIYIENIG TFFTYERTPE ESIYTLEELF RHEFTHYLQG RYVVPGMWGQ GEFYQEGVLT WYEEGTAEFF AGSTRTDGIK PRKSVTQGLA YDRNNRMSLY DVLHAKYGSW DFYNYGFALS NYMYNNNIGM FNKMTNYIKN NDVSGYKDYI ASMSSDYGLN DKYQDYIDSL LNNIDNLDVP LVSDEYVNGH EAKDINEITN DIKEVSNIKD LSSNVEKSQF FTTYDMRGTY VGERSQGEEN DWKDMNSKLN DILKELSKKS WNGYKTVTAY FVNHKVDENG NYVYDVVFHG MNTDTNTDVH VNKEPKAVIK SDSSVIVEEE INFDGTESKD EDGEIKAYEW DFGDGEKSNE AKAAHKYNKT GEYEVKLTVT DNNGGINTES KKIKVVEDKP VEVINESEPN NDFEKANQIA KSNMLVKGTL SEEDYSDKYY FDVAKKGNVK ITLNNLNSVG ITWTLYKEGD LNNYVLYATG NDGTELKGEK TLEPGRYYLS VYTYDNQSGA YTVNVKGKLK NEVKETEKDA IKEVENNNDF DKAMKVDSNS KIVGTLSNDD LKDIYSIDIQ NPSDLNIVVE NLDNIKMNWL LYSADDLSNY VDYANADGNK LSNTCKLNPG KYYLCVYQFE NSGTGNYTIN LQNK
|
| |