Gene Information |
Locus tag | CPR_0162 |
Symbol | colA |
ID | 4205051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 199196 |
End bp | 202510 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642564717 |
Product | collagenase |
Protein accession | YP_697499 |
Protein GI | 110803242 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
| |
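The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return gene_length, codons - 1  # drop the stop codon


# Values taken from the record for CPR_0162 (colA).
gene_bp, protein_aa = cds_lengths(199196, 202510)
print(gene_bp, protein_aa)  # 3315 1104
```

The result agrees with the Gene Length (3315 bp) and Protein Length (1104 aa) fields listed in the record.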
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA ACTTAAAAAG GGGAGAGCTA ACGAAACTAA AGTTAGTTGA AAGATGGTCA GCTACCTTTA CTTTAGCAGC ATTTATTTTA TTTAATAGCT CGTTTAAAGT ATTTGCAGCT GATAAAAAAA TAGAGAATAG TAATAATGGA CAGATTACTA GAGAGATTAA TGCTGATCAG ATTTCTAAAA CAGAATTAAA TAATGAGGTA GCTACAGACA ATAATAGACC ATTAGGACCT AGTATTGCTC CATCAAGAGC AAGAAACAAC AAGATCTATA CATTCGATGA ACTTAACAGA ATGAATTATA GTGATCTAGT TGAATTAATA AAAACAATAA GTTGTGAAAA CGTACCAGAC TTATTTAATT TTAATGATGG TTCATATACT TTCTTTAGTA ATAGAGATCG TGTACAAGCT ATAATATATG GTCTAGAGGA TAGTGGAAGA ACTTATACAG CAGATGATGA TAAGGGAATT CCAACTTTAG TTGAGTTTTT AAGAGCTGGA TATTATTTAG GGTTTTACAA TAAACAATTA TCATACCTAA ATACACCACA GTTAAAAAAT GAGTGTTTAC CAGCTATGAA AGCGATTCAA TATAATAGTA ATTTTAGATT AGGAACAAAG GCGCAAGATG GAGTTGTTGA GGCTTTAGGA AGACTTATAG GTAATGCTTC AGCAGATCCA GAAGTTATTA ATAATTGCAT ATATGTCTTA AGTGATTTTA AAGATAATAT AGATAAGTAT GGTTCGAACT ATAGCAAGGG AAATGCAGTA TTCAACCTTA TGAAAGGTAT TGATTATTAT ACTAATTTAG TAATATACAA TACTAAGGGA TATGATGCTA AAAACACTGA GTTCTATAAT AGAATAGATC CATATATGGA AAGATTAGAA AGTTTATGTA CAATAGGTGA TAAGTTAAAT AATGATAATG CTTGGTTTGT AAATAATGCC TTATATTACA CAGGTAGAAT GGGTAAGTTT AGAGAAGACC CATCAATATC TCAAAGAGCT TTAGAAAGAG CTATGAAGGA GTATCCTTAT TTATCATATC AATATATTGA AGCTGCCAAT GATTTAGATT TAAATTTTGG TGGCAAAAAT TCATCAGGAA ATGATATAGA TTTTAATAAG ATAAAAGCAG ATGCAAGGGA AAAATATCTT CCAAAAACAT ATACTTTTGA TGATGGCAAA TTTGTAGTAA AAGCTGGTGA TAAAGTAACA GAAGAAAAGA TAAAAAGATT ATACTGGGCT TCAAAGGAAG TTAAGGCTCA ATTTATGAGA GTAGTTCAAA ATGATAAGGC TTTAGAAGAG GGAAATCCAG ATGATATTTT AACTGTTGTT ATTTATAACT CACCAGAAGA GTATAAGTTA AATCGTATAA TAAATGGATT TAGTACTGAT AATGGTGGTA TATATATTGA AAACATAGGA ACTTTCTTTA CTTATGAAAG AACACCAGAG GAAAGTATAT ATACATTAGA AGAATTATTC CGTCATGAAT TTACTCACTA TCTTCAAGGT AGATATGTAG TTCCTGGAAT GTGGGGACAA GGAGAATTCT ATCAAGAGGG AGTTTTAACT TGGTATGAAG AAGGAACAGC AGAGTTCTTT GCAGGTTCAA CTAGAACTGA TGGAATAAAA CCAAGAAAAT CAGTTACACA AGGATTAGCT TACGATAGAA ATAATAGGAT GTCTTTATAT GATGTATTAC ATGCTAAATA TGGCTCATGG GATTTCTATA ATTATGGATT TGCTTTATCA AACTACATGT ACAACAATAA CATTGGAATG 
TTTAATAAGA TGACAAATTA CATAAAGAAT AATGATGTAT CTGGTTATAA AGATTATATT GCATCAATGA GTAGTGATTA CGGATTAAAT GATAAATATC AAGACTATAT AGATTCTTTA TTAAATAATA TTGACAACTT AGATGTTCCT TCAGTTTCAG ATGAATATGT AAATGGACAT GAAGCTAAGG ACATAAATGA AATAACTAAG GATATAAAAG AAGTTTCAAA TATAAAAGAT CTTTCTAGTA ATGTTGAAAA GTCTCAATTC TTTACTACTT ACGATATGAG AGGAACATAT GTAGGGGGAA GAAGCCAAGG GGAAGAAAAT GACTGGAAAG ATATGAATTC TAAGTTAAAT GATATATTAA AAGAATTATC TAAAAAGAGC TGGAATGGGT ATAAAACTGT TACTGCATAC TTTGTAAACC ATAAAGTAGA TGAAAATGGT AACTATGTTT ATGATGTTGT ATTCCATGGA ATGAATACAG ATACAAATAC TGATGTTCAT GTAAATAAAG AGCCTAAGGC TGTTATAAAA TCTGATTCTT CAGTAATAGT TGAAGAAGAA ATAAACTTTG ATGGAACAGA GTCAAAAGAT GAAGATGGTG AAATTAAAGC TTATGAATGG GACTTTGGAG ATGGAGAAAA ATCTAATGAG GCTAAAGCTG CTCATAAATA TAATAAAACT GGAGAATATG AAGTAAAATT AACAGTTACA GATAATAATG GTGGAATAAA TACTGAAAGT AAAAAGATAA AAGTAGTAGA AGATAAACCT GTTGAAGTTA TAAATGAAAG TGAGCCTAAC AATGATTTGG AAAAAGCTAA CCAAATAACT AAATCTAATA TGTTAGTTAA GGGTACTTTA TCACAAAATG ATTATTCAGA TAAATATTAT TTTGATGTAG CTAAAAAAGG AAATGTTAAA ATAACTCTTA ATAATTTAAA TTCAGTAGGA ATAACTTGGA CACTTTATAA AGAGGGAGAC CTAAACAATT ATGTTTTATA TGCAACTAGA AATGACGGAA CAGAATTAAA GGGTGAAAAG ATTTTAGAGC CTGGAAGATA CTATTTAAGT GTATATACTT ATGATAATCA ATCAGGAGCT TACACAGTAA ATGTAAAAGG AAACCTTAAA AATGAAGTTA AAGAAGTAGA AAAAGATTCT ATAAAAGAAG TTGAAAATAA CAATGATTTT GATAAAGCTA TGAAGGTAGA CAGTAATAGC AAAATAGTTG GAACATTAAG CAATGATGAT CTTAAGGATA TTTATAGCAT AGATATACAA AATCCAAGTG ATTTAAACAT AGTAGTTGAA AACTTAGATA ATATAAAAAT GAACTGGTTA TTATATTCAG CTGATGATTT AAGTAACTAT GTGTATTACG CTAATGCAGA TGGAAATAAA TTAAGTAACA CTTGTAAGTT AAATCCAGGT AAATATTACT TATGTGTTTA TCAATTTGAA AACTCAGGTA CTGGAAATTA CACAGTGAAC TTACAAAACA AATAA
|
Protein sequence | MKKNLKRGEL TKLKLVERWS ATFTLAAFIL FNSSFKVFAA DKKIENSNNG QITREINADQ ISKTELNNEV ATDNNRPLGP SIAPSRARNN KIYTFDELNR MNYSDLVELI KTISCENVPD LFNFNDGSYT FFSNRDRVQA IIYGLEDSGR TYTADDDKGI PTLVEFLRAG YYLGFYNKQL SYLNTPQLKN ECLPAMKAIQ YNSNFRLGTK AQDGVVEALG RLIGNASADP EVINNCIYVL SDFKDNIDKY GSNYSKGNAV FNLMKGIDYY TNLVIYNTKG YDAKNTEFYN RIDPYMERLE SLCTIGDKLN NDNAWFVNNA LYYTGRMGKF REDPSISQRA LERAMKEYPY LSYQYIEAAN DLDLNFGGKN SSGNDIDFNK IKADAREKYL PKTYTFDDGK FVVKAGDKVT EEKIKRLYWA SKEVKAQFMR VVQNDKALEE GNPDDILTVV IYNSPEEYKL NRIINGFSTD NGGIYIENIG TFFTYERTPE ESIYTLEELF RHEFTHYLQG RYVVPGMWGQ GEFYQEGVLT WYEEGTAEFF AGSTRTDGIK PRKSVTQGLA YDRNNRMSLY DVLHAKYGSW DFYNYGFALS NYMYNNNIGM FNKMTNYIKN NDVSGYKDYI ASMSSDYGLN DKYQDYIDSL LNNIDNLDVP SVSDEYVNGH EAKDINEITK DIKEVSNIKD LSSNVEKSQF FTTYDMRGTY VGGRSQGEEN DWKDMNSKLN DILKELSKKS WNGYKTVTAY FVNHKVDENG NYVYDVVFHG MNTDTNTDVH VNKEPKAVIK SDSSVIVEEE INFDGTESKD EDGEIKAYEW DFGDGEKSNE AKAAHKYNKT GEYEVKLTVT DNNGGINTES KKIKVVEDKP VEVINESEPN NDLEKANQIT KSNMLVKGTL SQNDYSDKYY FDVAKKGNVK ITLNNLNSVG ITWTLYKEGD LNNYVLYATR NDGTELKGEK ILEPGRYYLS VYTYDNQSGA YTVNVKGNLK NEVKEVEKDS IKEVENNNDF DKAMKVDSNS KIVGTLSNDD LKDIYSIDIQ NPSDLNIVVE NLDNIKMNWL LYSADDLSNY VYYANADGNK LSNTCKLNPG KYYLCVYQFE NSGTGNYTVN LQNK
|
| |
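The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be recomputed from the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
head = "ATGAAGAAAA ACTTAAAAAG GGGAGAGCTA"
print(translate(head))  # MKKNLKRGEL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be compared against the record in the same way.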