Gene CPR_0162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0162 
SymbolcolA 
ID4205051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp199196 
End bp202510 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content29% 
IMG OID642564717 
Productcollagenase 
Protein accessionYP_697499 
Protein GI110803242 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA ACTTAAAAAG GGGAGAGCTA ACGAAACTAA AGTTAGTTGA AAGATGGTCA 
GCTACCTTTA CTTTAGCAGC ATTTATTTTA TTTAATAGCT CGTTTAAAGT ATTTGCAGCT
GATAAAAAAA TAGAGAATAG TAATAATGGA CAGATTACTA GAGAGATTAA TGCTGATCAG
ATTTCTAAAA CAGAATTAAA TAATGAGGTA GCTACAGACA ATAATAGACC ATTAGGACCT
AGTATTGCTC CATCAAGAGC AAGAAACAAC AAGATCTATA CATTCGATGA ACTTAACAGA
ATGAATTATA GTGATCTAGT TGAATTAATA AAAACAATAA GTTGTGAAAA CGTACCAGAC
TTATTTAATT TTAATGATGG TTCATATACT TTCTTTAGTA ATAGAGATCG TGTACAAGCT
ATAATATATG GTCTAGAGGA TAGTGGAAGA ACTTATACAG CAGATGATGA TAAGGGAATT
CCAACTTTAG TTGAGTTTTT AAGAGCTGGA TATTATTTAG GGTTTTACAA TAAACAATTA
TCATACCTAA ATACACCACA GTTAAAAAAT GAGTGTTTAC CAGCTATGAA AGCGATTCAA
TATAATAGTA ATTTTAGATT AGGAACAAAG GCGCAAGATG GAGTTGTTGA GGCTTTAGGA
AGACTTATAG GTAATGCTTC AGCAGATCCA GAAGTTATTA ATAATTGCAT ATATGTCTTA
AGTGATTTTA AAGATAATAT AGATAAGTAT GGTTCGAACT ATAGCAAGGG AAATGCAGTA
TTCAACCTTA TGAAAGGTAT TGATTATTAT ACTAATTTAG TAATATACAA TACTAAGGGA
TATGATGCTA AAAACACTGA GTTCTATAAT AGAATAGATC CATATATGGA AAGATTAGAA
AGTTTATGTA CAATAGGTGA TAAGTTAAAT AATGATAATG CTTGGTTTGT AAATAATGCC
TTATATTACA CAGGTAGAAT GGGTAAGTTT AGAGAAGACC CATCAATATC TCAAAGAGCT
TTAGAAAGAG CTATGAAGGA GTATCCTTAT TTATCATATC AATATATTGA AGCTGCCAAT
GATTTAGATT TAAATTTTGG TGGCAAAAAT TCATCAGGAA ATGATATAGA TTTTAATAAG
ATAAAAGCAG ATGCAAGGGA AAAATATCTT CCAAAAACAT ATACTTTTGA TGATGGCAAA
TTTGTAGTAA AAGCTGGTGA TAAAGTAACA GAAGAAAAGA TAAAAAGATT ATACTGGGCT
TCAAAGGAAG TTAAGGCTCA ATTTATGAGA GTAGTTCAAA ATGATAAGGC TTTAGAAGAG
GGAAATCCAG ATGATATTTT AACTGTTGTT ATTTATAACT CACCAGAAGA GTATAAGTTA
AATCGTATAA TAAATGGATT TAGTACTGAT AATGGTGGTA TATATATTGA AAACATAGGA
ACTTTCTTTA CTTATGAAAG AACACCAGAG GAAAGTATAT ATACATTAGA AGAATTATTC
CGTCATGAAT TTACTCACTA TCTTCAAGGT AGATATGTAG TTCCTGGAAT GTGGGGACAA
GGAGAATTCT ATCAAGAGGG AGTTTTAACT TGGTATGAAG AAGGAACAGC AGAGTTCTTT
GCAGGTTCAA CTAGAACTGA TGGAATAAAA CCAAGAAAAT CAGTTACACA AGGATTAGCT
TACGATAGAA ATAATAGGAT GTCTTTATAT GATGTATTAC ATGCTAAATA TGGCTCATGG
GATTTCTATA ATTATGGATT TGCTTTATCA AACTACATGT ACAACAATAA CATTGGAATG
TTTAATAAGA TGACAAATTA CATAAAGAAT AATGATGTAT CTGGTTATAA AGATTATATT
GCATCAATGA GTAGTGATTA CGGATTAAAT GATAAATATC AAGACTATAT AGATTCTTTA
TTAAATAATA TTGACAACTT AGATGTTCCT TCAGTTTCAG ATGAATATGT AAATGGACAT
GAAGCTAAGG ACATAAATGA AATAACTAAG GATATAAAAG AAGTTTCAAA TATAAAAGAT
CTTTCTAGTA ATGTTGAAAA GTCTCAATTC TTTACTACTT ACGATATGAG AGGAACATAT
GTAGGGGGAA GAAGCCAAGG GGAAGAAAAT GACTGGAAAG ATATGAATTC TAAGTTAAAT
GATATATTAA AAGAATTATC TAAAAAGAGC TGGAATGGGT ATAAAACTGT TACTGCATAC
TTTGTAAACC ATAAAGTAGA TGAAAATGGT AACTATGTTT ATGATGTTGT ATTCCATGGA
ATGAATACAG ATACAAATAC TGATGTTCAT GTAAATAAAG AGCCTAAGGC TGTTATAAAA
TCTGATTCTT CAGTAATAGT TGAAGAAGAA ATAAACTTTG ATGGAACAGA GTCAAAAGAT
GAAGATGGTG AAATTAAAGC TTATGAATGG GACTTTGGAG ATGGAGAAAA ATCTAATGAG
GCTAAAGCTG CTCATAAATA TAATAAAACT GGAGAATATG AAGTAAAATT AACAGTTACA
GATAATAATG GTGGAATAAA TACTGAAAGT AAAAAGATAA AAGTAGTAGA AGATAAACCT
GTTGAAGTTA TAAATGAAAG TGAGCCTAAC AATGATTTGG AAAAAGCTAA CCAAATAACT
AAATCTAATA TGTTAGTTAA GGGTACTTTA TCACAAAATG ATTATTCAGA TAAATATTAT
TTTGATGTAG CTAAAAAAGG AAATGTTAAA ATAACTCTTA ATAATTTAAA TTCAGTAGGA
ATAACTTGGA CACTTTATAA AGAGGGAGAC CTAAACAATT ATGTTTTATA TGCAACTAGA
AATGACGGAA CAGAATTAAA GGGTGAAAAG ATTTTAGAGC CTGGAAGATA CTATTTAAGT
GTATATACTT ATGATAATCA ATCAGGAGCT TACACAGTAA ATGTAAAAGG AAACCTTAAA
AATGAAGTTA AAGAAGTAGA AAAAGATTCT ATAAAAGAAG TTGAAAATAA CAATGATTTT
GATAAAGCTA TGAAGGTAGA CAGTAATAGC AAAATAGTTG GAACATTAAG CAATGATGAT
CTTAAGGATA TTTATAGCAT AGATATACAA AATCCAAGTG ATTTAAACAT AGTAGTTGAA
AACTTAGATA ATATAAAAAT GAACTGGTTA TTATATTCAG CTGATGATTT AAGTAACTAT
GTGTATTACG CTAATGCAGA TGGAAATAAA TTAAGTAACA CTTGTAAGTT AAATCCAGGT
AAATATTACT TATGTGTTTA TCAATTTGAA AACTCAGGTA CTGGAAATTA CACAGTGAAC
TTACAAAACA AATAA
 
Protein sequence
MKKNLKRGEL TKLKLVERWS ATFTLAAFIL FNSSFKVFAA DKKIENSNNG QITREINADQ 
ISKTELNNEV ATDNNRPLGP SIAPSRARNN KIYTFDELNR MNYSDLVELI KTISCENVPD
LFNFNDGSYT FFSNRDRVQA IIYGLEDSGR TYTADDDKGI PTLVEFLRAG YYLGFYNKQL
SYLNTPQLKN ECLPAMKAIQ YNSNFRLGTK AQDGVVEALG RLIGNASADP EVINNCIYVL
SDFKDNIDKY GSNYSKGNAV FNLMKGIDYY TNLVIYNTKG YDAKNTEFYN RIDPYMERLE
SLCTIGDKLN NDNAWFVNNA LYYTGRMGKF REDPSISQRA LERAMKEYPY LSYQYIEAAN
DLDLNFGGKN SSGNDIDFNK IKADAREKYL PKTYTFDDGK FVVKAGDKVT EEKIKRLYWA
SKEVKAQFMR VVQNDKALEE GNPDDILTVV IYNSPEEYKL NRIINGFSTD NGGIYIENIG
TFFTYERTPE ESIYTLEELF RHEFTHYLQG RYVVPGMWGQ GEFYQEGVLT WYEEGTAEFF
AGSTRTDGIK PRKSVTQGLA YDRNNRMSLY DVLHAKYGSW DFYNYGFALS NYMYNNNIGM
FNKMTNYIKN NDVSGYKDYI ASMSSDYGLN DKYQDYIDSL LNNIDNLDVP SVSDEYVNGH
EAKDINEITK DIKEVSNIKD LSSNVEKSQF FTTYDMRGTY VGGRSQGEEN DWKDMNSKLN
DILKELSKKS WNGYKTVTAY FVNHKVDENG NYVYDVVFHG MNTDTNTDVH VNKEPKAVIK
SDSSVIVEEE INFDGTESKD EDGEIKAYEW DFGDGEKSNE AKAAHKYNKT GEYEVKLTVT
DNNGGINTES KKIKVVEDKP VEVINESEPN NDLEKANQIT KSNMLVKGTL SQNDYSDKYY
FDVAKKGNVK ITLNNLNSVG ITWTLYKEGD LNNYVLYATR NDGTELKGEK ILEPGRYYLS
VYTYDNQSGA YTVNVKGNLK NEVKEVEKDS IKEVENNNDF DKAMKVDSNS KIVGTLSNDD
LKDIYSIDIQ NPSDLNIVVE NLDNIKMNWL LYSADDLSNY VYYANADGNK LSNTCKLNPG
KYYLCVYQFE NSGTGNYTVN LQNK