Gene CPF_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0166 
SymbolcolA 
ID4201122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp197623 
End bp200937 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content29% 
IMG OID638081047 
Productcollagenase 
Protein accessionYP_694630 
Protein GI110798785 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA ACTTAAAAAG GGGAGAGCTA ACGAAACTAA AGTTAGTTGA AAGATGGTCA 
GCTACCTTTA CTTTAGCAGC ATTTATTTTA TTTAATAGCT CGTTTAAAGT ATTTGCAGCT
GATAAAAAAA TAGAGAATAG TAATAATGGA CAGATTACTA GAGAGATTAA TGCTGATCAG
ATTTCTAAAA CAGAATTAAA TAATGAGGTA GCTACAGACA ATAATAGACC ATTAGGCCCT
AGTATTGCTC CATCAAGAGC AAGAAACAAC AAGATCTATA CATTCGATGA ACTTAACAGA
ATGAATTATA GTGATCTAGT TGAATTAATA AAAACAATAA GTTATGAAAA CGTACCAGAC
TTATTTAATT TTAATGATGG TTCATATACT TTCTTTAGTA ATAGAGATCG TGTACAAGCT
ATAATATATG GTCTAGAGGA TAGTGGAAGA ACTTATACAG CAGATGATGA TAAGGGAATT
CCAACTTTAG TTGAGTTTTT AAGAGCTGGA TATTATTTAG GATTTTACAA TAAACAATTA
TCATACCTAA ATACACCACA GTTAAAAAAT GAGTGTTTAC CAGCTATGAA AGCGATTCAA
TATAATAGTA ATTTTAGATT AGGAACAAAG GCGCAAGATG GAGTTGTTGA GGCTTTAGGA
AGACTTATAG GTAATGCTTC AGCAGATCCA GAAGTTATTA ATAATTGCAT ATATGTCTTA
AGTGATTTTA AAGATAATAT AGATAAGTAT GGTTCGAACT ATAGCAAGGG AAATGCAGTA
TTCAACCTTA TGAAAGGTAT TGATTATTAC ACTAATTCAG TAATATACAA TACTAAGGGA
TATGATGCTA AAAACACTGA GTTTTATAAT AGAATAGATC CATATATGGA AAGATTAGAA
AGTTTATGTA CAATAGGTGA TAAGTTAAAT AATGATAATG CTTGGTTTGT AAATAATGCC
TTATATTACA CAGGTAGAAT GGGTAAGTTT AGAGAAGACC CATCAATATC TCAAAGAGCT
TTAGAAAGAG CTATGAAGGA GTATCCTTAT TTATCATATC AATATATTGA AGCTGCCAAT
GATTTAGATT TAAATTTTGG TGGCAAAAAT TCATCAGGAA ATGATATAGA TTTCAATAAG
ATAAAAGCAG ATGCAAGGGA AAAATATCTT CCAAAAACAT ATACTTTTGA TGATGGCAAA
TTTGTAGTAA AAGCTGGTGA TAAAGTAACA GAAGAAAAGA TAAAAAGATT ATACTGGGCT
TCAAAGGAAG TTAAGGCTCA ATTTATGAGA GTAGTTCAAA ATGATAAGGC TTTAGAAGAG
GGAAATCCAG ATGATATTTT AACTGTTGTT ATTTATAACT CACCAGAAGA GTATAAGTTA
AATCGTATAA TAAATGGATT TAGTACTGAT AATGGTGGTA TATATATTGA AAACATAGGA
ACTTTCTTTA CTTATGAAAG AACACCAGAG GAAAGTATAT ATACATTAGA AGAATTATTC
CGTCATGAAT TTACTCACTA TCTTCAAGGT AGATATGTAG TTCCTGGAAT GTGGGGACAA
GGAGAATTCT ATCAAGAGGG AGTTTTAACT TGGTATGAAG AAGGAACAGC AGAGTTCTTT
GCAGGTTCAA CTAGAACTGA TGGAATAAAA CCAAGAAAAT CAGTTACACA AGGATTAGCT
TACGATAGAA ATAATAGGAT GTCTTTATAT GATGTATTAC ATGCTAAATA TGGCTCATGG
GATTTCTATA ATTATGGATT TGCTTTATCA AACTACATGT ACAACAATAA CATTGGAATG
TTTAATAAGA TGACAAATTA CATAAAGAAT AATGATGTAT CTGGTTATAA AGATTATATT
GCATCAATGA GTAGTGATTA CGGATTAAAT GATAAATATC AAGACTATAT AGATTCTTTA
TTAAATAATA TTGACAACTT AGATGTTCCT TTAGTTTCAG ATGAATATGT AAATGGACAT
GAAGCTAAGG ATATAAATGA AATAACTAAT GATATAAAAG AAGTTTCAAA TATAAAAGAT
CTTTCTAGTA ATGTTGAAAA GTCTCAATTC TTTACTACTT ACGATATGAG AGGAACATAT
GTAGGGGAAA GAAGCCAAGG GGAAGAAAAT GACTGGAAAG ATATGAATTC TAAGTTAAAT
GATATATTAA AAGAATTATC TAAAAAGAGC TGGAATGGGT ATAAAACTGT TACTGCATAC
TTTGTAAACC ATAAAGTAGA TGAAAATGGT AACTATGTTT ATGATGTTGT ATTCCATGGA
ATGAATACAG ATACAAATAC TGATGTTCAT GTAAATAAAG AGCCTAAGGC TGTTATAAAA
TCTGATTCTT CAGTAATAGT TGAAGAAGAA ATAAACTTTG ATGGAACAGA GTCAAAAGAT
GAAGATGGTG AAATTAAAGC TTATGAATGG GACTTTGGAG ATGGAGAAAA ATCTAATGAG
GCTAAAGCTG CTCATAAATA TAATAAAACT GGAGAATATG AAGTAAAATT AACAGTTACA
GATAATAACG GTGGAATAAA TACTGAAAGT AAAAAGATAA AAGTAGTAGA AGATAAACCT
GTTGAAGTTA TAAATGAAAG CGAGCCTAAC AATGATTTTG AAAAGGCTAA CCAAATAGCT
AAATCTAATA TGTTAGTTAA GGGTACTTTA TCAGAAGAGG ATTATTCAGA TAAATATTAT
TTTGATGTAG CTAAAAAAGG CAATGTTAAA ATCACTCTTA ATAATTTAAA TTCAGTAGGA
ATAACTTGGA CACTTTATAA AGAGGGAGAC CTAAACAATT ATGTTTTATA TGCAACTGGA
AATGATGGAA CAGAATTAAA GGGTGAAAAG ACTTTAGAGC CTGGAAGATA CTACTTAAGT
GTATATACTT ATGATAATCA ATCAGGAGCT TACACAGTAA ATGTAAAAGG AAAACTTAAA
AATGAAGTTA AAGAAACAGA AAAGGATGCT ATAAAAGAAG TTGAAAATAA CAATGATTTT
GATAAAGCTA TGAAGGTAGA TAGTAATAGC AAAATAGTTG GAACATTAAG CAATGATGAT
CTTAAGGATA TTTATAGCAT AGATATACAA AATCCAAGTG ACTTAAACAT AGTAGTTGAA
AACTTAGATA ATATAAAAAT GAACTGGTTA TTATATTCAG CTGATGATTT AAGTAACTAT
GTGGATTACG CTAACGCAGA TGGAAATAAA TTAAGTAACA CTTGTAAGTT AAATCCAGGT
AAATATTACT TATGTGTTTA TCAATTTGAA AACTCAGGTA CTGGAAATTA CACAATAAAC
TTACAAAACA AATAA
 
Protein sequence
MKKNLKRGEL TKLKLVERWS ATFTLAAFIL FNSSFKVFAA DKKIENSNNG QITREINADQ 
ISKTELNNEV ATDNNRPLGP SIAPSRARNN KIYTFDELNR MNYSDLVELI KTISYENVPD
LFNFNDGSYT FFSNRDRVQA IIYGLEDSGR TYTADDDKGI PTLVEFLRAG YYLGFYNKQL
SYLNTPQLKN ECLPAMKAIQ YNSNFRLGTK AQDGVVEALG RLIGNASADP EVINNCIYVL
SDFKDNIDKY GSNYSKGNAV FNLMKGIDYY TNSVIYNTKG YDAKNTEFYN RIDPYMERLE
SLCTIGDKLN NDNAWFVNNA LYYTGRMGKF REDPSISQRA LERAMKEYPY LSYQYIEAAN
DLDLNFGGKN SSGNDIDFNK IKADAREKYL PKTYTFDDGK FVVKAGDKVT EEKIKRLYWA
SKEVKAQFMR VVQNDKALEE GNPDDILTVV IYNSPEEYKL NRIINGFSTD NGGIYIENIG
TFFTYERTPE ESIYTLEELF RHEFTHYLQG RYVVPGMWGQ GEFYQEGVLT WYEEGTAEFF
AGSTRTDGIK PRKSVTQGLA YDRNNRMSLY DVLHAKYGSW DFYNYGFALS NYMYNNNIGM
FNKMTNYIKN NDVSGYKDYI ASMSSDYGLN DKYQDYIDSL LNNIDNLDVP LVSDEYVNGH
EAKDINEITN DIKEVSNIKD LSSNVEKSQF FTTYDMRGTY VGERSQGEEN DWKDMNSKLN
DILKELSKKS WNGYKTVTAY FVNHKVDENG NYVYDVVFHG MNTDTNTDVH VNKEPKAVIK
SDSSVIVEEE INFDGTESKD EDGEIKAYEW DFGDGEKSNE AKAAHKYNKT GEYEVKLTVT
DNNGGINTES KKIKVVEDKP VEVINESEPN NDFEKANQIA KSNMLVKGTL SEEDYSDKYY
FDVAKKGNVK ITLNNLNSVG ITWTLYKEGD LNNYVLYATG NDGTELKGEK TLEPGRYYLS
VYTYDNQSGA YTVNVKGKLK NEVKETEKDA IKEVENNNDF DKAMKVDSNS KIVGTLSNDD
LKDIYSIDIQ NPSDLNIVVE NLDNIKMNWL LYSADDLSNY VDYANADGNK LSNTCKLNPG
KYYLCVYQFE NSGTGNYTIN LQNK