Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1395 |
Symbol | |
ID | 4206182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1566617 |
End bp | 1569538 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642565949 |
Product | peptidase, putative |
Protein accession | YP_698714 |
Protein GI | 110803737 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.131852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTA AAGAGAATAA TATTTATAGT GGATTTAAAC TTTTAAAAAT AGAAAATTTA AATGAAATAG GTGGATTAGG TTTAAGGTTT GAACATGAAA AAACTAAGGC TAAACTTATA AAAATCTTAA GTGAAGATGA TAATAAGTGC TTTGCAATAG GTTTTAGAAC ACCACCTGAA AATAGTACAG GAGTTCCTCA TATTTTAGAG CATTCAGTTT TATGTGGTTC TAGAAAATTT AATACTAAGG AACCCTTTGT AGAGCTTTTA AAAGGGTCTT TAAATACATT CTTAAATGCT ATGACATACC CAGATAAAAC AATATATCCA GTAGCATCAA GAAATGAAAA AGACTTTATG AATCTTATGG ATGTTTACTT AGATGCTGTA TTATATCCAA ATATATATAA GCATAAGGAA ATATTCATGC AAGAGGGATG GCATTATTAT ATAGAAAATA AGGAAGATGA ATTAAAGTAT AATGGTGTTG TTTATAATGA GATGAAAGGG GCATACTCAT CTCCAGATTC TATACTTTAT AGAAAGATTC CTCAAACAAT ATACCCAGAT ACTTGTTATG CCTTATCTTC AGGAGGAGAT CCTGATGAAA TACCAAATTT AACTTATGAA GAGTTTGTAG AATTTCATAA GAAATATTAT CATCCATCAA ACTCATATAT TTTCTTATAT GGTAATGGAG ATACTGAAAA AGAATTAGAA TTTATAAATG AAGAATATTT AAAGAATTTT GAATATAAAG AGATAGATTC AGAAATAAAA GAACAAAAAT CCTTTGAAAG TATGAAAGAA GAAAGTTTTA CTTATGGAAT AGCTGAAAGT GAAGATTTAA ATCATAAAAG TTATTATAGT TTAAACTTTG TAATTGGAGA TGCCACAGAT GGAGAAAAAG GCTTAGCTTT TGATGTTTTA GCATATCTTC TAACAAGAAG TACAGCAGCC CCATTAAAGA AAGCATTAAT AGATGCAGGT ATAGGGAAAG CTGTATCAGG AGACTTTGAT AACTCAACTA AACAATCAGC CTTTACTGTT TTAGTTAAGA ATGCAGAGCT AAACAAAGAA GAAGAATTTA AAAAAGTAGT AATGGATACT TTAAAGGATT TGGTTGAAAA TGGAATAGAT AAAGAACTTA TAGAAGCTTC CATAAATAGA GTTGAATTTG AATTAAGAGA AGGGGATTAT GGTTCTTATC CTAATGGGTT AATTTATTAT TTAAAAGTTA TGGATAGTTG GCTTTATGAT GGGGATCCAT ATGTTCATTT AGAATATGAA AAAAATCTTG AAAAAATAAA ATCTGCTTTA ACAAGCAATT ACTTTGAAGA TTTAATTGAA AGATATATGA TAAATAATAC TCACTCTTCA CTTGTTTCTC TTCATCCTGA AAAAGGAATA AATGAGAAAA AGTCAGCTGA ATTAAAGAAA AAGTTAGAAG AGATTAAAAA TAGTTTTGAT GAAAAGACTT TAAATGAAAT AATTGATAAT TGTAAAAAGT TAAAAGAAAG ACAAAGTACA CCTGATAAAA AAGAAGATTT AGAAAGCATT CCTATGTTAT CTTTAGAGGA TATAGATAAA GAAGCAACTA AAATTCCTAC AGAAGAGAAA GAGATAGATG GAATTACAAC ATTACACCAT GATTTCCATA CTAATAAAAT AGACTATGTT AATTTCTTCT TTAATACAAA TAGTGTTCCT CAAGATTTAA TACCTTATGT TGGATTGCTA TGTGATATAT TAGGTAAGTG TGGAACAGAA AATTATGATT ATTCTAAGTT ATCAAATGCC ATAAATATAA GCACAGGTGG AATAAGCTTT GGGGCTATAA CTTTTGCTAA TTTAAAGAAA AATAATGAGT TTAGACCATA TTTGGAAATT TCATATAAAG CATTAAGCAG TAAGACTAAT AAAGCTATAG AATTAGTTTC TGAAATTGTA AATCACACTG ACCTAGATGA TATGGACAGA ATTATGCAAA TAATTAGAGA GAAGAGAGCT AGATTAGAAG GTGCTATATT TGATAGTGGT CATAGAATAG CTATGAAAAA AGTTTTATCA TACTCTACAA ATAGAGGAGC TTATGATGAA AAAATAAGTG GATTAGATTA TTATGATTTT CTAGTAAATA TAGAGAAGGA AAATAAAAAA TCAATAATAT CAGATAGCTT AAAAAAGGTA AGAGACTTAA TCTTTAATAA GGGAAATATG CTTATAAGTT ATTCAGGAAA AGAAGAGGAA TATGAAAACT TTAAGGAAAA AGTAAAATAT TTAATAAGCA AAACAAGTAA TAATGATTTT GAAAAAGAAG AGTATAATTT TGAGTTAGGA AAGAAAAATG AAGGACTTTT AACTCAAGGA AATGTACAAT ATGTAGCTAA GGGTGGAAAT TATAAAACTC ATGGATATAA GTATTCTGGG GCCCTATCTT TATTAGAAAG TATTCTAGGC TTTGACTACT TATGGAATGC CGTAAGGGTT AAAGGTGGAG CTTATGGAGT GTTCTCTAAC TTTAGAAGAG ATGGCGGAGC ATATATAGTT TCATATAGAG ACCCTAATAT AAAAAGCACT TTAGAAGCTT ATGATAATAT ACCTAAGTAT TTAAATGATT TTGAAGCTGA CGAAAGAGAA ATGACTAAAT ACATCATAGG TACAATAAGA AAATATGATC AACCTATAAG CAATGGAATA AAAGGAGATA TAGCAGTTTC ATATTACTTG AGTAACTTTA CTTATGAAGA TCTTCAAAAG GAAAGAGAAG AAATCATAAA TGCAGATGTA GAAAAAATTA AGAGTTTTGC ACCTATGATT AAAGATTTAA TGAAGGAAGA TTACATCTGT GTACTAGGTA ATGAAGAAAA GATAAAAGAA AATAAAGAGT TATTTAATAA TATTAAAAGT GTAATTAAAT AG
|
Protein sequence | MNFKENNIYS GFKLLKIENL NEIGGLGLRF EHEKTKAKLI KILSEDDNKC FAIGFRTPPE NSTGVPHILE HSVLCGSRKF NTKEPFVELL KGSLNTFLNA MTYPDKTIYP VASRNEKDFM NLMDVYLDAV LYPNIYKHKE IFMQEGWHYY IENKEDELKY NGVVYNEMKG AYSSPDSILY RKIPQTIYPD TCYALSSGGD PDEIPNLTYE EFVEFHKKYY HPSNSYIFLY GNGDTEKELE FINEEYLKNF EYKEIDSEIK EQKSFESMKE ESFTYGIAES EDLNHKSYYS LNFVIGDATD GEKGLAFDVL AYLLTRSTAA PLKKALIDAG IGKAVSGDFD NSTKQSAFTV LVKNAELNKE EEFKKVVMDT LKDLVENGID KELIEASINR VEFELREGDY GSYPNGLIYY LKVMDSWLYD GDPYVHLEYE KNLEKIKSAL TSNYFEDLIE RYMINNTHSS LVSLHPEKGI NEKKSAELKK KLEEIKNSFD EKTLNEIIDN CKKLKERQST PDKKEDLESI PMLSLEDIDK EATKIPTEEK EIDGITTLHH DFHTNKIDYV NFFFNTNSVP QDLIPYVGLL CDILGKCGTE NYDYSKLSNA INISTGGISF GAITFANLKK NNEFRPYLEI SYKALSSKTN KAIELVSEIV NHTDLDDMDR IMQIIREKRA RLEGAIFDSG HRIAMKKVLS YSTNRGAYDE KISGLDYYDF LVNIEKENKK SIISDSLKKV RDLIFNKGNM LISYSGKEEE YENFKEKVKY LISKTSNNDF EKEEYNFELG KKNEGLLTQG NVQYVAKGGN YKTHGYKYSG ALSLLESILG FDYLWNAVRV KGGAYGVFSN FRRDGGAYIV SYRDPNIKST LEAYDNIPKY LNDFEADERE MTKYIIGTIR KYDQPISNGI KGDIAVSYYL SNFTYEDLQK EREEIINADV EKIKSFAPMI KDLMKEDYIC VLGNEEKIKE NKELFNNIKS VIK
|
| |