Gene Information |
Locus tag | CPR_2070 |
Symbol | |
ID | 4206076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2292547 |
End bp | 2294301 |
Gene length | 1755 bp |
Protein length | 584 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642566620 |
Product | subtilase family protein |
Protein accession | YP_699379 |
Protein GI | 110803826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
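The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2292547
end_bp = 2294301


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1755 bp) and Protein Length (584 aa).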
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.725606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTTA AGTCTTCAGC TCCAGAATAT GTATTTAATA ATGAAAATTA TTTAAATTAT CTAGTTCAGT ATCAAGGTGA CATTATAGGA GAGTTTAATA ATCAAAATGG AATATATGCA ACTTTAATTA ATGATAGATA TGCCATAATA ACTATTGATA AAAATTTACA GGAATATGAA AATGATATAT CTATGATTAA ATATAAAAAA GAAAATGGAA GAGATGTAGA AATAGATAAC ATTACTTATA TAAAAGATCC AGAAGGATAT GTACTTCAGG AAATAAGCCC TTTACAAGCA GCAAATATTG AATATGTTCA AATACAATCA TACTTTAATC TAACAGGTAA AGGGGTTATT GTTGGAATTT TAGATACTGG AATAGATTAT TTAAACGAAG AGTTTATGGA TTCATATGGT AATACAAGAA TTTTAGGAAT TTGGGATCAA ACAATTTCAA GTGAAAGTTC AAGGGAGAAT AATTTACCTT ATGGAACTTT TTATTCAGAA GAAGATATAA ATAGAGCAAT AAGACTTAGT AGAGAAGGAG GAGATCCATA TACAATAGTT CCATCAAGAG ATGAAATAGG CCATGGAACA TCAATGGCAG GAATAATTGG ATCATCAGGA AAAAATCCTA GGTTAAAGGG GGTTGCACCA GATTGTAAGT TCTTAGTTGT AAAGCTTGCA CAGTCACTTT ATTATAAAAA AGAGTATGAA ATAAAAATAC CTGTTTATAA TATAACTGAA ATATTTACAG GAATACAATA TTTATATTCA TACTTTTTAA AAAGATCTCA AAGTATGATT ATATATTTAC CTTTAGGCAC TAATAGAGGA AGTCATAAAG GAACAAGCAT CTTAGAAGAA TTTTTAGATT CTATATTAAT AAATAGAGGA ATTGCTTTAA TTACAGGTGC TGGAAATGAA GGAGCAGCAT TATTACATGG ATCAGGAACT ATAAAACCTA ATGGACAAGT AACTACTCAT GAATTTAATA TAGATGAAAA TCAAAAAAAG ATTATCATTG AAGCTTGGAT ACAAATACCT AGTATTGCTT CCGTAGAAAT AGTTTCACCT ACTGGAGGAA CTACAGGAAT AATTCAACCT TTCTTTGGAA AAGGAAATAA GTATTATTTT ACAATTGAAC GAACTACAGT GTTAGTAAGT TATTATATAC CTGAAGAAAT TTATGAAGAT TCATTGATAT TAATTATACT AGATAATGTT CAAGCTGGAA TATGGAGTTT TAAGTTTAGA GGATTAAATA ATATAGAAGG AAGATATAAT ATTTGGCTTC CGCCAAAAGG GATAAGTAAA GAAAAAACTA GAATGATATA TCCTGACCCA TATGGAACAG TAACTGTCCC AGGTACTAGC ATCTCTGTAA TAACAGTTGC TGCATATAAT CAATTAAATA ATACACAATT AATTTATTCA GGGAGAGGAT TTAAAGATAA TTATATTGAT ATTATAGATG TAGCAGCAGG AGGAGTAAAT GCACTTACAG TTGCGCCAGA TAATAAGACA ACTTTAGCAA ATGGTACAAG TGTTGCAGCG GCTATAGTAG CTGGAATTTG TGTTTTACTT TTCCAATGGG GAATTGTTGA AGGAAATTAT CCTTATATGT TCTCTCAAAC TTTAAAAGCT TTTATTACAA GAGGTACAAG AAAGAGAAAA GGGGACACTT ATCCAAATCC AGAGTGGGGA TATGGAATAG TAGATATGTT TAATATGTTT AATCTTACTA ATTAA
|
Protein sequence | MEFKSSAPEY VFNNENYLNY LVQYQGDIIG EFNNQNGIYA TLINDRYAII TIDKNLQEYE NDISMIKYKK ENGRDVEIDN ITYIKDPEGY VLQEISPLQA ANIEYVQIQS YFNLTGKGVI VGILDTGIDY LNEEFMDSYG NTRILGIWDQ TISSESSREN NLPYGTFYSE EDINRAIRLS REGGDPYTIV PSRDEIGHGT SMAGIIGSSG KNPRLKGVAP DCKFLVVKLA QSLYYKKEYE IKIPVYNITE IFTGIQYLYS YFLKRSQSMI IYLPLGTNRG SHKGTSILEE FLDSILINRG IALITGAGNE GAALLHGSGT IKPNGQVTTH EFNIDENQKK IIIEAWIQIP SIASVEIVSP TGGTTGIIQP FFGKGNKYYF TIERTTVLVS YYIPEEIYED SLILIILDNV QAGIWSFKFR GLNNIEGRYN IWLPPKGISK EKTRMIYPDP YGTVTVPGTS ISVITVAAYN QLNNTQLIYS GRGFKDNYID IIDVAAGGVN ALTVAPDNKT TLANGTSVAA AIVAGICVLL FQWGIVEGNY PYMFSQTLKA FITRGTRKRK GDTYPNPEWG YGIVDMFNMF NLTN
|
| |
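The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code, differing only in the set of permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in the conventional TCAG ordering.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGGAATTTA AGTCTTCAGC TCCAGAATAT"))
```

Applied to the first 30 bases, this yields the first ten residues of the listed protein sequence (MEFKSSAPEY). Passing the full gene sequence to `translate` should allow the same comparison over all 584 residues.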