Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1027 |
Symbol | |
ID | 4204843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1168894 |
End bp | 1170105 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642565584 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_698350 |
Protein GI | 110802900 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.155494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA TTTATAATAA TAGATATTAT GATTACAATA GATATTATGA TGATGAATAT TATCAAGATG ATTATTATCA AAATGATTAT CACTGCAAAG AAGACTATTG TTGTAAAGAT GATTGCTATT TAAAAATAAA TTGTAATTGT TGCAAGCCTG GACCAAGGGG ACCAAGAGGC CCTCAAGGTG AACAAGGACC TCAAGGTGAA AGAGGATTTA CTGGCCCTCA AGGTCCTGTT GGTCCTCAAG GTGAACAAGG ACCTCAAGGT GAAAGAGGAT TTACCGGTCC TCAAGGTCCT ATTGGTCTTC AAGGTGAACA AGGACCTCAA GGTGAAAGAG GATTTACCGG TCCTCAAGGT CCTGTTGGTC CTCAAGGTGA ACAAGGACCT CAAGGTGAAA GAGGATTTAC CGGTCCTCAA GGTCCTGTTG GTCCTCAAGG TGAACAAGGA CCTCAAGGTG AAAGAGGATT TACCGGCCCT CAAGGTCCTA TTGGTCCTCA AGGTGAACAA GGACCTCAAG GTGAAAGAGG ATTTACTGGC CCTCAAGGTC CTATTGGTCC TCAAGGAAAT CAAGGTCCTA TTGGTCCCCA AGGTGAACAA GGTCCTCAGG GCGCTACAGG ACCACAAGGT CCTCAAGGTC CTGTTGGTCC TCAAGGAAAT CAAGGCCCTA TTGGTCCTCA AGGTCCTGTT GGTCCTCAAG GTCCTCAAGG GCAACCTGGA GTTAATTTTA ACGATACCTT ATTAGTTAGT AGTTCTTCAT TATCTTCACA AAATGTTGGT TCTAATGGTA TATTCACTTA TAATATCCAA AATCCTAATG GTTCAACTTT TACAGCAATA ACTGCCAATA TAGCAAACGG AACGTTTACA ATAAATGAAC CTGGAAGATA TTTATTTATT TGGTCATTTA ATTTAGATAA CACAAATAAT ACTACAGCTA ATACTATAGT ATCTTTATTT AGAAATGGTT TTAGAATCTT TTTATCTGGT ACTCCTAGAG TAGCTCCTGG TGAAATAGGC GTAGTAAATG GAAGTATTGC CGTAAATGCT AATGCTGGTG ATGTATTTGC TTTAGTTAAT AATTCTACAA GAAACGTTTT ATCACAAATA ATATCTTCAC CAATTTCTGT AACTCCAGCT ATCTTAGGAG AATCTACAGG GATAAATTCA GGAATAGGAT CTTGGGTTCA AATAGTTAAA GTATCTGATT AA
|
Protein sequence | MRKIYNNRYY DYNRYYDDEY YQDDYYQNDY HCKEDYCCKD DCYLKINCNC CKPGPRGPRG PQGEQGPQGE RGFTGPQGPV GPQGEQGPQG ERGFTGPQGP IGLQGEQGPQ GERGFTGPQG PVGPQGEQGP QGERGFTGPQ GPVGPQGEQG PQGERGFTGP QGPIGPQGEQ GPQGERGFTG PQGPIGPQGN QGPIGPQGEQ GPQGATGPQG PQGPVGPQGN QGPIGPQGPV GPQGPQGQPG VNFNDTLLVS SSSLSSQNVG SNGIFTYNIQ NPNGSTFTAI TANIANGTFT INEPGRYLFI WSFNLDNTNN TTANTIVSLF RNGFRIFLSG TPRVAPGEIG VVNGSIAVNA NAGDVFALVN NSTRNVLSQI ISSPISVTPA ILGESTGINS GIGSWVQIVK VSD
|
| |