Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1888 |
Symbol | |
ID | 4205339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2085145 |
End bp | 2086320 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642566438 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_699198 |
Protein GI | 110802849 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTA ACTTAGTAGA TATAATTATA GTTTGTTCTT TTATACTACC ATTAGTAGTT GCTTACAAAA GAAAGTTTAA TATTATTAGA ATAAAAAATA GTATAGAGGA ATTAGGAGGA TATATATCCT TTTTTCTTGC TTTATACTTA AGTTTTATAG CTATAAAAAA GATAGATATA ATAGAGAGGA TGTTTTCAAT TGTAGTTGTT GAACTAAATA ATATTATAGC TAACTTTAAT ATTTCACCAA AGGTTATAAT AATATTCATA GTATTAGCTC TTACCCTTGT AATTTATTTT ATAGTAAAAG TTATACTTAA AATATTTAGT TTTATTATTA TAAATCCAAT ACTTAGATGG TTAAAAAAAG CAGAGTCTAG AAGGGGTAAG GGATTTGGTA AAGTTGCAGC CTTAATAATA AATATACCAA AGTCTCTTTT TTATATGGCA GTAATTGCTT TAGTCATAGT TATTCTAGGA AGCAATGGTT TCTTAGGAGA GAAAATACAA GGCATGACTT TAGCTTCAAA GGCTTATGAG GTTATAAATA GTAATAAGTA CTATGCTGCT TTAAATAAAG AATATGAAGC TTTTCATGAT GAATATAAAG ATGTTATTAG CAAAAACATA GATTCTACAG TTGAAAGTAA TAAAGAGCCA AAGAGTGAAA AAGTGTTTGA AAGTAATAGA AATGTTATAA ATCTTTATAA TGGTGTAACT TTAGAACAAG GTATAAAATC AAACGAAGCT ATTAATAGAA AAGCTAAGGA ACTTACTAAA AATGCAAAAA GTAGTAGAGA AAAAGCTAAA AGAATATATA CTTGGATAAG TGAAAATATT AATTATGATG ATAATAAAGC TGAAAATATA AGCGAGAAAA CTTCTGAGTA TAAGTCTGGT GCTATTGAAG CCTTTGAAAC TAGAAAAGGA ATATGCTTTG ATTATTCTTG TCTTTATGTT GCTATGGCAA GAGAAGCGGG ACTTAAAGTT AGAATTGTAA CTGGAGAAGG ATTCAATGGA AAGGAATGGG GACCACATTC TTGGAATGAG GTCTATTTAC CAGAAAAAAA TCAATGGATA ACTGTTGATC CTACCTTTGG TAAAGCTGGA AACTATTTTG ATAGCAAAAA AAATAGTGAA TCACACAGAG ATGGAAAAAT AGTTGGTGAA TGGTAA
|
Protein sequence | MNFNLVDIII VCSFILPLVV AYKRKFNIIR IKNSIEELGG YISFFLALYL SFIAIKKIDI IERMFSIVVV ELNNIIANFN ISPKVIIIFI VLALTLVIYF IVKVILKIFS FIIINPILRW LKKAESRRGK GFGKVAALII NIPKSLFYMA VIALVIVILG SNGFLGEKIQ GMTLASKAYE VINSNKYYAA LNKEYEAFHD EYKDVISKNI DSTVESNKEP KSEKVFESNR NVINLYNGVT LEQGIKSNEA INRKAKELTK NAKSSREKAK RIYTWISENI NYDDNKAENI SEKTSEYKSG AIEAFETRKG ICFDYSCLYV AMAREAGLKV RIVTGEGFNG KEWGPHSWNE VYLPEKNQWI TVDPTFGKAG NYFDSKKNSE SHRDGKIVGE W
|
| |