Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1049 |
Symbol | |
ID | 4204931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1194804 |
End bp | 1195958 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565605 |
Product | ISCpe2, transposase orfB |
Protein accession | YP_698371 |
Protein GI | 110802891 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00154487 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAAGAA AGAAAGCATA TAAATTTAGA ATATATCCAA CTAAGAAACA AGCCGAGCTA ATTAACAAGA CTATTGGATG TTGTAGGTTT GTATTCAACT ATTCTTTAGG TGTTCAAAAA ACTAAAGATA ATTATTGGAA TATAGTTGAG GAGATGGTTC AACAAGGCTA CTTCCAAGAA AATAATTGGA AAGGCGAGTT CTTTAATAAA GCCAATTCAA TTAAAGATAT AGCTAAATTA AAGAAAAGCT ATGATTGGCT TAAAGAAGTA GATAGCATAG CACTACAGGC TTCTGTTGAA AATCTTGGTG CTGGATATGA TAAGTATTAC AAAAAAATAG GTGGCAGACC TAAATTTAAG TCTAAGAAAA ATGAAATACA ATCTTATACT ACTAAATTGG TAAAAGCTAA AGGTAATGTA AATATTGAGA TAGTAGGTAA AGGAATTAAA CTACCTAAAC TTGGATTAGT TAAGATAGAA AATTCAAGAA ATGTAGATGG AAATATTAAA AGAGTTACTG TGAGCAGAAC ACAATCTGGT AAATATTTTG CATCTATTCT TTGTGATGTA AATATTCAAG AATTACCTAA AATAGATAAA AAAGTTGGTG TAGATGTTGG ATTAAAAACT TTTGCTGTTT GTTCTGATGG ATATGAAGAA GCTAATCCTA AACACTTTAG AAAGGCAGAA AAAAGACTTA TTAAACTTCA AAGAGATTTA GCTCGTAAAG AATATAACTC TAAAAACTAT CACAAAAATA GAATTATGAT TGCTAAATTA CACGAAAGGA TAGTAAATCA AAGAATGGAC TTCTTACAAA AATTTTCAAC TAAGCTAATT AGAGAAAACC AATCAATAGC CATTGAAGAT TTAAGAGTTT CAAATATGCT TAAAAACCAC AAACTTGCTA AAGTAATTTC AGAAGCAAGT TGGTCAGAAT TTAGAAGAAT GCTAGAATAC AAGGCTGAAT GGTATGGTAG AAAAATAGTA ATTGCTCCAC CAGACTATGC AAGCTCTCAA CTATGTTCTG AATGTGGCTA TAAGAATATA GAAGTTAAAA ACCTAGGTCT AAGAGAATGG GTATGTCCCG AATGTGAAAC ACACCATCAA AGAGACCTTA ATGCTTCAAT AAACTTGGAA AAGCTGATAG CTTAA
|
Protein sequence | MIRKKAYKFR IYPTKKQAEL INKTIGCCRF VFNYSLGVQK TKDNYWNIVE EMVQQGYFQE NNWKGEFFNK ANSIKDIAKL KKSYDWLKEV DSIALQASVE NLGAGYDKYY KKIGGRPKFK SKKNEIQSYT TKLVKAKGNV NIEIVGKGIK LPKLGLVKIE NSRNVDGNIK RVTVSRTQSG KYFASILCDV NIQELPKIDK KVGVDVGLKT FAVCSDGYEE ANPKHFRKAE KRLIKLQRDL ARKEYNSKNY HKNRIMIAKL HERIVNQRMD FLQKFSTKLI RENQSIAIED LRVSNMLKNH KLAKVISEAS WSEFRRMLEY KAEWYGRKIV IAPPDYASSQ LCSECGYKNI EVKNLGLREW VCPECETHHQ RDLNASINLE KLIA
|
| |