Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1266 |
Symbol | entA |
ID | 4206529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1423866 |
End bp | 1426736 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642565822 |
Product | putative enterotoxin |
Protein accession | YP_698588 |
Protein GI | 110801837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTAG CAATAGTTCA ACAACAGCAG CAACAACCAA ATGATAAGAT TAATATAGAC TTGTCTAATT CTAATGTTGC TGTGGTAAAA AATAAAAAAA ATACTAATCC CCCTATAAAA GATGAAAAAG AATCAAAGGA TAAAGATAAT TTAAAATCTG ATAAAAAAAC CGAGCCTAAA ATTGAAGAAA AAAAACAAGA TAAAGGTAAC TCAAAGGAAG AAGATAAAGC TCAGGTAAAA ACTGAACCTA AAGTTAAAGA GGATAAAGAC ACAAAGAAAA GTTCTGAAGA AATTAAAGAA GAGGTAAAAA ATCAAAAGCC TGTTATAAAA GAAGTAAAAT TACCTGTTGA AAAAGATTCA AAGAATTCTT CTGAAAATTT ATCTAATAAC TCAAGTGACT TTGCTTCAGA TGTTAATAGA AGATTATATG AATATGAAAG TGTTAGAAGT AATCAATCTA ATGCAATGAA TGAAGCTATT CGTCTTCATA ATGGAGACCC TAGCAACACT TGTGTTTTCT TCCAAGGTAC TTGTTTAAGA GCAATAGGAG TACCTGTTCC AAGTCATATT GGATACACTT CTAACCTTTG GAATTGGCTT GAAAATAATA ACTGGGAAAT GCATAGAGAT TTTTCAAACA TTCAAAAGGG AGATATTATC TTTGCAGGTG AATATCACAC AATGTGTTTC ATGGGATGGA AGGATAAAGC AAATGGTATT GCCTATGTAA TGGGAAATGA AGCCTATACC TATGGAGATG CTTATGCTCA TCGTAATTTA AATGGACAAG CTCCTACTAA AGAAAATGGA TGGAATAGAC AATATAGAGC TACAAGATAC TACAAATATA AAGGAAATTC TTCAAAAAAC AATTCTGACA TTGTAAAAAC AAACATTGTA ACATCAAAAC CTCTTAGCTC AAAGGATCAT AAATATACAG TTGAAGCCTT AAAGGGTACT GTAACAGCTA AGCAAGATGT TTATATTAAT GAAAAACCCT TCCCTACTAC TGAAGGAATT AAGCCTGTAG GACTAGCTAA TGGAGGAGAT AAATTAACTG TAACTGGTAG AGCCTCAAAT GGATGGTATG AAGTTTTATA TAATGGCCAA AAAGCTTACA TAAGTCATAG ATATACTAAC TTCTTAGAAG AAAAGGAATC TAAATCTAAG TCCTCTGAAT CTATTACTCC TCTTGGCGTA GTTAAAACTA ATACTTCAGA TAAAAAATCT GATTCTAAAG AAGATAATAA AAAGATAGAA AATTCTCTTA AAGACAAAGA TGAAAAGGTG AACAAGGGAG CTGAATCTAC TGAAGATAAA AAGGTAGATG AATCTAAGGA TAAAGCTACT AATAAAGTAG AATCAAATAA AACTAAACAA GCTAAACCTT TAGAATCTAG TAAGCCAGTT GAAGAAACTA AAAAAGAAGA ATCAAGTAAG TCTTCTGAAA CAGTTACAAA AACTGCCTTT ATAAAGGCTA ATGGTGGTTT ATGGCTACAT TCTAGTAAAG ATACTTCATA CTCTTCTAGA GTAAGCCTAA TGAGTAATGG AGCTAAGGTT AATGTTTTAG AGGAAGATAA CTCTTGGTTT AAGGTTGACT ATAATGGAAA TACTGGTTGG TGTTCAAGTA AATATGTAAC TAACCCAGTT TCTTCTCATA GTTCAACTAA TAAAAAAGTA GAAAAAAATA AATCTACAGA ACCTAAAAAA ACTATGGAAG AAAATAAACC TAAGAAAGAA GCTGAAACAA GCAAACCTGC TTTGACAATT GTTAAAACTG CTTCTGTAAA AGCTAATGGT GGACTATGGT TACACTCTTC TAAGGATTCC TATATCTCTT CTAGATTAGC AATTATGGAT AAAGGTGAAA AAGTTACAAT CTTAGAGGAA AGTGGAGATT GGTTTAAGGT TAACTATAAT GGTAAAACTG GCTTCTGTGC AAGTAAATAC TTAAGTGAAC CTACTACTTA TGAAAAACAA TCTGAATCTA ATAAAATAGA GGAAGTTAAA AATACCTCTC CTAGTAAGGC TGTAGAAGAA AGTAAACCTA CTGCACCTAA GAAAACTGTG GAAGAAAATA AGCCTAAAAA GGAAACTGAA ACAAGTAAAC CTACTTTAAC AAATATTAAA AGAGCCTCTG TAAAGGCAAA TGGTGGTTTA TGGCTACACT CTACTAAGGA TTCCTCTGCA TCTTCTAGAA TTACCATAAT GAGTAATGGA GAAAAGGTTG ATATTTTAGA TGAAAGTGGT TCTTGGTACA AGATTAACTA TAATGGAACT ATGGGTTGGT GCCCAAGTCA ATTCTTAAGT AACCCAACTG TTATCTCTCA AAGCTCACAA AGCAAGGCTG TAGAGGAAAA TAAACCAGTT TACGAAAACA AAACTGTTGA AGTAAGTAAA CCAGTTAATA GTACAGTAAA AACAGCTTAC ATAAAAGCAA ATGGAGGTTT ATGGTTACAC TCCTCTAAAA ATTCCTATGC TTCTTCTAGA ATAAGCATAA TGAACAAAGG ATCAAAGGTA AGAGTTTTAG AAGAAAGTGG TTCTTGGTTT AAAGTTGACC ACAATGGAAA TATAGGTTGG TGTTCAAGTG AATTCCTAAC TAATCCAGTA ACATCTAAAA GTAATACTGT GGAAGAAAGT AAACCAGTTC ATCTAGTTCA ATCAAATACT AATGAAACTT CATTAAGATC AGCCCATGTA AAAGCTAATG GAGGATTATG GCTACACTCA TCTAAAGACT CCTCTACTTC ATCTAGATTA ACTGTAATGG GTAATGGTCA TAAGGTTGAA ATCTTAGAAG AAAGTGGAGA TTGGGTTAAG GTTAGATATA ATGGAAACAC AGGTTGGTGT GTTAAAAAAT TTATAGCCTA A
|
Protein sequence | MAVAIVQQQQ QQPNDKINID LSNSNVAVVK NKKNTNPPIK DEKESKDKDN LKSDKKTEPK IEEKKQDKGN SKEEDKAQVK TEPKVKEDKD TKKSSEEIKE EVKNQKPVIK EVKLPVEKDS KNSSENLSNN SSDFASDVNR RLYEYESVRS NQSNAMNEAI RLHNGDPSNT CVFFQGTCLR AIGVPVPSHI GYTSNLWNWL ENNNWEMHRD FSNIQKGDII FAGEYHTMCF MGWKDKANGI AYVMGNEAYT YGDAYAHRNL NGQAPTKENG WNRQYRATRY YKYKGNSSKN NSDIVKTNIV TSKPLSSKDH KYTVEALKGT VTAKQDVYIN EKPFPTTEGI KPVGLANGGD KLTVTGRASN GWYEVLYNGQ KAYISHRYTN FLEEKESKSK SSESITPLGV VKTNTSDKKS DSKEDNKKIE NSLKDKDEKV NKGAESTEDK KVDESKDKAT NKVESNKTKQ AKPLESSKPV EETKKEESSK SSETVTKTAF IKANGGLWLH SSKDTSYSSR VSLMSNGAKV NVLEEDNSWF KVDYNGNTGW CSSKYVTNPV SSHSSTNKKV EKNKSTEPKK TMEENKPKKE AETSKPALTI VKTASVKANG GLWLHSSKDS YISSRLAIMD KGEKVTILEE SGDWFKVNYN GKTGFCASKY LSEPTTYEKQ SESNKIEEVK NTSPSKAVEE SKPTAPKKTV EENKPKKETE TSKPTLTNIK RASVKANGGL WLHSTKDSSA SSRITIMSNG EKVDILDESG SWYKINYNGT MGWCPSQFLS NPTVISQSSQ SKAVEENKPV YENKTVEVSK PVNSTVKTAY IKANGGLWLH SSKNSYASSR ISIMNKGSKV RVLEESGSWF KVDHNGNIGW CSSEFLTNPV TSKSNTVEES KPVHLVQSNT NETSLRSAHV KANGGLWLHS SKDSSTSSRL TVMGNGHKVE ILEESGDWVK VRYNGNTGWC VKKFIA
|
| |