Gene CPR_1266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1266 
SymbolentA 
ID4206529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1423866 
End bp1426736 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content32% 
IMG OID642565822 
Productputative enterotoxin 
Protein accessionYP_698588 
Protein GI110801837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTAG CAATAGTTCA ACAACAGCAG CAACAACCAA ATGATAAGAT TAATATAGAC 
TTGTCTAATT CTAATGTTGC TGTGGTAAAA AATAAAAAAA ATACTAATCC CCCTATAAAA
GATGAAAAAG AATCAAAGGA TAAAGATAAT TTAAAATCTG ATAAAAAAAC CGAGCCTAAA
ATTGAAGAAA AAAAACAAGA TAAAGGTAAC TCAAAGGAAG AAGATAAAGC TCAGGTAAAA
ACTGAACCTA AAGTTAAAGA GGATAAAGAC ACAAAGAAAA GTTCTGAAGA AATTAAAGAA
GAGGTAAAAA ATCAAAAGCC TGTTATAAAA GAAGTAAAAT TACCTGTTGA AAAAGATTCA
AAGAATTCTT CTGAAAATTT ATCTAATAAC TCAAGTGACT TTGCTTCAGA TGTTAATAGA
AGATTATATG AATATGAAAG TGTTAGAAGT AATCAATCTA ATGCAATGAA TGAAGCTATT
CGTCTTCATA ATGGAGACCC TAGCAACACT TGTGTTTTCT TCCAAGGTAC TTGTTTAAGA
GCAATAGGAG TACCTGTTCC AAGTCATATT GGATACACTT CTAACCTTTG GAATTGGCTT
GAAAATAATA ACTGGGAAAT GCATAGAGAT TTTTCAAACA TTCAAAAGGG AGATATTATC
TTTGCAGGTG AATATCACAC AATGTGTTTC ATGGGATGGA AGGATAAAGC AAATGGTATT
GCCTATGTAA TGGGAAATGA AGCCTATACC TATGGAGATG CTTATGCTCA TCGTAATTTA
AATGGACAAG CTCCTACTAA AGAAAATGGA TGGAATAGAC AATATAGAGC TACAAGATAC
TACAAATATA AAGGAAATTC TTCAAAAAAC AATTCTGACA TTGTAAAAAC AAACATTGTA
ACATCAAAAC CTCTTAGCTC AAAGGATCAT AAATATACAG TTGAAGCCTT AAAGGGTACT
GTAACAGCTA AGCAAGATGT TTATATTAAT GAAAAACCCT TCCCTACTAC TGAAGGAATT
AAGCCTGTAG GACTAGCTAA TGGAGGAGAT AAATTAACTG TAACTGGTAG AGCCTCAAAT
GGATGGTATG AAGTTTTATA TAATGGCCAA AAAGCTTACA TAAGTCATAG ATATACTAAC
TTCTTAGAAG AAAAGGAATC TAAATCTAAG TCCTCTGAAT CTATTACTCC TCTTGGCGTA
GTTAAAACTA ATACTTCAGA TAAAAAATCT GATTCTAAAG AAGATAATAA AAAGATAGAA
AATTCTCTTA AAGACAAAGA TGAAAAGGTG AACAAGGGAG CTGAATCTAC TGAAGATAAA
AAGGTAGATG AATCTAAGGA TAAAGCTACT AATAAAGTAG AATCAAATAA AACTAAACAA
GCTAAACCTT TAGAATCTAG TAAGCCAGTT GAAGAAACTA AAAAAGAAGA ATCAAGTAAG
TCTTCTGAAA CAGTTACAAA AACTGCCTTT ATAAAGGCTA ATGGTGGTTT ATGGCTACAT
TCTAGTAAAG ATACTTCATA CTCTTCTAGA GTAAGCCTAA TGAGTAATGG AGCTAAGGTT
AATGTTTTAG AGGAAGATAA CTCTTGGTTT AAGGTTGACT ATAATGGAAA TACTGGTTGG
TGTTCAAGTA AATATGTAAC TAACCCAGTT TCTTCTCATA GTTCAACTAA TAAAAAAGTA
GAAAAAAATA AATCTACAGA ACCTAAAAAA ACTATGGAAG AAAATAAACC TAAGAAAGAA
GCTGAAACAA GCAAACCTGC TTTGACAATT GTTAAAACTG CTTCTGTAAA AGCTAATGGT
GGACTATGGT TACACTCTTC TAAGGATTCC TATATCTCTT CTAGATTAGC AATTATGGAT
AAAGGTGAAA AAGTTACAAT CTTAGAGGAA AGTGGAGATT GGTTTAAGGT TAACTATAAT
GGTAAAACTG GCTTCTGTGC AAGTAAATAC TTAAGTGAAC CTACTACTTA TGAAAAACAA
TCTGAATCTA ATAAAATAGA GGAAGTTAAA AATACCTCTC CTAGTAAGGC TGTAGAAGAA
AGTAAACCTA CTGCACCTAA GAAAACTGTG GAAGAAAATA AGCCTAAAAA GGAAACTGAA
ACAAGTAAAC CTACTTTAAC AAATATTAAA AGAGCCTCTG TAAAGGCAAA TGGTGGTTTA
TGGCTACACT CTACTAAGGA TTCCTCTGCA TCTTCTAGAA TTACCATAAT GAGTAATGGA
GAAAAGGTTG ATATTTTAGA TGAAAGTGGT TCTTGGTACA AGATTAACTA TAATGGAACT
ATGGGTTGGT GCCCAAGTCA ATTCTTAAGT AACCCAACTG TTATCTCTCA AAGCTCACAA
AGCAAGGCTG TAGAGGAAAA TAAACCAGTT TACGAAAACA AAACTGTTGA AGTAAGTAAA
CCAGTTAATA GTACAGTAAA AACAGCTTAC ATAAAAGCAA ATGGAGGTTT ATGGTTACAC
TCCTCTAAAA ATTCCTATGC TTCTTCTAGA ATAAGCATAA TGAACAAAGG ATCAAAGGTA
AGAGTTTTAG AAGAAAGTGG TTCTTGGTTT AAAGTTGACC ACAATGGAAA TATAGGTTGG
TGTTCAAGTG AATTCCTAAC TAATCCAGTA ACATCTAAAA GTAATACTGT GGAAGAAAGT
AAACCAGTTC ATCTAGTTCA ATCAAATACT AATGAAACTT CATTAAGATC AGCCCATGTA
AAAGCTAATG GAGGATTATG GCTACACTCA TCTAAAGACT CCTCTACTTC ATCTAGATTA
ACTGTAATGG GTAATGGTCA TAAGGTTGAA ATCTTAGAAG AAAGTGGAGA TTGGGTTAAG
GTTAGATATA ATGGAAACAC AGGTTGGTGT GTTAAAAAAT TTATAGCCTA A
 
Protein sequence
MAVAIVQQQQ QQPNDKINID LSNSNVAVVK NKKNTNPPIK DEKESKDKDN LKSDKKTEPK 
IEEKKQDKGN SKEEDKAQVK TEPKVKEDKD TKKSSEEIKE EVKNQKPVIK EVKLPVEKDS
KNSSENLSNN SSDFASDVNR RLYEYESVRS NQSNAMNEAI RLHNGDPSNT CVFFQGTCLR
AIGVPVPSHI GYTSNLWNWL ENNNWEMHRD FSNIQKGDII FAGEYHTMCF MGWKDKANGI
AYVMGNEAYT YGDAYAHRNL NGQAPTKENG WNRQYRATRY YKYKGNSSKN NSDIVKTNIV
TSKPLSSKDH KYTVEALKGT VTAKQDVYIN EKPFPTTEGI KPVGLANGGD KLTVTGRASN
GWYEVLYNGQ KAYISHRYTN FLEEKESKSK SSESITPLGV VKTNTSDKKS DSKEDNKKIE
NSLKDKDEKV NKGAESTEDK KVDESKDKAT NKVESNKTKQ AKPLESSKPV EETKKEESSK
SSETVTKTAF IKANGGLWLH SSKDTSYSSR VSLMSNGAKV NVLEEDNSWF KVDYNGNTGW
CSSKYVTNPV SSHSSTNKKV EKNKSTEPKK TMEENKPKKE AETSKPALTI VKTASVKANG
GLWLHSSKDS YISSRLAIMD KGEKVTILEE SGDWFKVNYN GKTGFCASKY LSEPTTYEKQ
SESNKIEEVK NTSPSKAVEE SKPTAPKKTV EENKPKKETE TSKPTLTNIK RASVKANGGL
WLHSTKDSSA SSRITIMSNG EKVDILDESG SWYKINYNGT MGWCPSQFLS NPTVISQSSQ
SKAVEENKPV YENKTVEVSK PVNSTVKTAY IKANGGLWLH SSKNSYASSR ISIMNKGSKV
RVLEESGSWF KVDHNGNIGW CSSEFLTNPV TSKSNTVEES KPVHLVQSNT NETSLRSAHV
KANGGLWLHS SKDSSTSSRL TVMGNGHKVE ILEESGDWVK VRYNGNTGWC VKKFIA