Gene CPR_2070 details


Gene Information

Locus tag: CPR_2070
Symbol:
ID: 4206076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2292547
End bp: 2294301
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 29%
IMG OID: 642566620
Product: subtilase family protein
Protein accession: YP_699379
Protein GI: 110803826
COG category:
COG ID:
TIGRFAM ID:
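
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2292547, 2294301)
print(length, protein_length_aa(length))  # 1755 584
```

Under those assumptions the computed values match the Gene Length (1755 bp) and Protein Length (584 aa) fields of this record.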


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.725606
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATTTA AGTCTTCAGC TCCAGAATAT GTATTTAATA ATGAAAATTA TTTAAATTAT 
CTAGTTCAGT ATCAAGGTGA CATTATAGGA GAGTTTAATA ATCAAAATGG AATATATGCA
ACTTTAATTA ATGATAGATA TGCCATAATA ACTATTGATA AAAATTTACA GGAATATGAA
AATGATATAT CTATGATTAA ATATAAAAAA GAAAATGGAA GAGATGTAGA AATAGATAAC
ATTACTTATA TAAAAGATCC AGAAGGATAT GTACTTCAGG AAATAAGCCC TTTACAAGCA
GCAAATATTG AATATGTTCA AATACAATCA TACTTTAATC TAACAGGTAA AGGGGTTATT
GTTGGAATTT TAGATACTGG AATAGATTAT TTAAACGAAG AGTTTATGGA TTCATATGGT
AATACAAGAA TTTTAGGAAT TTGGGATCAA ACAATTTCAA GTGAAAGTTC AAGGGAGAAT
AATTTACCTT ATGGAACTTT TTATTCAGAA GAAGATATAA ATAGAGCAAT AAGACTTAGT
AGAGAAGGAG GAGATCCATA TACAATAGTT CCATCAAGAG ATGAAATAGG CCATGGAACA
TCAATGGCAG GAATAATTGG ATCATCAGGA AAAAATCCTA GGTTAAAGGG GGTTGCACCA
GATTGTAAGT TCTTAGTTGT AAAGCTTGCA CAGTCACTTT ATTATAAAAA AGAGTATGAA
ATAAAAATAC CTGTTTATAA TATAACTGAA ATATTTACAG GAATACAATA TTTATATTCA
TACTTTTTAA AAAGATCTCA AAGTATGATT ATATATTTAC CTTTAGGCAC TAATAGAGGA
AGTCATAAAG GAACAAGCAT CTTAGAAGAA TTTTTAGATT CTATATTAAT AAATAGAGGA
ATTGCTTTAA TTACAGGTGC TGGAAATGAA GGAGCAGCAT TATTACATGG ATCAGGAACT
ATAAAACCTA ATGGACAAGT AACTACTCAT GAATTTAATA TAGATGAAAA TCAAAAAAAG
ATTATCATTG AAGCTTGGAT ACAAATACCT AGTATTGCTT CCGTAGAAAT AGTTTCACCT
ACTGGAGGAA CTACAGGAAT AATTCAACCT TTCTTTGGAA AAGGAAATAA GTATTATTTT
ACAATTGAAC GAACTACAGT GTTAGTAAGT TATTATATAC CTGAAGAAAT TTATGAAGAT
TCATTGATAT TAATTATACT AGATAATGTT CAAGCTGGAA TATGGAGTTT TAAGTTTAGA
GGATTAAATA ATATAGAAGG AAGATATAAT ATTTGGCTTC CGCCAAAAGG GATAAGTAAA
GAAAAAACTA GAATGATATA TCCTGACCCA TATGGAACAG TAACTGTCCC AGGTACTAGC
ATCTCTGTAA TAACAGTTGC TGCATATAAT CAATTAAATA ATACACAATT AATTTATTCA
GGGAGAGGAT TTAAAGATAA TTATATTGAT ATTATAGATG TAGCAGCAGG AGGAGTAAAT
GCACTTACAG TTGCGCCAGA TAATAAGACA ACTTTAGCAA ATGGTACAAG TGTTGCAGCG
GCTATAGTAG CTGGAATTTG TGTTTTACTT TTCCAATGGG GAATTGTTGA AGGAAATTAT
CCTTATATGT TCTCTCAAAC TTTAAAAGCT TTTATTACAA GAGGTACAAG AAAGAGAAAA
GGGGACACTT ATCCAAATCC AGAGTGGGGA TATGGAATAG TAGATATGTT TAATATGTTT
AATCTTACTA ATTAA
 
Protein sequence
MEFKSSAPEY VFNNENYLNY LVQYQGDIIG EFNNQNGIYA TLINDRYAII TIDKNLQEYE 
NDISMIKYKK ENGRDVEIDN ITYIKDPEGY VLQEISPLQA ANIEYVQIQS YFNLTGKGVI
VGILDTGIDY LNEEFMDSYG NTRILGIWDQ TISSESSREN NLPYGTFYSE EDINRAIRLS
REGGDPYTIV PSRDEIGHGT SMAGIIGSSG KNPRLKGVAP DCKFLVVKLA QSLYYKKEYE
IKIPVYNITE IFTGIQYLYS YFLKRSQSMI IYLPLGTNRG SHKGTSILEE FLDSILINRG
IALITGAGNE GAALLHGSGT IKPNGQVTTH EFNIDENQKK IIIEAWIQIP SIASVEIVSP
TGGTTGIIQP FFGKGNKYYF TIERTTVLVS YYIPEEIYED SLILIILDNV QAGIWSFKFR
GLNNIEGRYN IWLPPKGISK EKTRMIYPDP YGTVTVPGTS ISVITVAAYN QLNNTQLIYS
GRGFKDNYID IIDVAAGGVN ALTVAPDNKT TLANGTSVAA AIVAGICVLL FQWGIVEGNY
PYMFSQTLKA FITRGTRKRK GDTYPNPEWG YGIVDMFNMF NLTN
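
The relationship between the gene sequence and the protein sequence, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the method the database used: it builds the codon table by hand, relies on the fact that the sense-codon assignments of translation table 11 are the same as in the standard code, and does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAATTTA AGTCTTCAGC TCCAGAATAT"))  # MEFKSSAPEY
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and the GC content field in this record.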