Gene CPR_2200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2200 
Symbol 
ID4205206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2429001 
End bp2430329 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content30% 
IMG OID642566750 
Productserine protease 
Protein accessionYP_699500 
Protein GI110803298 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.550757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATT TCAATAAAAA AGATGAAGGT ATAGATAACT ACTTTGGTAT GGAAGACAAA 
GAAAACATAG AATCAAACAA TTATACTGAG CAAACTAATA TAGATGAAAC TAATAAATTT
AATATAGATA ATGAAATTAA TTCAAAAGAT GAAGTTGAAA AAGAGGATGA TAAAAACTTC
TCTGACATAA AATCAAAAAA TTCTAATGAT AATATTAAAA GTAAAAAAGT AAAGAAAAAG
AGTGGGTTTA AAAGGGTAAT AGCTCTAGTA GCTGGTGCTG TCATAGTTGC TATACTAGGA
GGATCTATAG GAGCTAGTGG AGTTTATTAT GCTTTTAAAA ATAGCATACC AGTAAGTACA
CTAGAGAATA ATAGTAATAC CCAAGTTAAT CCACCAGCCT TTAAAGTGGA AGATGGAGCA
TTAACTGTTC CTCAAGTAGT TGAAAAAGTT ACACCTGCTG TTGTAGGAGT ATCCACAAAG
AGCTTAGTAA GAGATCAATT CTTTAATGTA AAAGAACAAG AAGGATTAGG ATCTGGATTT
ATAATAAATG AAGATGGATA TGTAGTTACA AACTACCATG TTATAAATGG AGCTCAAGAA
GTTAAAGTAA TATTCTCTGA TGGAAAAGAA GTAAATGCTA AGGTTGTAAA TTATGATGCT
GAAAGAGATA TTGCAGTAAT AAAAATAACA GACAATGTTA AAATGCCTGG AATAGCACAA
TTAGGAGATT CATCTATAGT TAAAGCTGGT GAAGAAGTAA TTGCTATAGG AAATCCCCTA
GGAAAAGAAT TTAGCTCAAC AGTAACTAAG GGTATAATAA GTTCTCCAAA TAGAAAAATG
AAGACTGAAA ATGGAAATGT ATTAGATTAT ATACAAACAG ATGCAGCTAT CAACCCAGGT
AATAGTGGGG GTCCATTAAT AAACTCTAAG GGAGAAGTTA TTGGAATAAA TACGGCTAAA
AAAGTTGGTG AAGATATTGA AGGTATCGGA TTTGCAATTC CTATAAATGA AGTAAAAACT
AGATTAGGTT CTTTATCAAA ACCAATATTA AAACTTGGTA TTACGGCTAG AACTGTCACT
CCAGAATTAG CAAAAGAAAA TAATATAGAA GAAGGAATTT ATGTTGTAGG TGTACAAGAA
TTTAGTCCAG CAGAAAAATC AGGATTAAAA ATAGGTGATT TAATAATTGC TTTTGGTGGA
AAAAGAGTAA AAACTTTAGA AGAATTAAAT CAGATTAAAA GTCAATATAA TGATGGAGAT
TCAGTACCGA TTGAAATAAT TCGAGATGGT AAAAAAGTAA ACTTAAATTT AACATTAGTT
GCTAATTAA
 
Protein sequence
MSDFNKKDEG IDNYFGMEDK ENIESNNYTE QTNIDETNKF NIDNEINSKD EVEKEDDKNF 
SDIKSKNSND NIKSKKVKKK SGFKRVIALV AGAVIVAILG GSIGASGVYY AFKNSIPVST
LENNSNTQVN PPAFKVEDGA LTVPQVVEKV TPAVVGVSTK SLVRDQFFNV KEQEGLGSGF
IINEDGYVVT NYHVINGAQE VKVIFSDGKE VNAKVVNYDA ERDIAVIKIT DNVKMPGIAQ
LGDSSIVKAG EEVIAIGNPL GKEFSSTVTK GIISSPNRKM KTENGNVLDY IQTDAAINPG
NSGGPLINSK GEVIGINTAK KVGEDIEGIG FAIPINEVKT RLGSLSKPIL KLGITARTVT
PELAKENNIE EGIYVVGVQE FSPAEKSGLK IGDLIIAFGG KRVKTLEELN QIKSQYNDGD
SVPIEIIRDG KKVNLNLTLV AN