Gene CPR_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0402 
Symbol 
ID4205941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp487046 
End bp489064 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content26% 
IMG OID642564959 
Producthypothetical protein 
Protein accessionYP_697731 
Protein GI110803962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.813066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATT ATTCTAATAA AATAGAGGAA TTAAAAGCAA AGATTAATAA ATACATGCAA 
GGATATGATA CTGAGTTTGT AAGGTACTAT ATAAATCATA ATTGCAAGAA AGAGATAGAT
AAAAAGTTAA TAGGTGCTAA TTTAATTCTT AATAATTCTT TTATATTCGA TGATGAATGG
GATATGGAAC AATGTAAAAT TCCATATTTA AATAGAAATT TAGATTGGAA CTTTACTCCT
AATGGAGATG AGGAATGGGT ATTTATGCTT AATAGACATG AATACTTTGA AAAATTAATT
GCATCCTATT ATTTTAGTAA TGATGAAAAA TATTTAGATA AGTTAAAGGA ATTAATATTT
AATTGGATTG AAAATAATGA GATAAAAGAG TGTGGAGGAC CAACAATAAG AACAATAGAT
ACTGGAATAA GATGTTTTAG TTGGATTAAG TCTCTTTTAC ATTTAATTCA TGAAAATAAG
TTAGAAGATG AAGAAATATT AAAAATAATT TTTAGTATAA AAGAACAATT AGAGTATTTA
AAAAAATCTT ATATTGATAA ATATGTTCTT AGTAATTGGG GAGTATTACA GACAACAGCA
ATCATTACTT GCTCCTTATG GTTAAAGGAC TTTATAGAGG ATGAAGAATT ATATAAATGG
GCCCTTGAGG AACTATACAG GGAAATAAAT CTTCAAGTTT TAGAAGATGG TTCACATTGG
GAGCAGTCTG TAATGTATCA TATTGAAGTA CTAAATTATT CTATGGCAAC TATACACTAT
GTTAAGTATT TTAATGTTGA TTTAGATGAG GAGTTCTTAG AAAAAATACA TTCTATGGCT
AAATATTTAG TTTACTGTGG AGATTCAAAT TCAATTCAAG TTGCTCAAGG GGACAGTGAC
AGAAGTGATA TTAGAGATGT TCTTCTTAGG GCATCTATAT TATTTAATGA TCCTCATTTA
AAATTTAGAG CATATGAAAC TATGGATTTA ACCAGTATAT TATTGTTTGG AAGAGATGGA
TTTTTAAAAT ATACAAATAT GGATGCAGAG GAAATTACAA TGCTTAATAA GTCCTTTATA
GACTCTGGTA ATATTTATAT AAGAAGTGGA TGGGACAAAG AGGCAAGTTT TACTTATTTA
CAAAATGGTA CTCTTGGAAG TGGACATGGA CATTCAGATT TATGCCATTT TTCAATTCAT
TATGGTGGAG AACCATTTTT AATAGATTCA GGTAGGTATA CCTATGTTGA AAGTGATTTT
TTAAGGGAAT ATTTAAAATC TGTTAAAGCT CATAATGTAT CTATAATTGA TGATTCTCCT
TTTGCTATTC CTAAGAATTC ATGGAAATAT AATAAATATC CAGATGTTAT GAAAAATTAT
TTTAATGAAA AAGATAATAT AGCTTATGCT GAAATGGCTT ATTTAGCAAC TTTAAGTGAT
GGAACACCAT ATACGGTTAT TAGAAAGGTG TTAGTTATAT CTCCAGATAT ATGGGTAATA
GTTAATGATA TAAGGTGCAG TGGAAAGCAT ATATGCAAAA ATTATTATAA TTTAGATTAT
AAGGTTAAGG TGATTAAAGA AGAGGGTTAT TTTAGATGTG TAAATAAAGA AAGTGAAATT
AAGATTTATA ATAACAATGT AGATAAGAAA TATATAGAAA ATACTTTAAT TTCAACAAAT
TATAATAGTA TAAATAATAG TAAAAAGATA GTAACACAAT GTACTTTTGA AAATAATTTT
GTGAACTATG ATATAATTTT AGGGCAGAAT TTAAAAAGTA TAGAAATAAA GGATCCTAGT
ATTGTACAAT ATAATTCAAA AGAAAAGATT AATACAAGTG TGGCAATAAC TAAAGAATTT
GTAATAAATG AAAATGAGAG TTATACTATT ATAATATTTA ATAAAGAAAC ATTTAAAGGG
GCTAAGGTGT ATATATATGA CTCTTTGGTT TTATATGGTA AGGTTATTGT TGTGCATAGA
TTTAAAGACG AGAGAGAAAT AATTCGTTTG AAAGCATAA
 
Protein sequence
MSNYSNKIEE LKAKINKYMQ GYDTEFVRYY INHNCKKEID KKLIGANLIL NNSFIFDDEW 
DMEQCKIPYL NRNLDWNFTP NGDEEWVFML NRHEYFEKLI ASYYFSNDEK YLDKLKELIF
NWIENNEIKE CGGPTIRTID TGIRCFSWIK SLLHLIHENK LEDEEILKII FSIKEQLEYL
KKSYIDKYVL SNWGVLQTTA IITCSLWLKD FIEDEELYKW ALEELYREIN LQVLEDGSHW
EQSVMYHIEV LNYSMATIHY VKYFNVDLDE EFLEKIHSMA KYLVYCGDSN SIQVAQGDSD
RSDIRDVLLR ASILFNDPHL KFRAYETMDL TSILLFGRDG FLKYTNMDAE EITMLNKSFI
DSGNIYIRSG WDKEASFTYL QNGTLGSGHG HSDLCHFSIH YGGEPFLIDS GRYTYVESDF
LREYLKSVKA HNVSIIDDSP FAIPKNSWKY NKYPDVMKNY FNEKDNIAYA EMAYLATLSD
GTPYTVIRKV LVISPDIWVI VNDIRCSGKH ICKNYYNLDY KVKVIKEEGY FRCVNKESEI
KIYNNNVDKK YIENTLISTN YNSINNSKKI VTQCTFENNF VNYDIILGQN LKSIEIKDPS
IVQYNSKEKI NTSVAITKEF VINENESYTI IIFNKETFKG AKVYIYDSLV LYGKVIVVHR
FKDEREIIRL KA