Gene CPR_0592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0592 
Symbol 
ID4206241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp706497 
End bp709172 
Gene Length2676 bp 
Protein Length891 aa 
Translation table11 
GC content28% 
IMG OID642565152 
Productcell wall binding repeat-containing protein/mannosyl-glycoprotein endo-beta-N-acetylglucosamidase 
Protein accessionYP_697919 
Protein GI110803679 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0288009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGAA AAACAGCTAC TATGTTAGCT ATATTTATAA CACTTTCAAC AAGTAATGTA 
TTTGCTATAG ATATTAAAAT TAATAATAAT TCTAAAAATA AACAAGAGAA AAGTGTTGAG
AGTTACAATC AAAAAGCTTC AGAAAATAAT AAGGATAAAA TTAATGAAAA AAATGTTAAT
GTTGATGAGG CGATAAAAGA AAAGGATACA AATATAGATA ATGACATAAT GAATGGTAAA
GAGTCTGAGG AAAAAAAAGA TAATAATAAT CAAGACATAT ATGTAGAACC AAAAAATAAT
TTAGATACAG AAAATACAAA CAATAATGAT AATTCGAAAG ATTGGAAAAA AGAAGTTGAA
AAAAATAGAG ATGAAAATAA GATTAAAGCA GCTGATGAAG ATAATAAGGT TGAAAATAAG
GATGATTTGG TTGGGAGTAT TTATGAAAAG AAAAGTATTA ATCTAAAGCC TAAATTTCAT
CAATTTTTTA GTATAAATAA TAGTGAGAAC GAGCTAAAAT ATGAGCAAAT AAGAGATTCA
AATGGACTTG AATCATTAGT ACGTTTACAT CCTAAAACAG ATAAGAAGTA TGAGATTGCT
TTAGCTAAAG ATGATGGAAC CTTTGTTTAT ATAGGAAGTT ATGATGATTA TAATATTGCA
AAAAAATCAA CAGATCATAG TAATTATAAT GTTCAGAGTT TTTCTGAAAC TGGTGGCATT
CCTATTATTT TAAATTCAGA AGGAAAAGTA ATATATGCTG AAAAAGCATT AGGCAGGATA
GTTGGGATTA AAGGACAAAG TACTATTAAT ATTTGTTCAG ATTCAGGATT AAAAAATGCG
TTTACTTATA TAGCTGCTTC ATATGTTGAT GATGTTCCAA TAATAAGATA TGATGGAAAT
GCTGTAGAGA TTGTTATTAA TGGATATAGG GGGTGGATAA GCAGAGAGGC TATAGATATT
GTTCCTTTAA ATCAAGTAAA AAATCCAAGC AATTATGTTA ATAGATATGG TGAGTTAAGT
CATTTTATTA GTGATAATTT AATGAGTAGT GAAGAATATT ATCGTTATCC TTTAGGTAAA
TCACCAAGTT ATTTAAAAGA ATGGAAAAAA TATTATAATT ATGATGGTAA ATATTTTTAC
ACAAGTTTAG AATTATTAAT AGATGATTTA CAAAATAATA CTCATAAAAA TGCTGTGAAT
GTTAATGAGC CTTATTACAA CTATTACTAT TATTTACCCA TGAGAAGCAA GACAAGCTTT
ACTGTTGATC AAATAAATAC GTATATTAGT AATAACTCAG CTTACAATAG TAAACTTAGA
GATACAGGCG AATATTTTAT TGAGGCTCAA AACAAATATG GAGTAAATGC GTTAATTATG
TTAGGAATAG CTATAAATGA GTCAGCATGG GGAACTTCAT ATTATGCAAG AAATAAAAAT
AATTTATTTG GAATAGGAGC TGTTGATTCA AATCCAGATG ATGCATTTTA TTTTGAAAGT
GTAGAGCAAT GTATAAATGA ATTTGCAAAA TATCAAATGT CTAGAGGATA TTCAGATCCA
TATAATTGGT CTTATAATGG AGCGGCTCTT GGAAATAAAG GATTTGGAGC GAATATTCAA
TATGCATCAG ATCCATTTTG GGGAGAAAAA GCCTCAAGTA ATTTCTATAA ATTAGATTAT
CATGTTTCTG GAAATGGATT AGCAGGATTA ACTGAGTATG ATAAATATCA ACTAGGATTA
TATATAGGCA ATAGTAAAGT AACAGATAAA TTTGGAAATT TATTATATAA TATAGGAAGT
ATGGAGAGTA GTAAGGTCGG GAAAATTGGA ACAGCTGTAA TAATAAACGA TTTAAATAAA
AAGGATATAA ATAATAGTTT AAACTATACT ATACAACCAG ATAGAACTAT TCCTATAAGC
AATGGAAAAA TAAGTGGGCT TTATGATTGG AAAAATGGAT ATACTCCAAT TGATGGTGTG
AAGCTAATAA ATCAAGGGAA AAATGTGACT AAGCCATCTG GTTGGATTAA TAAAGATGGT
AAATTTTATT ATCAGCACTC AGATGGAACT TTAGCAAAAG GTTGGTTAGA TTTAAGTGGA
TATTGGTATT ATTTAGATAA TACCACTGGA GAAATGAAAA CTGGACTTCA AGAAATAAAT
GGATATAAAT ATTATCTTGA TGAAAGTGGT TATATGAGAA CTGGTTGGGT TAACTATAAA
GATGAATATA GATTTTTTGG TGAAGATGGT ACTATGAAAA CAGGATGGAT AAATGATGGT
TGGACAGATT ATTATTTAAA ACCGGATGGA ACAATCTATA AAGGATGGTT AGATGATGGC
TTAAATAAAT ATTATATGGA TGAAAATGGT CAGATGAGAA AAGGTTGGGT CAAGTATAAT
GGAGAATATT ATTTCTTTGG ACCAGATGGG GCAAGGAGAA CTGGATGGAT AAATGATGGA
TATGCTTATT ATTTCTTAAA AAAAGATGGA ACTATTCATA CTGGTTGGCT AAAGGAAAAT
GGTCAAAAAT ACTATTTAGG TTCTGAAGGC GATATGAAAT TAGGTTGGTA TAATATTGAT
AATAATTGGT ATTATTTTGA TAATTCAGGA GCTATGATGA CAAATACTAT AATAGATGGT
TGGGAAATAG ATGGTAATGG AATTGCAAGA AAATAA
 
Protein sequence
MIRKTATMLA IFITLSTSNV FAIDIKINNN SKNKQEKSVE SYNQKASENN KDKINEKNVN 
VDEAIKEKDT NIDNDIMNGK ESEEKKDNNN QDIYVEPKNN LDTENTNNND NSKDWKKEVE
KNRDENKIKA ADEDNKVENK DDLVGSIYEK KSINLKPKFH QFFSINNSEN ELKYEQIRDS
NGLESLVRLH PKTDKKYEIA LAKDDGTFVY IGSYDDYNIA KKSTDHSNYN VQSFSETGGI
PIILNSEGKV IYAEKALGRI VGIKGQSTIN ICSDSGLKNA FTYIAASYVD DVPIIRYDGN
AVEIVINGYR GWISREAIDI VPLNQVKNPS NYVNRYGELS HFISDNLMSS EEYYRYPLGK
SPSYLKEWKK YYNYDGKYFY TSLELLIDDL QNNTHKNAVN VNEPYYNYYY YLPMRSKTSF
TVDQINTYIS NNSAYNSKLR DTGEYFIEAQ NKYGVNALIM LGIAINESAW GTSYYARNKN
NLFGIGAVDS NPDDAFYFES VEQCINEFAK YQMSRGYSDP YNWSYNGAAL GNKGFGANIQ
YASDPFWGEK ASSNFYKLDY HVSGNGLAGL TEYDKYQLGL YIGNSKVTDK FGNLLYNIGS
MESSKVGKIG TAVIINDLNK KDINNSLNYT IQPDRTIPIS NGKISGLYDW KNGYTPIDGV
KLINQGKNVT KPSGWINKDG KFYYQHSDGT LAKGWLDLSG YWYYLDNTTG EMKTGLQEIN
GYKYYLDESG YMRTGWVNYK DEYRFFGEDG TMKTGWINDG WTDYYLKPDG TIYKGWLDDG
LNKYYMDENG QMRKGWVKYN GEYYFFGPDG ARRTGWINDG YAYYFLKKDG TIHTGWLKEN
GQKYYLGSEG DMKLGWYNID NNWYYFDNSG AMMTNTIIDG WEIDGNGIAR K