Gene CPR_0959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0959 
Symbol 
ID4206192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1090456 
End bp1092378 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content31% 
IMG OID642565516 
Productglycosyl hydrolase family 20 protein 
Protein accessionYP_698282 
Protein GI110803708 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00497875 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTA AATTAGTTGG ATTAAATGAG GAAATGTTAG AAGCTGTCAA GGGATTAGAA 
GAACTAATTG ATTTTCAGCT TTATAATGAG AAGGGGAGTA AGAATGAAAA TTTAGAGATT
ATAAATGTTT CTAAGCTTCC TGAAGAATCA GAAAATGTAA TAGAAGTCCT TAGAGGAAAA
AATAATGAAA TAAGATATAA GAAAAAACAC CACTTTTTTA GAGCACTTAG TTTATATCTT
CAATTTCTTA AAAAGGGAGA GGAGAGTTTT TCAAGAAGTG AAAAAACTTA TATTGATTCA
GTAGGGGTAA TGATAGATGC CTCAAGAAAT GCTATTTATA GAGTTTCTGA GGTTAAAAGG
ATATTAGGGT ATATGGCCTT AATGGGACAT AATAGGTGTA TGCTTTATAC TGAAGATACT
TATGAGATTG AGGGTTATCC ATACTTTGGT TATATGAGAG GAAGATATAG CAAAGAGGAG
TTAAGGGAAA TTGATGATTA TGGATACTCT TTAGGTATAG AAGTTGTACC TTGCATTCAA
ACTTTAGCTC ACTTAAAACA AACTTTAAGA TGGCCTTATG GGGAAGGGAT GAAAGACACT
CAAGATGTAC TTTTAGTAGG AGAGGAAAAG ACTTATAGGT TTATAGAGGC AATGATTTCT
TCATTAAGAG AATGTTTTAG AAGTAAAAAT ATTCATATAG GTATGGATGA GGCTTTTGAT
TTAGGAAGAG GCCATTATTT AACTAAACAT GGACATGTTT CTCACCAAAA GCTTATGGTA
GAGCATTTAA ATAAGGTTAA TGAAATAGCA AAAAAATATG ATTTTAAGCC AATGATTTGG
GATGATATGT TTTTAAGATG TGGAGCTCCA GATGGTGGAT ATTACGATTT AGATATAGTT
ATAACTCCTG AAATAGCAAA TAATATTCCA GAGGAAGTTT CCTTAGTTTA TTGGGATTAT
TATAATTCTG ATGAAGAAAA ATATAAAAAG CTTTTAGATA TTAGAGATGA TTTTAATAAC
AATATTATCT TTGCAGGAGG ATGCTGGAGA TGGAGTGGAT TTGCTCCAAA CTATTCAAAG
ACCTTTGAAA CAACAAATGC TGCCTTAAAT CAATGTAAGG CTAAAGGAAT AAAAGAAGTC
TTTGCAACGG CTTGGGGAGA TGATGGATCA GAAACCCCAA TATATTCAAT AATAGTAGGT
CTTATATTAT TTGGAGAGCA TGGCTATTAT AACAAGGTAG AGAAAGAATG GATAGATGAA
AGATGCAAAT CTTTAACAGG GCTTAGCATG GAGGATTTTA CTTCTTTGGA AGAGTTAGAT
CTTGTTCCAA CAGTTAAAAC TCCTAATATG GAGGTTTGTA ATCCTTCAAA ATATATAGCC
TACCAAGATT TGCTTTTAGG AGCCTTTGAT AAGCATTTAG AGGGTTTAGA TTTAGAAGAA
CATTATATTA ATCTTTCTAA AAAATATGAG GAAATAGGAG AGAGAAGTGA GAGGTTTAAA
TTAATGTTTA CTATGTATTC TAAACTTGCT GCTTACCTTT CAGTTAAAAG TGAGATAGGA
CTAGAAATAA GAAAAGCTTA TTTAGAAAAG GATAAGGATG CCTTAAGACT TATAGCATAT
AACTTTATTC CAGAAATACA AGAAAAGCTA AAGAGTTTTC ATAAGAGTTT TAGGGATCTA
TGGTATAAGG AATGTAAAGG ACAAGGCTTT GAAGTTATAG ATATTAGACT TGGTGGAGTT
ATGGCAAGAT GTGATTCAGC TATTTATAGA ATAAAAGCTT ATCTTAAAGG AAATATTGAT
AAAATAGAAG AGCTAGAAGA GGAAAGACTT TATTTTTCTG AACATTTTGG AGGCGATGAT
TGTAAGCTTA TATGTTGCAA TGAATATGAA AAAATAGCAA CACAAAATAT TTTAAGTTGG
TAA
 
Protein sequence
MRVKLVGLNE EMLEAVKGLE ELIDFQLYNE KGSKNENLEI INVSKLPEES ENVIEVLRGK 
NNEIRYKKKH HFFRALSLYL QFLKKGEESF SRSEKTYIDS VGVMIDASRN AIYRVSEVKR
ILGYMALMGH NRCMLYTEDT YEIEGYPYFG YMRGRYSKEE LREIDDYGYS LGIEVVPCIQ
TLAHLKQTLR WPYGEGMKDT QDVLLVGEEK TYRFIEAMIS SLRECFRSKN IHIGMDEAFD
LGRGHYLTKH GHVSHQKLMV EHLNKVNEIA KKYDFKPMIW DDMFLRCGAP DGGYYDLDIV
ITPEIANNIP EEVSLVYWDY YNSDEEKYKK LLDIRDDFNN NIIFAGGCWR WSGFAPNYSK
TFETTNAALN QCKAKGIKEV FATAWGDDGS ETPIYSIIVG LILFGEHGYY NKVEKEWIDE
RCKSLTGLSM EDFTSLEELD LVPTVKTPNM EVCNPSKYIA YQDLLLGAFD KHLEGLDLEE
HYINLSKKYE EIGERSERFK LMFTMYSKLA AYLSVKSEIG LEIRKAYLEK DKDALRLIAY
NFIPEIQEKL KSFHKSFRDL WYKECKGQGF EVIDIRLGGV MARCDSAIYR IKAYLKGNID
KIEELEEERL YFSEHFGGDD CKLICCNEYE KIATQNILSW