Gene Information |
Locus tag | CPR_0959 |
Symbol | |
ID | 4206192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1090456 |
End bp | 1092378 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642565516 |
Product | glycosyl hydrolase family 20 protein |
Protein accession | YP_698282 |
Protein GI | 110803708 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
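The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3             # count includes the stop codon
    return gene_length, codons - 1        # the stop codon is not translated


# Values copied from the record for CPR_0959
gene_bp, protein_aa = cds_lengths(1090456, 1092378)
print(gene_bp, protein_aa)  # 1923 640
```

Both results agree with the Gene Length (1923 bp) and Protein Length (640 aa) fields listed in the record.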
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00497875 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTTA AATTAGTTGG ATTAAATGAG GAAATGTTAG AAGCTGTCAA GGGATTAGAA GAACTAATTG ATTTTCAGCT TTATAATGAG AAGGGGAGTA AGAATGAAAA TTTAGAGATT ATAAATGTTT CTAAGCTTCC TGAAGAATCA GAAAATGTAA TAGAAGTCCT TAGAGGAAAA AATAATGAAA TAAGATATAA GAAAAAACAC CACTTTTTTA GAGCACTTAG TTTATATCTT CAATTTCTTA AAAAGGGAGA GGAGAGTTTT TCAAGAAGTG AAAAAACTTA TATTGATTCA GTAGGGGTAA TGATAGATGC CTCAAGAAAT GCTATTTATA GAGTTTCTGA GGTTAAAAGG ATATTAGGGT ATATGGCCTT AATGGGACAT AATAGGTGTA TGCTTTATAC TGAAGATACT TATGAGATTG AGGGTTATCC ATACTTTGGT TATATGAGAG GAAGATATAG CAAAGAGGAG TTAAGGGAAA TTGATGATTA TGGATACTCT TTAGGTATAG AAGTTGTACC TTGCATTCAA ACTTTAGCTC ACTTAAAACA AACTTTAAGA TGGCCTTATG GGGAAGGGAT GAAAGACACT CAAGATGTAC TTTTAGTAGG AGAGGAAAAG ACTTATAGGT TTATAGAGGC AATGATTTCT TCATTAAGAG AATGTTTTAG AAGTAAAAAT ATTCATATAG GTATGGATGA GGCTTTTGAT TTAGGAAGAG GCCATTATTT AACTAAACAT GGACATGTTT CTCACCAAAA GCTTATGGTA GAGCATTTAA ATAAGGTTAA TGAAATAGCA AAAAAATATG ATTTTAAGCC AATGATTTGG GATGATATGT TTTTAAGATG TGGAGCTCCA GATGGTGGAT ATTACGATTT AGATATAGTT ATAACTCCTG AAATAGCAAA TAATATTCCA GAGGAAGTTT CCTTAGTTTA TTGGGATTAT TATAATTCTG ATGAAGAAAA ATATAAAAAG CTTTTAGATA TTAGAGATGA TTTTAATAAC AATATTATCT TTGCAGGAGG ATGCTGGAGA TGGAGTGGAT TTGCTCCAAA CTATTCAAAG ACCTTTGAAA CAACAAATGC TGCCTTAAAT CAATGTAAGG CTAAAGGAAT AAAAGAAGTC TTTGCAACGG CTTGGGGAGA TGATGGATCA GAAACCCCAA TATATTCAAT AATAGTAGGT CTTATATTAT TTGGAGAGCA TGGCTATTAT AACAAGGTAG AGAAAGAATG GATAGATGAA AGATGCAAAT CTTTAACAGG GCTTAGCATG GAGGATTTTA CTTCTTTGGA AGAGTTAGAT CTTGTTCCAA CAGTTAAAAC TCCTAATATG GAGGTTTGTA ATCCTTCAAA ATATATAGCC TACCAAGATT TGCTTTTAGG AGCCTTTGAT AAGCATTTAG AGGGTTTAGA TTTAGAAGAA CATTATATTA ATCTTTCTAA AAAATATGAG GAAATAGGAG AGAGAAGTGA GAGGTTTAAA TTAATGTTTA CTATGTATTC TAAACTTGCT GCTTACCTTT CAGTTAAAAG TGAGATAGGA CTAGAAATAA GAAAAGCTTA TTTAGAAAAG GATAAGGATG CCTTAAGACT TATAGCATAT AACTTTATTC CAGAAATACA AGAAAAGCTA AAGAGTTTTC ATAAGAGTTT TAGGGATCTA TGGTATAAGG AATGTAAAGG ACAAGGCTTT GAAGTTATAG ATATTAGACT TGGTGGAGTT ATGGCAAGAT GTGATTCAGC TATTTATAGA ATAAAAGCTT ATCTTAAAGG AAATATTGAT 
AAAATAGAAG AGCTAGAAGA GGAAAGACTT TATTTTTCTG AACATTTTGG AGGCGATGAT TGTAAGCTTA TATGTTGCAA TGAATATGAA AAAATAGCAA CACAAAATAT TTTAAGTTGG TAA
|
Protein sequence | MRVKLVGLNE EMLEAVKGLE ELIDFQLYNE KGSKNENLEI INVSKLPEES ENVIEVLRGK NNEIRYKKKH HFFRALSLYL QFLKKGEESF SRSEKTYIDS VGVMIDASRN AIYRVSEVKR ILGYMALMGH NRCMLYTEDT YEIEGYPYFG YMRGRYSKEE LREIDDYGYS LGIEVVPCIQ TLAHLKQTLR WPYGEGMKDT QDVLLVGEEK TYRFIEAMIS SLRECFRSKN IHIGMDEAFD LGRGHYLTKH GHVSHQKLMV EHLNKVNEIA KKYDFKPMIW DDMFLRCGAP DGGYYDLDIV ITPEIANNIP EEVSLVYWDY YNSDEEKYKK LLDIRDDFNN NIIFAGGCWR WSGFAPNYSK TFETTNAALN QCKAKGIKEV FATAWGDDGS ETPIYSIIVG LILFGEHGYY NKVEKEWIDE RCKSLTGLSM EDFTSLEELD LVPTVKTPNM EVCNPSKYIA YQDLLLGAFD KHLEGLDLEE HYINLSKKYE EIGERSERFK LMFTMYSKLA AYLSVKSEIG LEIRKAYLEK DKDALRLIAY NFIPEIQEKL KSFHKSFRDL WYKECKGQGF EVIDIRLGGV MARCDSAIYR IKAYLKGNID KIEELEEERL YFSEHFGGDD CKLICCNEYE KIATQNILSW
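As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates a short excerpt of the gene and computes a GC percentage. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The helper names `translate` and `gc_percent` are hypothetical, and only the first 30 and last 18 bases of the gene are used here rather than the full sequence.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 18 bases of the gene sequence in this record
print(translate("ATGAGAGTTA AATTAGTTGG ATTAAATGAG"))  # MRVKLVGLNE
print(translate("AATATTTTAAGTTGGTAA"))                # NILSW
```

The two excerpts reproduce the first ten and last five residues of the listed protein sequence. Passing the complete gene sequence to `gc_percent` would be one way to check the reported GC content.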