Gene CPR_0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0417 
Symbol 
ID4205957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp502193 
End bp503857 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content31% 
IMG OID642564974 
Productoligo-1,6-glucosidase 
Protein accessionYP_697746 
Protein GI110803940 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AGTGGTGGAA AGAACTAATT GCTTACCAGA TTTATCCTAA GAGCTTTATG 
GATTCTAATG GTGATGGTAT AGGTGATATT CAAGGTATAA TATCTAAATT AGATTATTTA
AAAGATTTAG GTATTGATTT AATATGGCTA TGTCCAATGT ATAAATCACC AAATCATGAT
AATGGTTATG ACATTAGTGA TTATAAAGAT ATCTTAGATG AATTTGGAAC TATGGATGAT
TTTAATGAAT TGCTTAATGA GGTTCATAAT AGAGGGATGA AGCTTATTAT AGATTTAGTA
ATAAATCATA CTAGTCATGA ACATCCATGG TTCATAGAAT CAAGATCTTC TAGGGATAAT
CCTAAAAGAG ATTGGTATAT TTGGAGAGAA GGTAAAGGGT ATGAGGAACC AAATAACTGG
GAGAGCATAT TTAAAGGTTC AGCTTGGGAA TTCTGTGAGA ATAGTGAAGA GTATTACCTA
CATTTATTTG CTAAAGAGCA ACCAGATTTA AACTGGGAGA ATAAAGAGGT AAGAAGAGAA
CTATATAATA TGATAAACTG GTGGCTTGAT AAGGGTATTG ATGGATTTAG AGTTGATGCT
ATAAGTCACA TAAAAAAAGA AGAAGGTCTT AAGGATATGG ATAATCCAGA GGGGCTTAAA
TATGTTTCAT CCTTTGAAAA ACATATGAAT GTAGAGGGAA TAAATTCTCA TCTTAAGGAA
CTAAAAGAAG AAACTTTTTC AAAGTACGAT ATAGTTACCG TTGGAGAAGC AAATGGAGTT
AGTGCTAATG AAGCTGATCA CTGGGTAGCT GAAGATGAGG GAACATTTAA TATGATATTC
CAATTTGAGC ATCTTAATCT TTGGAATTAT GAAGAGGGAC AAGGATTTGA TGTGAAGGCA
TACAAAGATG TTTTAACAAA TTGGCAAAAT TCTTTAGAAG GCAAAGGATG GAATGCACTT
TTTATTGAAA ATCATGATAT ACCTAGAGTT GTTTCAACTT GGGGAAATGA CAAGGAATAT
TTAACTGAAT GTGCAAAAGC TTTTGGAGCA ATTTATTTCT TACAAAAGGG AACCCCTTTC
ATATATCAAG GGCAAGAGCT TGGTATGACA AATGTTAAAT ATCATAGTAT ATGTGAGTAT
GATGATGTTA AAACTATAAA TACTTACAAT GAAAGAATTG AAAGTGGTGT TTCAGAGGAA
ATAGCATTAA AAGAAGCTTG GGTAACTTCA AGAGATAATT CAAGAACACC TATGCAATGG
AACTCAAGTA AGAATGCAGG ATTTACTTGT GGAAAACCTT GGATAGGAGT TAATGAAAAT
TATAAAACAA TAAATGTAGA AGTTGAAGAA AGGGATGAAA ATTCAGTTTT AAACTTCTAT
AAAAAGCTTA TAAAACTTAA AAAGTCTAAT GAAGCTTTAA TCTATGGTGT ATATGATTTA
ATCCTTGAAG AGGATGAAAA TATATTTGCT TACACAAGAA CTTTAAATAA TGATAAATTC
TTGATAATGG CTAATTTAAC TGGAGAAAAT GCCAAGTATG TGTATGAGAA AGAAAAACTT
AATTCTAAGG ATTTAATTCT TAACAATTAT GAGGTTTGTG AACATAAAAA CTTAACAGAG
TTTATATTAA AACCTTATGA ATGCAGAGTA TATAAGCTTT CCTAA
 
Protein sequence
MNKKWWKELI AYQIYPKSFM DSNGDGIGDI QGIISKLDYL KDLGIDLIWL CPMYKSPNHD 
NGYDISDYKD ILDEFGTMDD FNELLNEVHN RGMKLIIDLV INHTSHEHPW FIESRSSRDN
PKRDWYIWRE GKGYEEPNNW ESIFKGSAWE FCENSEEYYL HLFAKEQPDL NWENKEVRRE
LYNMINWWLD KGIDGFRVDA ISHIKKEEGL KDMDNPEGLK YVSSFEKHMN VEGINSHLKE
LKEETFSKYD IVTVGEANGV SANEADHWVA EDEGTFNMIF QFEHLNLWNY EEGQGFDVKA
YKDVLTNWQN SLEGKGWNAL FIENHDIPRV VSTWGNDKEY LTECAKAFGA IYFLQKGTPF
IYQGQELGMT NVKYHSICEY DDVKTINTYN ERIESGVSEE IALKEAWVTS RDNSRTPMQW
NSSKNAGFTC GKPWIGVNEN YKTINVEVEE RDENSVLNFY KKLIKLKKSN EALIYGVYDL
ILEEDENIFA YTRTLNNDKF LIMANLTGEN AKYVYEKEKL NSKDLILNNY EVCEHKNLTE
FILKPYECRV YKLS