Gene CPF_0421 details

Gene Information

Locus tag: CPF_0421
Symbol:
ID: 4201891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 503188
End bp: 504852
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 31%
IMG OID: 638081305
Product: putative oligo-1,6-glucosidase
Protein accession: YP_694878
Protein GI: 110798775
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
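
The coordinates and lengths listed above agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(503188, 504852)
print(length)                  # 1665, matching "Gene Length"
print(protein_length(length))  # 554, matching "Protein Length"
```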


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.591118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAA AGTGGTGGAA AGAATTAATT GCTTACCAGA TTTATCCTAA GAGCTTTATG 
GATTCTAATG GTGATGGTAT AGGTGATATT CAAGGGATAA TATCTAAATT AGATTATTTA
AAAGATTTAG GTATTGATTT AATCTGGCTA TGTCCAATGT ATAAATCACC AAATCATGAT
AATGGTTATG ACATTAGTGA TTATAAAGAT ATCTTAGATG AATTTGGAAC TATGGATGAT
TTTAATGAAT TGCTTAGTGA GGTTCATAAT AGAGGGATGA AACTTATTAT AGATTTAGTA
ATAAATCATA CTAGTCATGA ACATCCATGG TTTATAGAAT CAAGAGCTTC TAGGGATAAT
CCTAAAAGAG ATTGGTATAT TTGGAGAGAA GGTAAAGGGG ATGAGGAACC AAATAACTGG
GAAAGTATAT TTAAAGGTTC AGCTTGGGAA TTCTGTGAGA ATAGTGAAGA GTATTACCTG
CATTTATTTG CTAAAGAGCA ACCAGATTTA AACTGGGAGA ATAAAGAGGT AAGAAATGAA
CTATATAAGA TGATAAACTG GTGGCTTGAT AAGGGTATTG ATGGGTTTAG AGTTGATGCC
ATAAGTCACA TAAAAAAAGA AGAGGGTCTT AAGGATATGG ATAATCCAGA GGGACTTAAA
TATGTTTCAT CCTTTGAAAA ACATATGAAT GTAGAGGGAA TAAATTCTCA TCTTAAGGAA
CTAAAAGAAG AAACTTTTTC AAAGTACGAT ATAGTTACCG TTGGAGAAGC AAATGGAGTT
AGTGCCAATG AAGCTGATCA CTGGGTAGCT GAAGATGAGG GGACATTTAA TATGATATTC
CAATTTGAGC ATCTTAATCT TTGGAATTAT GAAGAGGGAC AAGGATTTGA TGTGAAGGCA
TACAAAGATG TTTTAACAAA TTGGCAAAAT TCTTTAGAAG GTAAAGGATG GAATGCACTT
TTCATTGAAA ATCATGATAT ACCTAGAGTT GTTTCAACTT GGGGAAATGA CAAGGAATAT
TTAACTGAAT GTGCAAAAGC TTTTGGAGCA ATTTATTTCT TACAAAAGGG AACCCCTTTC
ATATACCAAG GGCAAGAACT TGGTATGACA AATGTTAAAT ATCATAGCAT ATCTGAGTAT
GATGATGTTA AAACTATAAA TACTTACAAT GAAAGAATTG AAAGTGGTGT TTCAGAGGAA
ATAGCATTAA AAGAAGCTTG GGTAACTTCA AGAGATAATT CAAGAACACC TATGCAATGG
AATTCAAGTG AGAATGCAGG GTTTACTTGT GGAAAACCTT GGATAGGAGT TAATGAAAAT
TATAAAACAA TAAATGTAGA AGTTGAAGAA AGGGATGAAA ATTCAGTTTT AAACTTCTAT
AAAAAGCTTA TAAAACTTAA AAAGTCTAAT GAAGCTTTAA TCTATGGTGT ATATGATTTA
ATCCTTGAAG AGGATGAAAA TATCTTTGCT TATACAAGAA CTTTAAATAA TGAAAAGTTC
TTGATAATGG CTAATTTAAC TGGAGAAAAT GCCAAGTACA TGTATGAGAA AGAAAAACTT
AATTCTAAGG ATTTAATTCT TAACAATTAT GAGGTTTGTG AACATAAAAA CTTAACAGAG
TTTACATTAA AACCTTATGA ATGCAGAGTA TATAAGCTTT CTTAA
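
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as displayed here (blocks of 10 bases separated by whitespace). The helper names are hypothetical, and how the database rounds its percentage is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence (60 bases).
first_line = "ATGAATAAAA AGTGGTGGAA AGAATTAATT GCTTACCAGA TTTATCCTAA GAGCTTTATG"
print(f"{gc_content(first_line):.0%}")  # 30% for these first 60 bases
```

Passing the full gene sequence to `gc_content` gives the value for the whole CDS.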
 
Protein sequence
MNKKWWKELI AYQIYPKSFM DSNGDGIGDI QGIISKLDYL KDLGIDLIWL CPMYKSPNHD 
NGYDISDYKD ILDEFGTMDD FNELLSEVHN RGMKLIIDLV INHTSHEHPW FIESRASRDN
PKRDWYIWRE GKGDEEPNNW ESIFKGSAWE FCENSEEYYL HLFAKEQPDL NWENKEVRNE
LYKMINWWLD KGIDGFRVDA ISHIKKEEGL KDMDNPEGLK YVSSFEKHMN VEGINSHLKE
LKEETFSKYD IVTVGEANGV SANEADHWVA EDEGTFNMIF QFEHLNLWNY EEGQGFDVKA
YKDVLTNWQN SLEGKGWNAL FIENHDIPRV VSTWGNDKEY LTECAKAFGA IYFLQKGTPF
IYQGQELGMT NVKYHSISEY DDVKTINTYN ERIESGVSEE IALKEAWVTS RDNSRTPMQW
NSSENAGFTC GKPWIGVNEN YKTINVEVEE RDENSVLNFY KKLIKLKKSN EALIYGVYDL
ILEEDENIFA YTRTLNNEKF LIMANLTGEN AKYMYEKEKL NSKDLILNNY EVCEHKNLTE
FTLKPYECRV YKLS
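
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch using only the Python standard library. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino-acid string used
# for the standard code (translation table 11 has the same assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First displayed line of the gene sequence (20 codons).
first_line = "ATGAATAAAA AGTGGTGGAA AGAATTAATT GCTTACCAGA TTTATCCTAA GAGCTTTATG"
print(translate(first_line))  # MNKKWWKELIAYQIYPKSFM
```

The printed residues match the first 20 residues of the protein sequence listed above.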