Gene CPF_0802 details


Gene Information

Locus tag: CPF_0802
Symbol:
ID: 4200952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 951039
End bp: 952871
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 30%
IMG OID: 638081686
Product: glycosyl hydrolase family protein
Protein accession: YP_695253
Protein GI: 110798751
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
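
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 951039
end_bp = 952871
gene_length_bp = 1833
protein_length_aa = 610

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS with one terminal stop codon,
# so the protein has one residue fewer than the codon count.
codons, remainder = divmod(gene_length_bp, 3)

coords_match = span_bp == gene_length_bp
protein_match = remainder == 0 and codons - 1 == protein_length_aa

print(coords_match, protein_match)  # prints: True True
```

Under these assumptions the span is 1833 bp, which is 611 codons, matching the listed 610 aa plus one stop codon.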


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.124248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAAAA TTACAAGAGA GGCAATACAT CATATAGCTC AAAGCAATTA TTCTTATGGA 
TATGATAATG AAACTTTGCA TTTAAGAGTT AAGACTAAAA AAGGTGAAGT TAATAAAGTG
GAAATAAGAA TTGGAGATCC TTACATATGG GATGAAGGTG GTTGTGATGG AGGAAATATG
AATGCCACTG GAGGACGATG GACAGGTGGA AAAAGTTATC CTATGAGAAA GGAATGCGAA
ACAAAATACT TTGATCACTG GATAGTTGAG TATAAACCAT TAACCAAACG TTCAAGATAT
GGATTTATAT TATATGGAGA TGAAGAAACT CTTTTATTCA CAGAAAAAAG AATAGAGGAA
TTAGATGGAA AGTATGATGA AGCAAAATTA AGTGATATAG GAAACTTTTA TTGTTTTCCA
TATTTAAATG CCATAGATGT TGCTAAAACA CCAAAATGGG TAAAAGATAC TGTTTGGTAT
CAAATATTCC CAGATAGATT TTGCAATGGA GATAAATCAA TAGATCCAGA AAATGTTGAG
CCATGGGGGA CAGAGCCTAC TAGGGATAAT TTTATGGGAG GAGATTTACA GGGAGTTTTA
GATAAATTAG ATTACTTATG CGATCTTGGA ATTAATGGAC TATATTTTTG TCCTGTATTT
GAAGCTACTG AAAATCATAG ATATGAAACT ATAGATTATT TTAAAGTAGA TCCAGCCCTT
GGTGGAAATG AAGTCTTTAA AAAACTTGTA AGTGAAGCTC ACAAAAGAGG AATGAAAATA
ATGTTAGATG CAGTATTTAA TCATATAGGT TATTTTTCAC CTCAATGGCA AGAGGTATTA
AAGAATAATG AAAAATCAAG ATATAAGGAT TGGTTTTGTA TAAAGAAGTT TCCAGTACTT
GAAAATGGCT TAGAAAATGT TGACGGAAAT AATTTAAATT ATGAAACCTT TGGAAGAATA
GCCACAATGC CTAAACTAAA CACAGAAAAT CCAGAGGTTG TAGAATATTT ATTAAAAGTG
GCTAAGTTCT GGGTTGAAGA AATGGATATA GATGGTTGGA GACTTGATGT ATGTAATGAA
GTAGACCATG TATTTTGGAG AAAATTTAGG GAAGTAGTAA AGGGAACTAA TAAAGAAGTT
TATATATTAG GAGAAGTTTG GCATGATGGA CTTCCATGGC TTATGGGAGA TCAGTTTGAT
GCAGTTATGA ATTACCCTGT TACAGATGCA GTAAAAGAAT ATTTTTGTTT AAATCAATCT
AATACAGAAG ATTTTAAATA TATGATAGAA GCTAATAAGG TTAGTTATTT AAGACAAATA
GGAGAAACAA TATTTAACTT GTTAGATAGT CATGATACTC CAAGAATATT AACTGTAGCT
GAAGGAAACA AGGATAAGAT GAAATTAGCT TATCTGTTTA TGTTTACTCA AGCTGGTTCT
CCATGTATAT ATTATGGGGA TGAAGTTGGT ATGGAAGGTA ATCAAGGAAT GGGTATGGAA
TTCCATAGAA GATGTATGGT TTGGGATGAA AATAAACAAG ATAAAGATAT GCTTAAGTTT
ATGAAACAAA TAATAAAAAT AAGAAAAGAA AATAAGGAAT TAAATTTATT AGATAATAAT
TGGATAAGAG CTAATAGAGC TGAAAATATA CTTATATATT CAAAGGAAAA TATATTTATT
ATTATGAACA ATTCAGAGAA TGAAGAAAAG ATATATTTAC CTAAAGAGAT TAAAAATAAT
AAGGTTAAGG ATTTATTTGA AGAAAAAATT GAACCTTTAA AAGAAGATAT AGGCTTAAAG
CCTTTTGCAT TTAAAGTTTA TAAAAAGCTT TAA
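
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything on it. The following is a minimal sketch of that cleanup and of a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and this is not necessarily how the database derived its own GC content figure. Only the first two blocks of the listing are used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence above (example input only).
blocked = "ATGGGTAAAA TTACAAGAGA"
seq = clean_sequence(blocked)

print(len(seq), gc_percent(seq))  # prints: 20 30.0
```

To check the whole gene, the full listing would be passed to `clean_sequence` in the same way. The cleaned string should then be 1833 bases long if the listing is complete.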
 
Protein sequence
MGKITREAIH HIAQSNYSYG YDNETLHLRV KTKKGEVNKV EIRIGDPYIW DEGGCDGGNM 
NATGGRWTGG KSYPMRKECE TKYFDHWIVE YKPLTKRSRY GFILYGDEET LLFTEKRIEE
LDGKYDEAKL SDIGNFYCFP YLNAIDVAKT PKWVKDTVWY QIFPDRFCNG DKSIDPENVE
PWGTEPTRDN FMGGDLQGVL DKLDYLCDLG INGLYFCPVF EATENHRYET IDYFKVDPAL
GGNEVFKKLV SEAHKRGMKI MLDAVFNHIG YFSPQWQEVL KNNEKSRYKD WFCIKKFPVL
ENGLENVDGN NLNYETFGRI ATMPKLNTEN PEVVEYLLKV AKFWVEEMDI DGWRLDVCNE
VDHVFWRKFR EVVKGTNKEV YILGEVWHDG LPWLMGDQFD AVMNYPVTDA VKEYFCLNQS
NTEDFKYMIE ANKVSYLRQI GETIFNLLDS HDTPRILTVA EGNKDKMKLA YLFMFTQAGS
PCIYYGDEVG MEGNQGMGME FHRRCMVWDE NKQDKDMLKF MKQIIKIRKE NKELNLLDNN
WIRANRAENI LIYSKENIFI IMNNSENEEK IYLPKEIKNN KVKDLFEEKI EPLKEDIGLK
PFAFKVYKKL