Gene CPF_0652 details


Gene Information

Locus tag: CPF_0652
Symbol:
ID: 4202913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 781031
End bp: 782341
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 25%
IMG OID: 638081537
Product: F5/8 type C domain-containing protein
Protein accession: YP_695105
Protein GI: 110799173
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3669] Alpha-L-fucosidase
TIGRFAM ID:
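
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(781031, 782341)
print(length, protein_length(length))  # 1311 436
```

Under these assumptions the computed values match the listed Gene Length (1311 bp) and Protein Length (436 aa).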


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0313432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACAA CTCTAGTAAA CCTAACTCCA ACTAAAGAAC AAAGAAAATA CCAAGAAGAT 
GAACTAATAG CTTTTTTAGA CCTCCAAATG AATACATTTA CAGCTAATTC TGAAAATAAT
GATGAGGCTC CTTCTTTATT AAAAAAAGAG ACTTCATTTA ACACAGATAT ATGGATAGAA
AATCTATTTA AATTTGGATT TAAGAAAGTT ATTTTAACTG CTAAGCTTAA TAATGGTATT
TGTTTATGGC AAAGCAATTG TTCAAAGGTT AATATAAATA CTTTTTCTTT AGAGAATAAT
TATGATGATT TAGTAAAAAA AGTTAGTAAA TCTTGTAAAA AGTTTGGACT TAAATTTGGT
ATTCATCTTA CTTTAAAAGA TTATCTCACT TTAAACTTAA CTCCTGAAGA ATATGATAAT
TACTACACGT CTCAATTAAA AGAACTTATG ACTAATTATT CACAAATAAG TGAAGTTATT
ATGGATATGA CCTCTTTAGA TGAATATTAT GACTCCCTTG ATTTTGAAAG ATACTTTAAA
GTAGTTAAAG AAATAAATCC TAAATGTATG ATATCTAGCC CTATTGGACC TGATGTACGT
TGGACATATT ATTCTGATAT AGAGTCATCT AAAAACTATT TCTACTCTTC TATAAATTTA
GATCTTTTAA AAGAAGATTT TAACAACGAA GAGATTAGAA AAGGAAATAT TTATGGAGAC
ACTTGGATTG TTGGAGAAAG TATTTACTCT CTATCTGACT GCTTAAACCA TAATAAAGAT
TCATTATCAA ATTTAAAACA TGTGTATAAT AATTCATTAG GAAGAAATAC AAACTTGGTA
TTAGTTTTAT CTCCTAATAC AGATGGATTA TTAAATCATA ATGAATTAAG TTTACTTTCT
GATTTTTCTA AATATATAAA AGAAACTTTT TCTAATAATC TTATTAAGGG CTCTTCTATA
TTAGCTACAA ATTCATCTTC TAGCGATAGC TATAATTTAA TTGATGACTA CAAAAAATCT
TATTGGATAG CTAATGAAAA TGCTGTTAAC CCTTATATAG AGATAGATTT TAAAACTATC
ACTGAATTTA ATATTTTAGA AATTAGAGAG TGGATTGCTG AAGGTCAAAA CGTAGAAGAA
TTTAAAGTTT ATGCATATAA CAATGGTTGG TTTGAACTTT ATAATGGTAC TTCTATTGGA
TATAGACATA TAGCAAAACT TAATAATATC AAAACTGATA AAATTAAAAT TTCATTTACT
AAATATAAAA ACCCACCTAT GATTAATCAT ATTGGTGCAT ATTTAGGATA A
 
Protein sequence
MNTTLVNLTP TKEQRKYQED ELIAFLDLQM NTFTANSENN DEAPSLLKKE TSFNTDIWIE 
NLFKFGFKKV ILTAKLNNGI CLWQSNCSKV NINTFSLENN YDDLVKKVSK SCKKFGLKFG
IHLTLKDYLT LNLTPEEYDN YYTSQLKELM TNYSQISEVI MDMTSLDEYY DSLDFERYFK
VVKEINPKCM ISSPIGPDVR WTYYSDIESS KNYFYSSINL DLLKEDFNNE EIRKGNIYGD
TWIVGESIYS LSDCLNHNKD SLSNLKHVYN NSLGRNTNLV LVLSPNTDGL LNHNELSLLS
DFSKYIKETF SNNLIKGSSI LATNSSSSDS YNLIDDYKKS YWIANENAVN PYIEIDFKTI
TEFNILEIRE WIAEGQNVEE FKVYAYNNGW FELYNGTSIG YRHIAKLNNI KTDKIKISFT
KYKNPPMINH IGAYLG
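
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before computing a statistic such as the GC content listed in the record. The following is a minimal sketch; `gc_content` is a hypothetical helper, not part of any database tooling, and the example input is only the first two blocks of the gene sequence.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and newlines."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two ten-base blocks of the gene sequence, used here as sample input.
sample = "ATGAATACAA CTCTAGTAAA"
print(f"{gc_content(sample):.0%}")
```

Passing the full gene sequence text to the same function would give the fraction for the whole CDS.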