Gene CPF_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1238 
Symbol 
ID4201260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1408498 
End bp1410330 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content29% 
IMG OID638082119 
Productglycosy hydrolase family protein 
Protein accessionYP_695684 
Protein GI110799990 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.247532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTTAA TACCAAGACC AAAAAGTGTA ATTAATCATG AAGGTGAATT TTTTATAGAA 
AGAGATACTG AAATTATATT AAGTAGTGAA TTATCCTTTG AAGATTTAAA TTTGGCAATT
ATGATTCAAA AGGAAATAGA AAAAGTATTA GATTTTAAAC TTAACATAAA TAAACTTTTT
ATGGATAAGA AATATAGTAA CTCAATAATT TTAAGAGAAT TTAAATTTGA AAATGAAGAG
GAATATAAAA TAGAAATAAA GGAGAATCAG GTAATAATAG AAGGGTTTGG TGCTGGATTA
TTCTATGGAT GTCAGAGCTT TAGACAGCTT GTAAGAGAGT TTGGAGCATG TATTCCAAAT
CTAACAATAG AAGATTCTCC ATATTTTAAA TATCGTGGAT TTTATCATGA TGTAACAAGG
GGAATGGTAC CAACCTTAGA TACATTAAAA AGATTAGTTG ATAAGGCAGC TTTTTATAAA
ATAAATCAGT TGCAACTATA TATAGAGCAT ACCTTTGCTT TTAAGGGAAT GAGTGAAGTT
TGGATGGATA AGGATCCTTT AACAGCGGAG GAAATATTGA TTTTAGACAA GTATTGCAAA
GAGAGACATG TGGAACTTGT ACCATCATTA TCAACTTTTG GTCATCTATA TGAAGCTTTA
AGAAGTAAAT CCTTTAGAGA ACTTTGTGAA TTAGAAATAG GAGATGAAGA ATATTCTTTT
GTAGATAGAA TGGCACATCA TACTTTAGAT GTTACTAATC CTAAAAGTTT AGACTTTGTT
GAATCAATGC TTTTAGAATT TATTCCTTTA TTCAGTTCAG ATAAGTTTAA TATTTGCTGT
GATGAAACCT TTGATTTAGG AAAAGGAAAG AGTAGAGAGA AGGCTGAAAA ATTAGGAGTA
GGTAAAATAT ATACAGAGTT TTTAAATAAG GTATACAACA TTGTAAAAAG GTTCAATAAA
AATGTTATGT TCTGGGGAGA TATAATAGTT GGATATCCAG AGCTTTTAAG TGATATACCA
GAGGATTTAA CTTGTTTAAC TTGGAACTAT CATCCACAGG CTAATGATGT AGCCACAAAA
ATTATAGCAG AGAACAATAA AGTACAATAT GTTTGCCCTG GTGTAGGTGG ATGGAATATG
ATGATGAATC TTATAGAAGG CTCTTTTAGT AATATAAGAA GAATGGTTAA CCATGGAATG
AAATATGGAG CTATAGGTGT TTTAAATACA AACTGGGGAG ACTATGGAAA TATAAATCTA
TTAGCTAATT CAATGCCATC TATGATTTAT GGAGCAGGAA TTTCATGGAA TCCAAAGGAA
GAGGAGTTTA ATGAAATTTT TAAGTCTATA TCCCTTATGG AATTTGGTGA TGAATCTATG
AAGGTGGTTT CTTTAATGGA TAAGCTTTCT AAAAATCAAG TTGCAGGTTG GGGAGAACTT
GTTAGATGGA AAGAAAAGTT TAATGAAAGA GAAGAAACTA AGGAAGAAAT TAAGAATATA
GATACTTTAA AAGTTTTTGA AGGATATAAG GTTGCTTCAG AGGTAAGAAG AGAGTTTATT
AAATTACTTA AAAATACAGA GGATAAAGAA GCTATTCAAA GCTTTATTGT GTCTTCAAAG
GGATGGGAGC TTATAGATAA ATTCTTCATG GTATTACTTG AAAGAGAGTT TAATAAAAAA
AGTTTATTAG ATATAGATAA AAAAGACTTA GCTAAGGATT TAGAGCTTTG GTTTTATGAT
TACACATCAA TATGGAGAAA ATATAATAAA GAAAGCGAGC TTAATAGAAT AAGAGAAGTT
ATAGTATATA TGTGCTCATA TTTAAGGGGT TAA
 
Protein sequence
MHLIPRPKSV INHEGEFFIE RDTEIILSSE LSFEDLNLAI MIQKEIEKVL DFKLNINKLF 
MDKKYSNSII LREFKFENEE EYKIEIKENQ VIIEGFGAGL FYGCQSFRQL VREFGACIPN
LTIEDSPYFK YRGFYHDVTR GMVPTLDTLK RLVDKAAFYK INQLQLYIEH TFAFKGMSEV
WMDKDPLTAE EILILDKYCK ERHVELVPSL STFGHLYEAL RSKSFRELCE LEIGDEEYSF
VDRMAHHTLD VTNPKSLDFV ESMLLEFIPL FSSDKFNICC DETFDLGKGK SREKAEKLGV
GKIYTEFLNK VYNIVKRFNK NVMFWGDIIV GYPELLSDIP EDLTCLTWNY HPQANDVATK
IIAENNKVQY VCPGVGGWNM MMNLIEGSFS NIRRMVNHGM KYGAIGVLNT NWGDYGNINL
LANSMPSMIY GAGISWNPKE EEFNEIFKSI SLMEFGDESM KVVSLMDKLS KNQVAGWGEL
VRWKEKFNER EETKEEIKNI DTLKVFEGYK VASEVRREFI KLLKNTEDKE AIQSFIVSSK
GWELIDKFFM VLLEREFNKK SLLDIDKKDL AKDLELWFYD YTSIWRKYNK ESELNRIREV
IVYMCSYLRG