Gene CPR_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1051 
Symbol 
ID4204721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1196679 
End bp1198511 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content29% 
IMG OID642565607 
Productglycosy hydrolase family protein 
Protein accessionYP_698373 
Protein GI110802276 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0315037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTTAA TACCAAGACC AAAAAGTGTA ATTAATCATG AAGGTGAATT TTTTATAGAA 
AGAGATACTG AAATTATATT AAGTAGTGAA TTATCCTTTG AAGATTTAAA TTTGGCAATT
ATGACTCAAA AGGAAATAGA AAAAGTATTA GATTTTAAAC TTAACATAAA TAAACTTTTT
ATGGATAAGA AATATAGTAA CTCAATAATT TTAAGAGAAT TTAAATTTGA AAATGAAGAG
GAATATAAAA TAGAAATAAA AGAGAATCAG GTAATAATAG AAGGGTTTGG TGCTGGATTA
TTCTATGGAT GTCAGAGTTT TAGACAACTT GTAAGAGAGT TTGGAGCATG TATTCCAAAT
CTAATAATAG AAGATTCTCC ATATTTTAAA TATCGTGGAT TTTATCATGA TGTAACAAGG
GGAATGGTAC CAACCTTAGA TACATTAAAA AGATTAGTTG ATAAGGCAGC TTTTTATAAA
ATAAATCAGT TGCAACTATA TATAGAGCAT ACCTTTGCTT TTAAGGGAAT GAGCGAAGTT
TGGATGGATA AGGATCCTTT AACAGCAGAG GAAATATTGA TTTTAGACAA GTATTGCAAA
GAAAGACATG TGGAACTTGT ACCATCATTA TCAACCTTTG GTCATCTATA TGAAGCTTTA
AGAAGTAAAT CCTTTAGAGA ACTTTGTGAA TTAGAAATAG GAGATGAAGA ATATTCTTTT
GTAGATAGAA TGGCACATCA TACTTTAGAT GTTACTAATC CTAAAAGTTT AGGCTTTGTT
GAATCAATGC TTTTAGAATT TATTCCTTTA TTCAGTTCAG ATAAGTTTAA TATTTGCTGT
GATGAAACCT TTGATTTAGG AAAAGGAAAG AGTAGAGAGA AGGCTGAAAA ATTAGGAGTA
GGTAAAATAT ATACAGAGTT TTTAAATAAG GTATACAACA TTGTAAAAAG GTTCAATAAA
AATGTTATGT TCTGGGGAGA TATAATAGTT GGATATCCAG AGCTTTTAAG TGATATACCA
GAGGATTTAA CTTGTTTAAC TTGGAACTAT CATCCACAGG CTAATGATGT AGCCACAAAA
ATTATAGCAG AGAACAATAA AGTACAATAT GTTTGCCCTG GTGTAGGTGG ATGGAATATG
ATGATGAATC TTATAGAAGG CTCTTTTAGT AATATAAGAA GAATGGTTAA CCATGGAATG
AAATATGGAG CTATAGGTGT TTTAAATACA AACTGGGGAG ACTATGGAAA TATAAATCTA
TTAGCTAATT CAATGCCATC TATGATTTAT GGAGCAGGAA TTTCATGGAA TCCAAAGGAA
GAGGAGTTTA ATGAAATTTT TAAGTCTATA TCCCTTATGG AATTTGGTGA TGAATCTATG
AAGGTGGTTT CTTTAATGGA TAAGCTTTCT AAAAATCAAG TTGCAGGTTG GGGAGAACTT
GTTAGATGGA AAGAAAAGTT TAATGAAAGA GAAGAAACTA AGGAAGAAAT TAAGAATATA
GATACTTTAA AAGTTTTTGA AGGATATAAG GTTGCTTCAG AGGTAAGAAG AGAGTTTATT
AAATTACTTA AAAATACAGA GGATAAAGAA GCTATTCAAA GCTTTATTGT GTCTTCAAAG
GGATGGGAGC TTATAGATAA ATTCTTCATG GTATTACTTG AAAGAGAGTT TAATAAAAAA
AGTTTATTAG ATATAGATAA AAAAGACTTA GCTAAGGATT TAGAGCTTTG GTTTTATGAT
TACACATCAA TATGGAGAAA ATATAATAAA GAAAGCGAGC TTAATAGAAT AAGAGAAGTT
ATAGTATATA TGTGCTCATA TTTAAGGGGT TAA
 
Protein sequence
MHLIPRPKSV INHEGEFFIE RDTEIILSSE LSFEDLNLAI MTQKEIEKVL DFKLNINKLF 
MDKKYSNSII LREFKFENEE EYKIEIKENQ VIIEGFGAGL FYGCQSFRQL VREFGACIPN
LIIEDSPYFK YRGFYHDVTR GMVPTLDTLK RLVDKAAFYK INQLQLYIEH TFAFKGMSEV
WMDKDPLTAE EILILDKYCK ERHVELVPSL STFGHLYEAL RSKSFRELCE LEIGDEEYSF
VDRMAHHTLD VTNPKSLGFV ESMLLEFIPL FSSDKFNICC DETFDLGKGK SREKAEKLGV
GKIYTEFLNK VYNIVKRFNK NVMFWGDIIV GYPELLSDIP EDLTCLTWNY HPQANDVATK
IIAENNKVQY VCPGVGGWNM MMNLIEGSFS NIRRMVNHGM KYGAIGVLNT NWGDYGNINL
LANSMPSMIY GAGISWNPKE EEFNEIFKSI SLMEFGDESM KVVSLMDKLS KNQVAGWGEL
VRWKEKFNER EETKEEIKNI DTLKVFEGYK VASEVRREFI KLLKNTEDKE AIQSFIVSSK
GWELIDKFFM VLLEREFNKK SLLDIDKKDL AKDLELWFYD YTSIWRKYNK ESELNRIREV
IVYMCSYLRG