Gene Information |
Locus tag | CPR_0841 |
Symbol | |
ID | 4206184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 970085 |
End bp | 973153 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642565400 |
Product | alpha-mannosidase 2c1 |
Protein accession | YP_698166 |
Protein GI | 110803739 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
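The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 970085
END_BP = 973153
GENE_LENGTH_BP = 3069
PROTEIN_LENGTH_AA = 1022


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the two computed values agree with the Gene Length and Protein Length fields of the record.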
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATTTC AGATAAATGA ACGCAAGAAT ATTGTAAACG GTTTGTGTGA AAAGCTAGAA AAGGCAATAT ACAATGTGGT TGCGCCTTTA GAAATAGATA TGTATATAAC TAAGGAGCCA GTAAGTTATG AAAATAGACT AACTGGAGAG CATAAAAAAG GAGTTATAGG AGAGTCTTGG GGAGAACTTT GGGACTGTGG TTGGTTTAAC TTTAAGGGAG AAGTACCTAA AAGTTATGAA GGAGAAAATA TTGTCCTTTT AATAGATATA AGTGGAGAAG CCTTTGTGGT AGATAAGGAT GGAAATCCTA TGAGAGGGCT TACAACTTTA AACTCAGAGT TTGATTTAAG CTTAGGAATG CCAGGTAAGA GAGAGGTTCC TATGTTTGAG AGAGCCAAAG GTGGAGAAGT TATAGATATA TGGGCAGATT GTGCATGTAA CGATTTGTTT GGTAAATATA GAGATAATGG AATTATAAAA GATGCATATA TTGCAACTTG TAATGAAGAA GTAAAGGCTC TTTATTATGA TGTAGAGGTT TTACATGAAC TTATGAATCA ATTACCAGAA GATAGTGCTA GATATAATAC TATATTAAAT GCTCTTTATG AAGCATCAAA GGTATTAAGT ATTAAAACTG GAATGAGTGG ATGTGATTTC TTAAAGGACG GAATTGTTAC TCAACAAGAA GTTACAATCT TAAATGAAGA AGAAGTTAAA AAAGCCAGAG AAATCTTAAA GAAAGAATTG GATAAAAAAG GTGGAGATCC TTCACTTTCA GTTTCAGCTA TTGGACATGC TCATATAGAT TTAGCTTGGT TATGGCCAAT AAGGGAAACT ATAAGAAAGG GAGCTAGAAC ATATTCAACA GTTTTAGCTA ATATGGAAAA ATATCCTGAG TATGTATTTG GAGCAAGCCA ACCTCAACTT TATCAGTGGA TGAAAGAATA CTATCCAGAT TTATATGAAA GAATTAAAGA AAAAATAGAA GAAGGAAGAT GGGAAGCTCA AGGAGGTATG TGGGTTGAAC CAGATACTAA TGTTCCATCA GGAGAATCTT TAGTAAGACA ATTATTATAT GGAAAGAGAT ACTTTAGAGA AGAATTCAAA AAAGAGATGG ATACTTTATG GCTTCCAGAT GTATTTGGAT ATTCAGCAGC GCTTCCACAA ATTCTTAAAA AGAGTGGTGT AGATTATTTC ATGACAATTA AATTATCATG GAACAATCAT AACCAATTTC CACATCATAC CTTTGTTTGG GAAGGATTAG ATGGAAGTAG AGTATTATCA CACATGCCAC CAGAGGGAAC TTATAATAGT TCAGCTGCAC CAAGGGCTGT TGCTAAATCA GAGAAAGCTT TCTTAGATAA AGGCTTATCA GATGAATGTT TAATGCTATT TGGAATTGGT GATGGAGGCG GTGGACCAGG AGAAGAGCAT TTAGAGAGAT TAAGAAGAGA AAGAAATATA AATGGAATAG CTCCTGTTAA GCAAGAACCT TCAAGTAATT TCTTTAAGAG AATAGAAAAG GATATAGATA AATACAGTGT GTGGAGTGGG GAGTTATACT TAGAGAAACA CCAAGGAACG TACACTACAC ATGGTAAGAA TAAAAAATAT AATAGAAAGA TGGAAATAGC TCTTAAGAAC TTAGAGTTAG CAGCAATGCA AGCTAAAGTT TTAGGAAAAG GAGAATATCC TCAAGAGGAA ATAGAAAAAG TTTGGAAGGA AGTTCTTTTA TATCAATTTC ATGATATTTT ACCAGGATCA TCCATAGGCA GAGTTTATGA CGAGTCAGTA 
GAAGCTTATG AAAAGATGCT TGCTAAGGTA GAAGGAATGA CTGAAAAGTT ATATAGAGAT ATTTTAGAAA GTGCATATAG TAAGGAAGAA GGAGCTATAT TAGTAAACTC TTTATCATGG AGTAGAGAAA AGTGGATTAA GCTTTATGAT AGATGGACAA GAGTTAACTT AAACTCTTTA AGTGGAGAAA TAATTAAAAA AGAAGAAATT GAGCCATATT GTGAAATTTC AACTTTAAAA GTTTCGGAAA CTAACTTAGA AAATGGATTA GTTAAAGTTG AGTTTAATGA AAATGGAACA ATTGCAAGAA TTTTTGATAA AGAATTAAAT AAAGAAGTCT TAAGAGATGA AAAAGGAAAT GAACTTAGAG TTTATGAAGA TAATGGAGAT GCTTGGGATT TCTCAGGAGT TTATGAAGAT AGACCATCTG AAATTTTTGA TTTAGTAAAA TCAGAAGTTA TTGTAGATGG ACCAAAGGCA ACTATTAAGC AAAACTATAA ATATGGAGAT TCTACTTTAG TACAAGAAAT AAGCATTTTA GAAGGTAGTA AGAGAATCGA CTTCAAAACA AAGGTTAATT GGAAAGAAAA TGGTAAGATG CTTAGAACAT CTTTCAATAC TAATGTTTAT ACTAGAGAGG CAGATTGTGA AATACAATTT GGTACAATAA AGAGACCTAC TCATGGAAAT ACTTCATGGG ATATGGCTAA GGGTGAAATA TGTGCTCAAA GATGGATAGA TTTATCACAA AGGGATTATG GAGTTGCTTT AATAAATGAT TCTAAGTATG GACACAATGT TTCAGAAACT AAGATAGACT TAAATCTATT AAGAAGTCCA GGATATCCAG ATCCAAATGC TGATAGAGGA GAGCATGAAT TTACTTATTG TTTATTCCCA CACAGAGGAG ACTATATTGA AGGAAATGTT GTTCATGAAG CTCATGAAGT TAATGCTCCA ATACAAGTAA TTTATAGTGA AAAATTAGGT GAAAACCTAG TTAAAGAAGC AATGGTTTCA ATAGATTGTG AAAATGTAAT AATTGATACA ATTAAAAAAG CTGAAGATGG TGACCAAATG ATAATCAGAT TATATGAGTG TCATGGGGAA GATGCTAAGG CTAAAGTTAA TATAAATCCT CCATATGAAA AGGTAGAAAT GGTTAACTTA ATAGAAGATA GCATAGATGA AAAAGAATTA AACAAAGAAA CAATGCACCT TACATTTAAA CCATTTGAGG TTCATACTTT AAAAGTAACA TTAAAGTAG
|
Protein sequence | MLFQINERKN IVNGLCEKLE KAIYNVVAPL EIDMYITKEP VSYENRLTGE HKKGVIGESW GELWDCGWFN FKGEVPKSYE GENIVLLIDI SGEAFVVDKD GNPMRGLTTL NSEFDLSLGM PGKREVPMFE RAKGGEVIDI WADCACNDLF GKYRDNGIIK DAYIATCNEE VKALYYDVEV LHELMNQLPE DSARYNTILN ALYEASKVLS IKTGMSGCDF LKDGIVTQQE VTILNEEEVK KAREILKKEL DKKGGDPSLS VSAIGHAHID LAWLWPIRET IRKGARTYST VLANMEKYPE YVFGASQPQL YQWMKEYYPD LYERIKEKIE EGRWEAQGGM WVEPDTNVPS GESLVRQLLY GKRYFREEFK KEMDTLWLPD VFGYSAALPQ ILKKSGVDYF MTIKLSWNNH NQFPHHTFVW EGLDGSRVLS HMPPEGTYNS SAAPRAVAKS EKAFLDKGLS DECLMLFGIG DGGGGPGEEH LERLRRERNI NGIAPVKQEP SSNFFKRIEK DIDKYSVWSG ELYLEKHQGT YTTHGKNKKY NRKMEIALKN LELAAMQAKV LGKGEYPQEE IEKVWKEVLL YQFHDILPGS SIGRVYDESV EAYEKMLAKV EGMTEKLYRD ILESAYSKEE GAILVNSLSW SREKWIKLYD RWTRVNLNSL SGEIIKKEEI EPYCEISTLK VSETNLENGL VKVEFNENGT IARIFDKELN KEVLRDEKGN ELRVYEDNGD AWDFSGVYED RPSEIFDLVK SEVIVDGPKA TIKQNYKYGD STLVQEISIL EGSKRIDFKT KVNWKENGKM LRTSFNTNVY TREADCEIQF GTIKRPTHGN TSWDMAKGEI CAQRWIDLSQ RDYGVALIND SKYGHNVSET KIDLNLLRSP GYPDPNADRG EHEFTYCLFP HRGDYIEGNV VHEAHEVNAP IQVIYSEKLG ENLVKEAMVS IDCENVIIDT IKKAEDGDQM IIRLYECHGE DAKAKVNINP PYEKVEMVNL IEDSIDEKEL NKETMHLTFK PFEVHTLKVT LK
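The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a field could be processed, the code below strips the spacing, computes GC content, and translates codons. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), and the helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical. Only the first three blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    dna = clean_sequence("ATGTTATTTC AGATAAATGA ACGCAAGAAT")
    print(translate(dna))  # compare with the first block of the protein sequence
    print(f"{gc_content(dna):.1%}")
```

Translating these 30 bases gives the same ten residues as the first block of the protein sequence, MLFQINERKN. Applying `gc_content` to the full gene sequence would be the way to compare against the reported GC content field.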