Gene Information |
Locus tag | CPF_0848 |
Symbol | |
ID | 4203308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1004144 |
End bp | 1007212 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638081731 |
Product | glycosyl hydrolase family protein |
Protein accession | YP_695298 |
Protein GI | 110801231 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
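The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1004144
end_bp = 1007212
gene_length_bp = 3069
protein_length_aa = 1022

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, so the protein has
# one residue fewer than the number of codons.
codons, remainder = divmod(span_bp, 3)
encoded_residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("residues match protein length:", encoded_residues == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.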
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.553553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATTTC AGATAAATGA ACGCAAGAAT ATTGTAAACG GTTTGTGTGA GAAGCTAGAA AAGGCAATAT ACAATGTGGT TGCGCCTTTA GAAATAGATA TGTATATAAC TAAGGAGCCA GTAAGTTATG AAAATAGACT AACTGGAGAG CATAAAAAAG GAATTATAGG AGAGTCTTGG GGAGAACTTT GGGACTGTGG TTGGTTTAAC TTTAAGGGAG AAGTACCTAA AAGTTATGAA GGAGAAAAGA TTGTCCTTTT AATAGATATA AGTGGGGAAG CCTTTGTGGT AGATAAGAAT GGAAATCCTA TGAGAGGGCT TACAACTTTA AATTCAGAGT TTGATTTAAG CTTAGGAATG CCAGGTAAGA GAGAGGTTCC TATGTTTGAG AGAGCTGAAG GTGGAGAAGT TATAGATATA TGGGCAGATT GTGCATGTAA CGATTTGTTT GGTAAATATA GAGATAATGG AATTATAAAA GATGCATATA TTGCAACTTG TAATGAAGAA GTAAAGGCTC TTTACTATGA TGTAGAGGTT TTACATGAAC TTATGAATCA ATTACCAGAA GATAGTGCTA GATATAATAC TATATTAAAT GCTCTTTATG AAGCATCAAA GGTATTAAGT ATTAAAACTG GAATGAGTGG ATGTGATTTC TTAAAGGACG GAGTTGTTAC TCAACAAGAA GTTACAATCT TAAATGAAGA AGAAGTTAAA AAAGCCAGAG AAATTTTAAA GAAAGAATTA GATAAAAAAG GTGGAGATCC TTCACTTTCA GTTTCAGCTA TTGGACATGC TCATATAGAT TTAGCTTGGT TATGGCCAAT AAGGGAAACT ATAAGAAAGG GAGCTAGAAC ATATTCAACA GTTTTAGCTA ATATGGAAAA ATATCCTGAG TATGTATTTG GAGCAAGCCA ACCTCAAATT TATCAGTGGA TGAAAGAATA CTATCCAGAT TTATATGGAA GAATTAAAGA AAGAATAGAA GATGGAAGAT GGGAAGCTCA AGGAGGTATG TGGGTTGAAC CAGATACTAA TGTTCCATCA GGAGAATCTT TAGTAAGACA ATTATTATAT GGAAAGAGAT ACTTTAGAGA AGAATTCAAA AAGGAAATGG ATACTTTATG GCTTCCAGAC GTATTTGGAT ATTCAGCAGC TCTTCCACAA ATTCTTAAAA AGAGTGGTGT AGATTATTTC ATGACAATTA AATTATCATG GAACAATTAT AACCAATTTC CACATCATAC CTTTGTTTGG GAAGGATTAG ATGGAAGTAG AGTATTATCA CACATGCCAC CAGAGGGAAC TTATAATAGT TCAGCTGCAC CAAGGGCTGT TGCTAAGTCA GAGAAAGCCT TCTTAGATAA AGGCTTATCA GATGAATGCT TAATGCTATT TGGAATTGGT GATGGAGGCG GTGGCCCAGG AGAAGAGCAT TTAGAGAGAT TAAGAAGAGA AAGAAATATA AATGGAATAG CTCCTGTTAA GCAAGAACCT TCAAGCAATT TCTTTAAGAG AATAGAAAAA GATATAGATA AATACAGTGT GTGGAGTGGA GAGTTATATT TAGAGAAACA CCAAGGAACT TACACTACAC ATGGTAAGAA TAAAAAATAT AATAGAAAGA TGGAAATAGC TCTTAAGAAC TTAGAGTTAG CAGCAATGCA AGCTAAAGTT TTAGGAAAGG GAGAATATCC TCAAGATGAA ATAGAAAAAG TTTGGAAGGA AGTTCTTTTA TATCAATTCC ATGATATTTT ACCAGGATCA TCAATAGGAA GAGTTTATGA CGAGTCAGTA 
GAAGCCTATG AAAAGATGCT TGCTAAGGTA GAAGGAATGA CTGAAAAGTT ATATAGAGAT ATTTTAGAAA GTGCAGATAG CAAAGAAGAG GGAGCTATAT TAGTAAACTC TTTATCATGG AGTAGAGAGC AATGGATTAA GCTTTATGAT AGATGGACAA AGATTAACTT AAACTCTTTA AGTGGAGAAA CAATCAGAAA AGAAGAAATT GAGCCATATT GTGAAATTTC AACTTTAAAA GTTTCAGAAA ATAACTTAGA AAATGGATTA GTTAAAGTTG AGTTTAATGA AAATGGAACA ATTGCAAGAA TTTTTGATAA AGAATTAAAT AAAGAAGTCT TAAGAGATGA AAAAGGAAAT GAACTTAGAG TTTATGAAGA TAATGGAGAT GCTTGGGACT TCTCAGGAGT TTATGAAGAT AGACCATCTG AAATTTTTGA GTTAGTAAAA TCAGAAGTTA TTGTAGATGG ACCAAAGGCA ACTATTAAGC AAAATTATAA ATATGGAGAT TCTACTTTAG TACAAGAAAT AAGCATTTTA GAAGGTAGTA AGAGAATCGA CTTTAAAACA AAGGTTGATT GGAAAGAAAA TGGTAAGATG CTTAGAACAT CTTTCAATAC TAATGTTTAT ACTAGAGAGG CAGATTGTGA AATACAATTT GGTACAATAA AGAGACCTAC TCATGGAAAT ACTTCATGGG ATATGGCTAA GGGTGAAATA TGTGCTCAAA GATGGATAGA TTTATCACAA AGGGATTATG GAGTTGCTTT AATAAATGAT TCTAAGTATG GACACAATGT CTCAGAAACT AAAATAGACT TAAATCTATT AAGAAGTCCA GGATATCCAG ATTCAAATGC TGATAGAGGA GAGCATGAAT TTACTTATTG CTTATTCCCA CACAGAGGAG ACTATATTGA AGGAAATGTT GTTCATGAAG CTCATGAAGT TAATGCTCCA ATACAAGTAA TTTATAGTGA AAACTTAGGT GAAAACCTAG TTAAAGAAGC AATGGCTTCA ATAGATTGTG AAAATGTAAT AATTGATACA ATTAAAAAAG CTGAAGATGG TGACCAAATG ATAATCAGAT TATATGAGTG TCATGGAGAA GATGCTAAGG CTAAAGTTAA TATAAATGTT CCATATGAAA ATATAGAAAT GGTTAACTTA ATAGAAGATA GCATAGATGG AAAGGAATTA AACAAAGAAA CAATGCACCT TACATTTAAA CCATTTGAGG TTCATACTTT AAAAGTAACA TTAAAGTAG
|
Protein sequence | MLFQINERKN IVNGLCEKLE KAIYNVVAPL EIDMYITKEP VSYENRLTGE HKKGIIGESW GELWDCGWFN FKGEVPKSYE GEKIVLLIDI SGEAFVVDKN GNPMRGLTTL NSEFDLSLGM PGKREVPMFE RAEGGEVIDI WADCACNDLF GKYRDNGIIK DAYIATCNEE VKALYYDVEV LHELMNQLPE DSARYNTILN ALYEASKVLS IKTGMSGCDF LKDGVVTQQE VTILNEEEVK KAREILKKEL DKKGGDPSLS VSAIGHAHID LAWLWPIRET IRKGARTYST VLANMEKYPE YVFGASQPQI YQWMKEYYPD LYGRIKERIE DGRWEAQGGM WVEPDTNVPS GESLVRQLLY GKRYFREEFK KEMDTLWLPD VFGYSAALPQ ILKKSGVDYF MTIKLSWNNY NQFPHHTFVW EGLDGSRVLS HMPPEGTYNS SAAPRAVAKS EKAFLDKGLS DECLMLFGIG DGGGGPGEEH LERLRRERNI NGIAPVKQEP SSNFFKRIEK DIDKYSVWSG ELYLEKHQGT YTTHGKNKKY NRKMEIALKN LELAAMQAKV LGKGEYPQDE IEKVWKEVLL YQFHDILPGS SIGRVYDESV EAYEKMLAKV EGMTEKLYRD ILESADSKEE GAILVNSLSW SREQWIKLYD RWTKINLNSL SGETIRKEEI EPYCEISTLK VSENNLENGL VKVEFNENGT IARIFDKELN KEVLRDEKGN ELRVYEDNGD AWDFSGVYED RPSEIFELVK SEVIVDGPKA TIKQNYKYGD STLVQEISIL EGSKRIDFKT KVDWKENGKM LRTSFNTNVY TREADCEIQF GTIKRPTHGN TSWDMAKGEI CAQRWIDLSQ RDYGVALIND SKYGHNVSET KIDLNLLRSP GYPDSNADRG EHEFTYCLFP HRGDYIEGNV VHEAHEVNAP IQVIYSENLG ENLVKEAMAS IDCENVIIDT IKKAEDGDQM IIRLYECHGE DAKAKVNINV PYENIEMVNL IEDSIDGKEL NKETMHLTFK PFEVHTLKVT LK