Gene CPF_0848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0848 
Symbol 
ID4203308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1004144 
End bp1007212 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content33% 
IMG OID638081731 
Productglycosy hydrolase family protein 
Protein accessionYP_695298 
Protein GI110801231 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0383] Alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.553553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATTTC AGATAAATGA ACGCAAGAAT ATTGTAAACG GTTTGTGTGA GAAGCTAGAA 
AAGGCAATAT ACAATGTGGT TGCGCCTTTA GAAATAGATA TGTATATAAC TAAGGAGCCA
GTAAGTTATG AAAATAGACT AACTGGAGAG CATAAAAAAG GAATTATAGG AGAGTCTTGG
GGAGAACTTT GGGACTGTGG TTGGTTTAAC TTTAAGGGAG AAGTACCTAA AAGTTATGAA
GGAGAAAAGA TTGTCCTTTT AATAGATATA AGTGGGGAAG CCTTTGTGGT AGATAAGAAT
GGAAATCCTA TGAGAGGGCT TACAACTTTA AATTCAGAGT TTGATTTAAG CTTAGGAATG
CCAGGTAAGA GAGAGGTTCC TATGTTTGAG AGAGCTGAAG GTGGAGAAGT TATAGATATA
TGGGCAGATT GTGCATGTAA CGATTTGTTT GGTAAATATA GAGATAATGG AATTATAAAA
GATGCATATA TTGCAACTTG TAATGAAGAA GTAAAGGCTC TTTACTATGA TGTAGAGGTT
TTACATGAAC TTATGAATCA ATTACCAGAA GATAGTGCTA GATATAATAC TATATTAAAT
GCTCTTTATG AAGCATCAAA GGTATTAAGT ATTAAAACTG GAATGAGTGG ATGTGATTTC
TTAAAGGACG GAGTTGTTAC TCAACAAGAA GTTACAATCT TAAATGAAGA AGAAGTTAAA
AAAGCCAGAG AAATTTTAAA GAAAGAATTA GATAAAAAAG GTGGAGATCC TTCACTTTCA
GTTTCAGCTA TTGGACATGC TCATATAGAT TTAGCTTGGT TATGGCCAAT AAGGGAAACT
ATAAGAAAGG GAGCTAGAAC ATATTCAACA GTTTTAGCTA ATATGGAAAA ATATCCTGAG
TATGTATTTG GAGCAAGCCA ACCTCAAATT TATCAGTGGA TGAAAGAATA CTATCCAGAT
TTATATGGAA GAATTAAAGA AAGAATAGAA GATGGAAGAT GGGAAGCTCA AGGAGGTATG
TGGGTTGAAC CAGATACTAA TGTTCCATCA GGAGAATCTT TAGTAAGACA ATTATTATAT
GGAAAGAGAT ACTTTAGAGA AGAATTCAAA AAGGAAATGG ATACTTTATG GCTTCCAGAC
GTATTTGGAT ATTCAGCAGC TCTTCCACAA ATTCTTAAAA AGAGTGGTGT AGATTATTTC
ATGACAATTA AATTATCATG GAACAATTAT AACCAATTTC CACATCATAC CTTTGTTTGG
GAAGGATTAG ATGGAAGTAG AGTATTATCA CACATGCCAC CAGAGGGAAC TTATAATAGT
TCAGCTGCAC CAAGGGCTGT TGCTAAGTCA GAGAAAGCCT TCTTAGATAA AGGCTTATCA
GATGAATGCT TAATGCTATT TGGAATTGGT GATGGAGGCG GTGGCCCAGG AGAAGAGCAT
TTAGAGAGAT TAAGAAGAGA AAGAAATATA AATGGAATAG CTCCTGTTAA GCAAGAACCT
TCAAGCAATT TCTTTAAGAG AATAGAAAAA GATATAGATA AATACAGTGT GTGGAGTGGA
GAGTTATATT TAGAGAAACA CCAAGGAACT TACACTACAC ATGGTAAGAA TAAAAAATAT
AATAGAAAGA TGGAAATAGC TCTTAAGAAC TTAGAGTTAG CAGCAATGCA AGCTAAAGTT
TTAGGAAAGG GAGAATATCC TCAAGATGAA ATAGAAAAAG TTTGGAAGGA AGTTCTTTTA
TATCAATTCC ATGATATTTT ACCAGGATCA TCAATAGGAA GAGTTTATGA CGAGTCAGTA
GAAGCCTATG AAAAGATGCT TGCTAAGGTA GAAGGAATGA CTGAAAAGTT ATATAGAGAT
ATTTTAGAAA GTGCAGATAG CAAAGAAGAG GGAGCTATAT TAGTAAACTC TTTATCATGG
AGTAGAGAGC AATGGATTAA GCTTTATGAT AGATGGACAA AGATTAACTT AAACTCTTTA
AGTGGAGAAA CAATCAGAAA AGAAGAAATT GAGCCATATT GTGAAATTTC AACTTTAAAA
GTTTCAGAAA ATAACTTAGA AAATGGATTA GTTAAAGTTG AGTTTAATGA AAATGGAACA
ATTGCAAGAA TTTTTGATAA AGAATTAAAT AAAGAAGTCT TAAGAGATGA AAAAGGAAAT
GAACTTAGAG TTTATGAAGA TAATGGAGAT GCTTGGGACT TCTCAGGAGT TTATGAAGAT
AGACCATCTG AAATTTTTGA GTTAGTAAAA TCAGAAGTTA TTGTAGATGG ACCAAAGGCA
ACTATTAAGC AAAATTATAA ATATGGAGAT TCTACTTTAG TACAAGAAAT AAGCATTTTA
GAAGGTAGTA AGAGAATCGA CTTTAAAACA AAGGTTGATT GGAAAGAAAA TGGTAAGATG
CTTAGAACAT CTTTCAATAC TAATGTTTAT ACTAGAGAGG CAGATTGTGA AATACAATTT
GGTACAATAA AGAGACCTAC TCATGGAAAT ACTTCATGGG ATATGGCTAA GGGTGAAATA
TGTGCTCAAA GATGGATAGA TTTATCACAA AGGGATTATG GAGTTGCTTT AATAAATGAT
TCTAAGTATG GACACAATGT CTCAGAAACT AAAATAGACT TAAATCTATT AAGAAGTCCA
GGATATCCAG ATTCAAATGC TGATAGAGGA GAGCATGAAT TTACTTATTG CTTATTCCCA
CACAGAGGAG ACTATATTGA AGGAAATGTT GTTCATGAAG CTCATGAAGT TAATGCTCCA
ATACAAGTAA TTTATAGTGA AAACTTAGGT GAAAACCTAG TTAAAGAAGC AATGGCTTCA
ATAGATTGTG AAAATGTAAT AATTGATACA ATTAAAAAAG CTGAAGATGG TGACCAAATG
ATAATCAGAT TATATGAGTG TCATGGAGAA GATGCTAAGG CTAAAGTTAA TATAAATGTT
CCATATGAAA ATATAGAAAT GGTTAACTTA ATAGAAGATA GCATAGATGG AAAGGAATTA
AACAAAGAAA CAATGCACCT TACATTTAAA CCATTTGAGG TTCATACTTT AAAAGTAACA
TTAAAGTAG
 
Protein sequence
MLFQINERKN IVNGLCEKLE KAIYNVVAPL EIDMYITKEP VSYENRLTGE HKKGIIGESW 
GELWDCGWFN FKGEVPKSYE GEKIVLLIDI SGEAFVVDKN GNPMRGLTTL NSEFDLSLGM
PGKREVPMFE RAEGGEVIDI WADCACNDLF GKYRDNGIIK DAYIATCNEE VKALYYDVEV
LHELMNQLPE DSARYNTILN ALYEASKVLS IKTGMSGCDF LKDGVVTQQE VTILNEEEVK
KAREILKKEL DKKGGDPSLS VSAIGHAHID LAWLWPIRET IRKGARTYST VLANMEKYPE
YVFGASQPQI YQWMKEYYPD LYGRIKERIE DGRWEAQGGM WVEPDTNVPS GESLVRQLLY
GKRYFREEFK KEMDTLWLPD VFGYSAALPQ ILKKSGVDYF MTIKLSWNNY NQFPHHTFVW
EGLDGSRVLS HMPPEGTYNS SAAPRAVAKS EKAFLDKGLS DECLMLFGIG DGGGGPGEEH
LERLRRERNI NGIAPVKQEP SSNFFKRIEK DIDKYSVWSG ELYLEKHQGT YTTHGKNKKY
NRKMEIALKN LELAAMQAKV LGKGEYPQDE IEKVWKEVLL YQFHDILPGS SIGRVYDESV
EAYEKMLAKV EGMTEKLYRD ILESADSKEE GAILVNSLSW SREQWIKLYD RWTKINLNSL
SGETIRKEEI EPYCEISTLK VSENNLENGL VKVEFNENGT IARIFDKELN KEVLRDEKGN
ELRVYEDNGD AWDFSGVYED RPSEIFELVK SEVIVDGPKA TIKQNYKYGD STLVQEISIL
EGSKRIDFKT KVDWKENGKM LRTSFNTNVY TREADCEIQF GTIKRPTHGN TSWDMAKGEI
CAQRWIDLSQ RDYGVALIND SKYGHNVSET KIDLNLLRSP GYPDSNADRG EHEFTYCLFP
HRGDYIEGNV VHEAHEVNAP IQVIYSENLG ENLVKEAMAS IDCENVIIDT IKKAEDGDQM
IIRLYECHGE DAKAKVNINV PYENIEMVNL IEDSIDGKEL NKETMHLTFK PFEVHTLKVT
LK