Gene CPF_0826 details

Gene Information

Locus tag: CPF_0826
Symbol:
ID: 4202082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 977025
End bp: 979439
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 29%
IMG OID: 638081710
Product: glycosyl hydrolase family protein
Protein accession: YP_695277
Protein GI: 110799258
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
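
The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. A minimal Python sketch of that check follows. The coordinate convention and the function names are assumptions made for illustration; the record itself does not state them.

```python
# Values copied from the Gene Information fields above.
START_BP = 977025
END_BP = 979439


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residues encoded by a CDS of n_bp bases, not counting the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2415 804
```

Both results match the Gene Length and Protein Length fields.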


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.241156
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAAAA TAATTCATAT CAATGATCAA TGGTTTTATG CAAATGATTA CAAAGATGAA 
TATTTAAAAA ATGAGTTTGA TTTTAGTAAT TTTGAAAGGG TAGATTTACC TCATACAAAT
ATAGAGTTAC CATATAATTA TTTTGATGAA AAATCATATC AGTTTATTTC TACTTATGTT
AAAACTTTAA AGTTTGATAA TAGCATTAAA GGCAAAAAAG TATTTTTAGA TTTTGAAGGA
GTTATGATAG CTGCAGAGGT ATATTTAAAT GGAATTCATG TTGGTGGACA TAAGGGTGGT
TACACTAATT TTTCAATAGA TATTACTGAG GCTTTAAAAA TTAATGAAGA TAATGTATTA
AAGGTTGTTG TTGACTCCAC AGAAAGACCT GATATACCTC CTCATGGATA TGTTGTTGAT
TATTTAACCT ATGGAGGAAT ATATAGAGAG GTTTCCTTAA GAATAGTTGA ACCTATATTT
ATAAATAATT TATATGCAAG GGCATATGAT TGCTTAAAGG AAGAAAAAAG ATTAGAACTT
GATATAGAAA TAAATAATTT TGAAAAATAT AGAGATGATT TAGAAATTGT TGTAGAGTTT
GGTGATGATA CTTTTGAAGA AACTTTAAGT ACAAAGTTAC CAATTGAAGA AGGAACAAGT
ATTAAAAATA TAGAAATAGA CCAATTAAAT ATGGTTAAGT TATGGGATAT AGAAAATCCT
AATCTTTATG AAATAAAAGT TAAGTTATTA AAAGATTCAG AAGTTATAGA TGAATATAAA
GATACCTTTG GATTTAGAGA AGCTGAATTT AGACAAGATG GTTTCTATTT AAATGGAAGA
AGAGTAAAGC TTGTTGGATT AAATCGTCAT CAGGCTTATC CATATGTAGG ATATGCTATG
CCTCAAAGGG TTCAAGAAAA GGATGCTGAG ATTTTAAAAT ATGAATTAGG ACTTAATATA
GTTAGAACAT CTCACTATCC ACAATCAGTA CATTTCTTAA GAAAATGTGA TGAGATTGGA
TTATTAGTTT TTGAAGAGAT ACCTGGTTGG CAACATATAG GTGATGAAGC TTGGCAGGCA
GAATCTATTA AAAATGTAGA GGAAATGATA AAAAGAGATT ACAATAGACC TTCCATAGTT
TTATGGGGCG TTAGAATAAA TGAGTCTCAA GATAGTCATG ATTTTTATGT GAAAACTAAT
GCTATGGCAA AGAGTTTAGA TCCTATTAGA CAAACTGGTG GGGTTAGATA CTTAGAAAAT
AGTGATTTCC TAGAAGATGT TTATACCATG AATGATTTTA TACACAGTGG GGGAGAAAAA
GTATTAAGAA CTCAAAGTGA AGTAACAGGA CAAGCTGATA AAGTTCCTTA TTTAGTAACT
GAGTATAATG GGCATATGTA TCCAACAAAA AGCTTTGATC AAGAGTGTAA AAAAGTTGAA
CATGCTTATA GACATTTGAG AGTTATTAAT GAATCCTTTG GCTTAGATGA AATAAGTGGA
GCCATAGGAT GGTGTGCTTT TGATTATAAT ACACATAGTT CCTTTGGTTC AGGAGATAAA
ATTTGTTACC ATGGAGTTTC TGATATGTTC AGAAATCCTA AGTATGCAGC TTATTCATAT
GCTAGCCAAA AGAAAGTAGA AGATGGTGTT GTTTTAGAGC CTATTACTTT AGGGGCTAAG
GGAGAAAGAG ATGGAGGAGC AATACTTCCA TTTACAGTGC TTACAAACTG TGATTATATT
AAAATATTTA AAGATGGAAT ATATATAGAT ACTTATTATC CTAACAAAGA AAAGTTCCCT
AATTTACCAC ATCCACCAAT AGAGGTTTCA CATATTTTAT CTATGGATTC AGAAATACCT
CTTACTGAAG AAGCAAAAAA AGAAATTAAA GACTTTGTAT TAAATAAATT AAAAGATTCT
AATTTAACTA ATTTAGCTGA AGAAGATTTT AAATATATTG AAGAATTTAG TGAAAGAGTA
AATATACCTG TATTTAAAAT AATGTCTTTA GTTTATAAAT TAGCTGGAGG TTGGGGAGAT
AAGGAAAACT CTTTAATAAT AAAAGGCTTT ATAGATAATA AAGAGGTTGC TTCAAAAGAA
ATAGGTGAGC TTAGAAGCAT GAATAAGTTA GAAGTTACAC CAGATGATTT AGAACTTTCA
TTAGATAAAA CAAGTTATGA TGCTACTAGA ATTGTGGTTA AACTTTTAGA TAACTTAGGA
GAGGTTCTTT TCTTAAATAA TGATTTTATT GAAGTAGAAA TAGATGGACC TTTAAGTATA
ATGGGACCAA GTAAGTTTGG AATCTCTGGT GGAATAACAG CTTTCTGGGT AAGAACTCAA
GGGCAAACTG GACTTTGCAA AATAAAGGTT AAGAGCATGT ACTTTGAAGA AGAAATTTCT
ATAGAAGTTA AGTAG
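
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is measured. The Python sketch below shows one way to do that and to compute a GC fraction such as the one in the GC content field. The helper names are hypothetical, and for brevity the example uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAGGAAAA TAATTCATAT CAATGATCAA TGGTTTTATG CAAATGATTA CAAAGATGAA"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))  # 60 0.25
```

Passing all of the gene sequence lines to `clean_sequence` gives the input needed to compute the whole-gene value.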
 
Protein sequence
MRKIIHINDQ WFYANDYKDE YLKNEFDFSN FERVDLPHTN IELPYNYFDE KSYQFISTYV 
KTLKFDNSIK GKKVFLDFEG VMIAAEVYLN GIHVGGHKGG YTNFSIDITE ALKINEDNVL
KVVVDSTERP DIPPHGYVVD YLTYGGIYRE VSLRIVEPIF INNLYARAYD CLKEEKRLEL
DIEINNFEKY RDDLEIVVEF GDDTFEETLS TKLPIEEGTS IKNIEIDQLN MVKLWDIENP
NLYEIKVKLL KDSEVIDEYK DTFGFREAEF RQDGFYLNGR RVKLVGLNRH QAYPYVGYAM
PQRVQEKDAE ILKYELGLNI VRTSHYPQSV HFLRKCDEIG LLVFEEIPGW QHIGDEAWQA
ESIKNVEEMI KRDYNRPSIV LWGVRINESQ DSHDFYVKTN AMAKSLDPIR QTGGVRYLEN
SDFLEDVYTM NDFIHSGGEK VLRTQSEVTG QADKVPYLVT EYNGHMYPTK SFDQECKKVE
HAYRHLRVIN ESFGLDEISG AIGWCAFDYN THSSFGSGDK ICYHGVSDMF RNPKYAAYSY
ASQKKVEDGV VLEPITLGAK GERDGGAILP FTVLTNCDYI KIFKDGIYID TYYPNKEKFP
NLPHPPIEVS HILSMDSEIP LTEEAKKEIK DFVLNKLKDS NLTNLAEEDF KYIEEFSERV
NIPVFKIMSL VYKLAGGWGD KENSLIIKGF IDNKEVASKE IGELRSMNKL EVTPDDLELS
LDKTSYDATR IVVKLLDNLG EVLFLNNDFI EVEIDGPLSI MGPSKFGISG GITAFWVRTQ
GQTGLCKIKV KSMYFEEEIS IEVK
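
The protein sequence corresponds to a translation of the gene sequence under translation table 11. That table assigns the same amino acids to codons as the standard code and differs in the start codons it allows. The Python sketch below is a plain-Python illustration, not the database's own pipeline. The names are hypothetical, and because this gene starts with ATG, alternative start codons are not handled.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGAGGAAAA TAATTCATAT CAATGATCAA"))  # MRKIIHINDQ
print(translate("ATAGAAGTTA AGTAG"))  # IEVK
```

The two outputs match the first block and the last four residues of the protein sequence.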