Gene CPF_0815 details


Gene Information

Locus tag: CPF_0815
Symbol:
ID: 4202623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 963334
End bp: 966717
Gene Length: 3384 bp
Protein Length: 1127 aa
Translation table: 11
GC content: 33%
IMG OID: 638081699
Product: glycosy hydrolase family protein
Protein accession: YP_695266
Protein GI: 110799293
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4724] Endo-beta-N-acetylglucosaminidase D
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
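
The length fields in this record follow from the coordinates, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon. Both are assumptions, though they agree with the values listed. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record; 1-based inclusive numbering is assumed.
start_bp = 963334
end_bp = 966717

# Inclusive span: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```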


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAA GAGGCATTAT GAGTAAACGT TTGCTAGCGG TTTTGTGCAT GGCATCAGTA 
ACAACATCAT TAATTAGTTT TAGAGGATTA GAAGTTAAGG CTTCACCAAA TGTGCCATAT
GATCAAAAAG AATCATTAGC ACCATTAGGA GAAGTTTATA GCATTGAAAC TTTATTAAAT
TGGACACCGG AAAGTGATCC AGATGCTAAG TATAATAGAG CAAGCATTGA GCTTCAAGAT
AGGTTTATGG GAGACATTGT AAATGAAAAT GCTAATCCAG AAGCTAAAAT AATGAATGTT
GCTTTAACTA ATCCATATGT TGATAGAGCA CCATCACAAG GTAGTGATTC AATGGATGCT
TATGTGTTTT CATATTGGCA GTATGTAAAT TCATATGTTT ATTGGGGAGG ATCAAGTAGA
GGAATATTTG CACTTCCAAC ACCAGATGTA GTTGATAATG CACATAAAAA TGGTGTTCCA
GTTTTAGCTA CAATAGGATT TCCATGGGGG CCAGGAGAAG GGTATGTTGA ACAAGTAAGA
GCATTCCTTC AAAAAGATAA GAATGGAAAC TTCCCAGTTG CAGACAAAAT GATTGAAATA
GCTGAATATT ATGGATTTGA TGGATATTTC TTTAACCAAG AATCCTATGG CTGTGTAAAA
GATGATGCAG ATAGAATGGT AGAAATGCTT GAGTATATCA AAAAGAAAGC TCCAAATATG
GTTATAGGTT GGTATGATTC TATGACTGTT GATGGAAATG TTAAGTGGCA AGATGCTTTA
AATGATAAGA ATGCTGCTTT CTTCCAAAAT GGAGAAAATA GAACAACTGA CGAATTCTTT
TTAAATTATA ATTGGACTCC AGAAAAAATA GAGACTACAG TTAATACTGC TAAATCATTA
GGAAGAAGTC CTTTTGATGT TTATGCAGGA TTAGATGTTC AACAAAATGC ATATAATACT
CCATTTAATG ATGATTACTT ATTAGATGAG AATGGAAAGC TTAGACTTTC TTTAGCTATG
TATACTCCAA ACTCAACATT CTCAATGGCT AAGGATGTTT ATGATTTCTA TAAACATGAT
CAAAAATTCT GGGTAGGACC AACAGGAGAC CCATCAAAAT CAGATACAGA GCAAGATTGG
ACAGGATTAG CTAACTATGT TCCAGATAGA TCAGCAATAA ATGATTTACC ATTTGTGACA
AACTTTAATT TAGGACATGG GGAAGATTAT TATATAGATG GTTCACTTTC AAGGGATGAA
GAGTGGAACA ATAGAGCAAT GCAAGAATAT TTACCAACTT GGAGATGGAT AGTTGAATCA
AATGGTTCAA AACTTACTCC AGATTTTGAC TTTAAAACTG CATACAATGG AGGAAGTTCA
TTAAAGGTTG AAGGTAAGTT AGAGGCTGCA AATCCAAACC ACATTAAGTT ATACAGCACT
GATTTAAATA TAGAGAATAA CTCAACAGAA TTATCAATTG TTTATAAAAC AGAATCAGCT
CCAAACATGA AAGTAGGATT ATGTTTTGGA GAAAATTATG ATGAAGAAAA CTTTACATTC
TTTGATGTAA ATAAAAATAG TAATGGAGAA TGGACAGAGG TTAAAATACC TTTAGGTGAC
CATGTTGGAA AAACAATTTC TGCAATATCA TTAAAGTTTG AAAGTGGTCA AGATGTTGAA
AACTTTAAGA TAAATGTAGG ACAAATCTCC ATAGAAGAAA CTGCTGAAAA TAATAAAGGG
TTAAAAAATA GTGATGTTAT TTTAGAAGAA ACAATGGTTC ATAATTCAAA TAGTGCAGAG
GCTAGAATTT ACTGGAATGG ATTAGAATCA GGAAATGAAG ATGATTTAGC ATTCTATGAA
ATATATAGAA TTAAGCCAGA CGGATCTAGA GAGCTTATGG GAGCAACTCC AAATGATGCT
TACTATGTTC AAGAGTTTGA GAGATATGGA GAAGAAATGG AGTTTGACAT TGAAGTAGTT
CCTGTAGATG TAAATTATAA TAGAGGAGAA GGAAAGAGAG TAACATTTAA TTGGGGAATT
CCTCATGATG CTACACAAAT TCCAGATAAA AAGGTTTATG AAAACCTTGC CTTATATAAA
TCAGTTCAAA CTAGTGCAGA AGGTGCTGCA GAACCAGGTC TTAAAGCCGT TGATGGAAAA
GTAGATAATA ATAGTAAGTG GTGTGCCGCT GGAGCTAAGG ATGGTTGGTT AACAATTGAC
TTAGGAGAGC CAAAAAATAT TCAAAGATGG GTTGTTAAAC ATGCTGAAGC TGGTGGAGAA
GCAAAAGATA TGAATACAAA AGATTTTGCT TTAGAAGTTT CATATGATGG AGGACAAACT
TATCAAGAGG TTGATGTTGT AACTGGAAAT GAGGATGCTA TTACTGATAG AAATCTAGAA
AATCCAATAG TTGCACAACA TTTAAGACTT AGAATTGATA ATTCAGGAAG TTCACCTTGG
GCAGCTATAA GAATATACGA GTTCCAATTA TATGAGGAAA CATTTAAAGA TCAAACAACT
GAGATTCCAA TGAGATTTGT AAATGCTAAA AATAATAAAG GGGCTAATGA CTCTGTATTA
TTTACAAGAG GTAAAGAAGG ACAAACTGTT AATCTTTATA AGGGATTAAA TGCTGAAACT
CCTTTTGCTA CAGCTACTAT AGACTCTAAT GGAGAAGCTA AATTTGAAGG ATTAGATTTT
GGTGAAGATG GTGGAAGAGT TTACTATGGA ACAGTTGAAG AAGGTAAGAG AGAAAGTTTA
AGAATGAGTG CTGCTTTTGA AGGAGAAAAT TGGGAATATT CAAAGGTTCC AGAAAGATTT
GAATTAACTC CTTACAAGGC ACCAGGAATG AATGGAGAAA ATCATAATTA TGGTACTTTA
AAAATAAATG ATTTAGAAGC TGGAGATATT GTAAGCATAT TTAATGACAA GGATTCAATA
TTCCCAGAGA AGGTTAGTGT TCCAGTAGCC AAAGGGGGAG ATACAGCTAT TTTAGATAGA
ATATCTATTA ATCCAGAGGG TGGAGTACTA ACATTAGAAG TTAAATCAGA AGGCAAGAAG
GCTAGAAGAA TTGAAGTTCC ATATTCAGGT TTTGGAGAAA TAGAAATGAA AATAGAAAGT
GAAGCACCTA CTAACTTAAG AACAGTAGAT GTTCAAAAGA AGAGTGTAAG TCTTGCTTGG
GATTCACCAG AAAATACTTA TGGATTAGAT GGATATATAA TCTATAAGGA TAACAAGAAA
GTTGGAGAAG TTTCAGCAGA TCAAACTGAA TTTACAGTTG GAAAGTTAAA CAGACATACA
ATTTATAACT TTAAAGTTGC TGCTAAGTAT TCTAATGGAG AAATTTCAAA AAGAGATACA
ATAACAGTAA GAACAGCTAG ATAG
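
The gene sequence is displayed in space-separated blocks of 10 bases. As a rough sketch, a block like this could be cleaned and its GC fraction computed as follows. The function names are hypothetical, and only the first displayed line is used as sample input, so the result is not expected to match the whole-gene GC content reported in the Gene Information section.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(sequence):
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGAGAAAAA GAGGCATTAT GAGTAAACGT TTGCTAGCGG TTTTGTGCAT GGCATCAGTA"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 2))
```

Applying the same two functions to the full sequence text would give the whole-gene figure.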
 
Protein sequence
MRKRGIMSKR LLAVLCMASV TTSLISFRGL EVKASPNVPY DQKESLAPLG EVYSIETLLN 
WTPESDPDAK YNRASIELQD RFMGDIVNEN ANPEAKIMNV ALTNPYVDRA PSQGSDSMDA
YVFSYWQYVN SYVYWGGSSR GIFALPTPDV VDNAHKNGVP VLATIGFPWG PGEGYVEQVR
AFLQKDKNGN FPVADKMIEI AEYYGFDGYF FNQESYGCVK DDADRMVEML EYIKKKAPNM
VIGWYDSMTV DGNVKWQDAL NDKNAAFFQN GENRTTDEFF LNYNWTPEKI ETTVNTAKSL
GRSPFDVYAG LDVQQNAYNT PFNDDYLLDE NGKLRLSLAM YTPNSTFSMA KDVYDFYKHD
QKFWVGPTGD PSKSDTEQDW TGLANYVPDR SAINDLPFVT NFNLGHGEDY YIDGSLSRDE
EWNNRAMQEY LPTWRWIVES NGSKLTPDFD FKTAYNGGSS LKVEGKLEAA NPNHIKLYST
DLNIENNSTE LSIVYKTESA PNMKVGLCFG ENYDEENFTF FDVNKNSNGE WTEVKIPLGD
HVGKTISAIS LKFESGQDVE NFKINVGQIS IEETAENNKG LKNSDVILEE TMVHNSNSAE
ARIYWNGLES GNEDDLAFYE IYRIKPDGSR ELMGATPNDA YYVQEFERYG EEMEFDIEVV
PVDVNYNRGE GKRVTFNWGI PHDATQIPDK KVYENLALYK SVQTSAEGAA EPGLKAVDGK
VDNNSKWCAA GAKDGWLTID LGEPKNIQRW VVKHAEAGGE AKDMNTKDFA LEVSYDGGQT
YQEVDVVTGN EDAITDRNLE NPIVAQHLRL RIDNSGSSPW AAIRIYEFQL YEETFKDQTT
EIPMRFVNAK NNKGANDSVL FTRGKEGQTV NLYKGLNAET PFATATIDSN GEAKFEGLDF
GEDGGRVYYG TVEEGKRESL RMSAAFEGEN WEYSKVPERF ELTPYKAPGM NGENHNYGTL
KINDLEAGDI VSIFNDKDSI FPEKVSVPVA KGGDTAILDR ISINPEGGVL TLEVKSEGKK
ARRIEVPYSG FGEIEMKIES EAPTNLRTVD VQKKSVSLAW DSPENTYGLD GYIIYKDNKK
VGEVSADQTE FTVGKLNRHT IYNFKVAAKY SNGEISKRDT ITVRTAR
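
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases only. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with the
# amino acid assignments of the standard code (also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three displayed blocks of the gene sequence.
print(translate("ATGAGAAAAA GAGGCATTAT GAGTAAACGT"))
```

The printed residues can be compared against the first block of the protein sequence listed above.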