Gene CPR_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2331 
Symbol 
ID4206343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2552800 
End bp2554818 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content29% 
IMG OID642566881 
Productglycogen debranching protein 
Protein accessionYP_699596 
Protein GI110803375 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3408] Glycogen debranching enzyme 
TIGRFAM ID[TIGR01561] glycogen debranching enzyme, archaeal type, putative 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTG GAAAAAAAGA TTGGACTAGT TATGATAGAG GGATAGAAAA AGAGTGGATG 
CTTACTAATG GTATAAGTGG ATTTTGTGGA GGAACTGCTA TAGGAGCAAA CTGTAGAAAA
TATCAAGGGT TATTAGTAGC GTGTACTAAT TCTCCAGAAG AGAGATATAT GATTCTTTCA
AATTTAAATG AGGAAATAAA TATTGATGGA AAGCTTCACC AGTTGAGTTC TTGTAAATAT
ACAGATAAAA TAGTGGATGG ATTTAAAAAC CTTCAAGGAT TTTATTATGA TGGACTACCT
CATTTTAGAT TTTTTGTAGA TGGAGTAGTT ATAAATAAAA AGATTGCTAT GGAATATGGA
AAGAACACTG TAGTAATAGA ATATGACATA TTAAATGGAA GTAAAGAATC AAAAATAAAT
ATTAAACCAC TATTTACTTT TAGGAATCCA GGAATATGTA GTGGAGAAGA AAATTTGAAA
TTTTCAAAAA CTGTGGATAA AAATAAAATT CAACTTACTC CACAGTTAAA TAAGGACATA
AATATAAGAT TTTTTACCTA TGGTGGAAAA ATATTAGATA AAACTGATAA AATTGTTGAT
AAAATTTATT ATGATGTGGA TAAAAGTACA GGTGATAAGT TTGTGGATAA ATATTACATT
CCAGGAACTG TTGAAATAAA TTTAAATCCT AATGAAAGAA AAAAGGTTTA TTTACTTTGT
ACTATAGAAA ATGAAGAGAA TTTTGACTGT GAAAAGATAA TAGAAAATGA AGAGAAAAGA
ATAAATAATC TTAAAAATAC CTTCGCTGAT TCAAGAATTT TAGCTAAATA TCTTCCAATT
GCAGGGGATC AATTTATAGT AAATAGAGAG TCTATAAAGG GAAAAACAAT TTTAGCTGGC
TACCCTTGGT TTTTAGATTG GGGAAGAGAT GCTATGATTG CTATTAATGG ATTAACTTTA
TCAACTGGAA GATTAGAAGA TGCTAAAGAT ATTATAAGAA GTTTTAGCCT TTATGAGAAA
GATGGATTAA TACCAAATAT GTTCCCTGGA AAAGGACAAG AGCCACTTTA TAATACAGTA
GATGCTTCCT TATGGTTTAT AAATGCAGTT TATAATTATT TGCTATATGC TAATAGTTAC
GAAGCTTTAG AGTTTGTTGA GAAAGAAGTT TATAAAACTA TAAAAAATAT AATAAAGGCT
TATAAAGAAG GTACTAAGTT CTCAATAAAA ATGGATGAAG AGGATTATTT AATAAATGCT
GGTTCAGGAT TAGACCAAGT AACCTGGATG GATGTTAGAG TAAATGGCAT AGTGGTTACT
CCAAGACATG GTAAACCAGT TGAAATAAAT GCTTTATGGT ATAATGCCTT AAAAATAGCA
GCCTTATTAA AAAATAAGTT TGAAAAAGAA GAAAAGAATG AGTATGAAGA ATTAGCTAAA
AAGGTTAAAG ATTCATTTAC TAAGACTTTT TGGAATGAGG ATAGAAAGTA TCTTTACGAT
GTTGTAAATA ATCATGAGAA GGATGATAGT TTAAGACCTA ATCAGATTTG GGCAGTATCA
TTACCTTTTA CAATGTTAGA TAGAGAAAAA GAAAAGCATA TTGTTCAAAA GGTTTTTGAA
GAATTATACA CTCCTTATGG ATTAAGAAGT TTAAGTAGAA ACAATAAGGA CTATCATGGA
ATTTATATAG GAAAGCTTTT TGATAGAGAT ATGGCTTATC ATCAAGGAAC AACATGGGCA
TTTCCTCTTG GAGGATTTTT CACAGCTTAC TGTAAGGTAA ATGATTATTC TAAAGAAGCT
GTAGATTTAA TTGATTCTTT AATGAGAGAT ATGGAAGATC ATATTAAAGA TCAATGCTTA
GGAAGTATTG CTGAAATTTT CGATGGTGAT AATCCTTATA AAGCTAGAGG ATGCTATGCT
CAAGCTTGGA GTGTTGGAGA AATGCTTAGA GTTTATTATG AGGATATTCT AGGAAACTAT
AAAAAATTAA AAAAAGTCCA CAAAGATATA TTTATATAA
 
Protein sequence
MKFGKKDWTS YDRGIEKEWM LTNGISGFCG GTAIGANCRK YQGLLVACTN SPEERYMILS 
NLNEEINIDG KLHQLSSCKY TDKIVDGFKN LQGFYYDGLP HFRFFVDGVV INKKIAMEYG
KNTVVIEYDI LNGSKESKIN IKPLFTFRNP GICSGEENLK FSKTVDKNKI QLTPQLNKDI
NIRFFTYGGK ILDKTDKIVD KIYYDVDKST GDKFVDKYYI PGTVEINLNP NERKKVYLLC
TIENEENFDC EKIIENEEKR INNLKNTFAD SRILAKYLPI AGDQFIVNRE SIKGKTILAG
YPWFLDWGRD AMIAINGLTL STGRLEDAKD IIRSFSLYEK DGLIPNMFPG KGQEPLYNTV
DASLWFINAV YNYLLYANSY EALEFVEKEV YKTIKNIIKA YKEGTKFSIK MDEEDYLINA
GSGLDQVTWM DVRVNGIVVT PRHGKPVEIN ALWYNALKIA ALLKNKFEKE EKNEYEELAK
KVKDSFTKTF WNEDRKYLYD VVNNHEKDDS LRPNQIWAVS LPFTMLDREK EKHIVQKVFE
ELYTPYGLRS LSRNNKDYHG IYIGKLFDRD MAYHQGTTWA FPLGGFFTAY CKVNDYSKEA
VDLIDSLMRD MEDHIKDQCL GSIAEIFDGD NPYKARGCYA QAWSVGEMLR VYYEDILGNY
KKLKKVHKDI FI