Gene CPR_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0084 
Symbol 
ID4204178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp99856 
End bp101676 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content29% 
IMG OID642564633 
Productpullulanase 
Protein accessionYP_697423 
Protein GI110802434 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAG TTTATATTTA TCATGATTCT CAAGGTACCT TTTATAGGGA ACCTTTTGGA 
GCAGTTTCTG TAGGTTCTAA GGTAAGTTTA AGATTAGAAT GCAAAGAGTG TGGAGAAGTT
TTTATAGAGG TAATAAAGTT TGATGGAAGT AGATATTTAA TACCTATGAC AATTGAAGAG
AGAAGAAATG AATGCATAAT TTATAAAGGA ATAATAGACA CTACAAACTC CTTAGGAGTA
ATAAATTATT ACTTTAAATA CATTAAGGAT GGATTTACTA AGTATTATGG AAATAATGAT
GAGTGTTTAG GAGGAGAGGG AAAGATATAC TATGATTTTC CTAATTATTA TCAAATAACA
GTTTATGAGG ATAATAAAAT TCCTAGTTGG TATAAGGAAG GTATTATTTA TCAGATATTT
GTGGATAGAT TTTTTAATGG AAATAAGGAT AGTATGATAT TAAATAAAAA GAAAAATAGC
TTTATATATG GTAATTGGTA TGATGAGCCA ATGTATATAA GAGATAGTAA TGGAAATATT
AAACGGTGGG ATTTTTATGG AGGGAATTTA AGGGGAGTAA TTGAAAAATT AGATTATATA
AAATCTTTAG GGGTAAATAT TATATACATG AATCCAATCT TTGATGCTGT GAGTTGTCAT
AAATATGATA CTGGAGATTA TGAAAATATT GATAAGATGT ACGGAACTAA CAGTGATTTT
AATGAATTGT GCCAAAAAGC TGAGGAAAAA GGTATAAGGA TAATATTAGA TGGAGTTTTT
AGTCATACAG GATCAGATAG TAGGTACTTT AATAAATACG GAAACTATGG AGAGCTGGGA
GCCTATGAAT CTAAATACTC TAAATATTAT AATTGGTATA GGTTTTATGA TTATCCTAAT
AGTTATGAAT GTTGGTGGGG TTTTGAAAAC CAGCCTAATG TAGAGGAATT AGAAAAGACA
TATTCAGATT ATATAGTTAA TAGTGAAAAT TCAATAATAG CGAAGTGGCT TAGATTAGGA
GCAAGCGGAT GGAGGTTAGA TGTAGCAGAT GAACTTCCAG ATGAATTCAT ACAAATGATT
AAGGAAAGGA TGAAAAATGA GAAAGAAGAT AGTGTGCTTA TAGGAGAGGT TTGGGAAGAC
GCTTCAAATA AGGTTAGCTA TTCAAAAAGA AGAAAGTATT TATTAGGAAA TGAATTAGAT
TCTGTAACAA ATTACCCTTA TAGAGATATA ATTTCTAATT TTTTAAATGA AGAAATAAGT
TCAAAGGATT TTTATAAAGT AATAATGAGC ATAAAAGAAA ATTATCCAAG AGAAAATTTT
TTTGCAAACA TGAATATTCT AGGTAACCAT GACACAGAAA GAATACTTAC AGTATTAAAA
GAGAATTTAA ATAAGTTAAA ATTAGCCCTA TGTCTTCAAA TGACTTTACC TGGAGTTCCC
TTAATTTATT ATGGTGATGA GGCAGGACTT TTAGGAAATA AGGATCCTGA AAATAGAAAG
ACCTATCCTT GGGGACGAGA AAATAAGGAA ATATTAAGTT ATTATAGTTT TTTCGGAAAC
TTTAGAAAGA ATGAAGAGGT TTTAAGAAAG GGAGATTTTT ATATTTTTAA GGATACACCT
GAGGATATCA TTGCTTTTAA GAGAGTTTAT AAAGATAAAG AAATGATAAT TATAGTAAAC
AGGAGTAATT CTAGAAAAAC CATAACCTTA GATAGCGAGA AAAGAAGATA TAAAGATAAA
TTTTCTAAGG AAGAATTTTA TGGTGATGGT AGCATTACTT TAGAGGTAGA AAGAGAAAAT
TATAAAATTT TAACTAATTA G
 
Protein sequence
MGKVYIYHDS QGTFYREPFG AVSVGSKVSL RLECKECGEV FIEVIKFDGS RYLIPMTIEE 
RRNECIIYKG IIDTTNSLGV INYYFKYIKD GFTKYYGNND ECLGGEGKIY YDFPNYYQIT
VYEDNKIPSW YKEGIIYQIF VDRFFNGNKD SMILNKKKNS FIYGNWYDEP MYIRDSNGNI
KRWDFYGGNL RGVIEKLDYI KSLGVNIIYM NPIFDAVSCH KYDTGDYENI DKMYGTNSDF
NELCQKAEEK GIRIILDGVF SHTGSDSRYF NKYGNYGELG AYESKYSKYY NWYRFYDYPN
SYECWWGFEN QPNVEELEKT YSDYIVNSEN SIIAKWLRLG ASGWRLDVAD ELPDEFIQMI
KERMKNEKED SVLIGEVWED ASNKVSYSKR RKYLLGNELD SVTNYPYRDI ISNFLNEEIS
SKDFYKVIMS IKENYPRENF FANMNILGNH DTERILTVLK ENLNKLKLAL CLQMTLPGVP
LIYYGDEAGL LGNKDPENRK TYPWGRENKE ILSYYSFFGN FRKNEEVLRK GDFYIFKDTP
EDIIAFKRVY KDKEMIIIVN RSNSRKTITL DSEKRRYKDK FSKEEFYGDG SITLEVEREN
YKILTN