Gene Cphy_2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2304 
Symbol 
ID5745363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2836995 
End bp2840126 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content39% 
IMG OID641293394 
Productpullulanase, type I 
Protein accessionYP_001559404 
Protein GI160880436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02103] alpha-1,6-glucosidases, pullulanase-type
[TIGR02104] pullulanase, type I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAGAA TTATAGTTGG AATGCTAGTT GTTGCCCTTT TGCTTTCGAT GATAGGTGTT 
CAGACAAAGA ATGCAAAAGG AGCTACAGAT ACCATTACGA TTCATTATCA TAGGGATGAT
GGCGATTATG AAAAATGGAA TCTGTGGTTA TGGGCAGAAG GAAAAGATGG CGCGGCATAT
TATTTTGATG GAGAAGATGC ATTTGGGCCG TATGTTTCGG TTTCTCTGGA TAAGAGTGCA
GACAGGATAG GGTTTATTGT CCGTACGGAT TCTTGGGAAA AAGATGTTTC GGAGGACCGG
TTTATTGATA CATCACTTGG GGATGAAATC TGGATATCCA GTGGAGAGAG CACATTTTCT
TATGAAGCAC CAGAAGGGTA TGAAAAAGAG GTATCAATAG AATCTTTCCA GCTTAAGCTT
AATTACTTAA GGTATGATGA AGAGTATACG GATATTTCAT TTCGATTAAC CTTTGAGGAT
GGGACGACAG ATTTTCTTAC TAAAGAGCAT ATGCGTATTG AAAATGGTAT ATTAAAAGCA
GAAAAAGAAG TCAAATATGG TAAAAAGATA ACACTTGATG TATTAAAAAA TGGGTTAGAA
GAGGATTATC AAGGTGTTTC TTTTTCTACG GCCAAAATTG ATGAGGAAAG TAAGCTAGAG
ATGTATTGGA TGCAGGGAAC AGGAACTATT TCACCGAAGG CTGACTTTAT CAAGAGAAGT
AAGGAAATTG AATCGGCATT GATTACTTCC ATGAAGGAAA TAACAGTCAA GCTTTCCGTT
CCTTGCAGAG TAGATGATAT CAAGCAGGAT GGATTTAAAC TTTCGCCGAA ATTAGCTGTT
TCCAAGGTGG AAGCGACAAG TACAAGGGAC AGTGAATACA AAACAATCAA GGAAGGATAT
GCAGATACTT TTATTATCAC GATGGAGGAA CCTTTGGATA TGTCAAAGAA ATATGCATTA
TCAAAAACAG ACTATGGAAG CCGGAATTTA ACCCTTGATT CAGGGCTCTA TACCTCAGAA
GAATTTGAAG CCGCCTATAC TTATGAGGGT AATGATTTAG GTGCAACGTA TAGCAAGGAA
AAGACTGTAT TTAAGGTATG GTCACCTTCT GCGGAAAGTA TTTCTGTACT TTTTTATCCG
CATGGAGAAG CCAAAGACGG GGAGAAACCG GAGATAACTT ACCCTATGAA GCAAACTGGC
GCTGGTGTTT GGCAGGCAGA AATAGAAGGA GATTTAAAAA ATAAATATTA TGTATACCAA
GTTACAGTAG ATGGAAAAAC AAAGCTTGTA GTTGATCCCT ATGCAAAAGC GGCAGGTGTA
AACGGTGAGC GAGGTATGGT AATTGATTTA TCTGAAACGG ACCCAGATGG CTTTAGAGAA
CATAGTTCAC CAGAATTTAA AAATCCGGTG GATGCGGTGA TATATGAAAT TCACGTTAGA
GACCTTTCCA TGAATGAAAA CTCTGGAATT GAAAATAAGG GGAAGTTTCT TGGATTTACA
GAAACAGGGA CAACGAATAG TGCTGGTCTT TCTACTGGTC TTGACCATAT GAAGGAACTT
GGTGTCACCC ATGTCCACCT TCTTCCAAGC TTTGATTATA AAACCATTGA TGAGTCGAAA
CTGGGTGAAA ATAAGTTTAA CTGGGGATAT GACCCACAGA ACTATAATCT TCCGGAGGGT
TCTTATACTA CGGATCCTTA TCAAGGGGAA GTACGTGTCA GGGAATATAA GGAGATGGTT
CAGGCTCTGC ATGAAAATGG CCTACATGTA GTTATGGACG TGGTTTATAA TCATACTTAT
ACGGCAGGAG ATTCCAACTT TACATCCTTA GTGCCAGGAT ATTATTACAG AACTGACATA
AACGGAAATT TCACTAATGG CTCTGGTTGT GGTAATGAGA CAGCTTCAGA ACGTGCCATG
GTAAGAAAAT TTATTGTTGA TAGCGTAGTT TATTGGGCGA CAGAGTATAA AGTAGATGGC
TTCCGGTTTG ACTTAATGGG GTTACATGAC ATAGAAACAA TGAACATGGT AAGAGAAGCT
TTAGATAAAA TAGATCCATC TATTCTTCTG TACGGGGAAG GATGGACAGG AGGGTCTACT
CCTCTACCAG ATTCAAAGCA GGCAATTAAG AATAATGCAG TAGAACTCAA TGAAAGAATT
GCATGCTTTA GCGATGATAT ACGAGACGCC ATAAAGGGTA GTGTATTTGA TGCTTCTGAT
ACAGGATTTA TTAACAGTGG AAAACGCAAT GTCTCCAATA GGGATGAATC CATAAAATTC
GGTATCGTAG CATCTGTTTC TCATCCACAA GTGAATTTAA GCGGTGTGCC ATATTCCAGC
CGTTTTTGGG CAAATGAGCC GTCACAGACC ATTAATTATG CATCTGCACA TGACAACCTG
ACTTTATGGG ATAAGCTGTT AGAAACGAAT AAAATGGCGT CTAAAGAAGA GTTGGTACAG
ATGAACAAAT TGTCTGCGGC AATTGTACTA ACTTCCCAAG GAATTCCGTT TTTCCAAGCA
GGGGAAGAAA TGGCTAGGAC AAAGAAGGGA AATGATAATT CCTATCAGTC GCCAGACAGC
ATTAATATGT TGAATTGGGA CAATAAGACA GAATACAAGG ATTTATTTGA ATATTACAAA
GGATTAATCG CTCTTAGAAA AACTTACGAT GCATTCCGTA TGCAGACAGC AGAAGAAATA
CAACAAAAGT TAGAATTTGT CGATTCTGAC TCTTCCGTGA TTGCTTACCG AATTCATGAT
GCGGTAAAAG ATGGTAGAGA AATCGCATTA ATATTCAATG GAACATTAGA AGAAAAGGAA
GTAGTACTTT CTGCAAATGC ATGGGATGTA TTAGTAAACC AAGATACTGC TGGAACCGAT
GTCATAGAAA CCATAACAGG TGGAACAATT AAAGTGCCCG CAAAATCTAC ACTAGTTCTT
CTAGAGAATA AAGATGCAGT AATAAAAGGC GATAAAGATG CAGTAAAGGG AGACGAAATC
CAAGAGCTAC CTACGAATAT GCAGGAAGTA GCAGAAAAAG AGAGCGGGAA TGCATGGTTA
TGGGTCGGCA TAGCTACAGT CTGTGTTCTT GCAGGAGGAG TCCTATTCTG GATTTTAAAA
AGAAAACGCT AG
 
Protein sequence
MRRIIVGMLV VALLLSMIGV QTKNAKGATD TITIHYHRDD GDYEKWNLWL WAEGKDGAAY 
YFDGEDAFGP YVSVSLDKSA DRIGFIVRTD SWEKDVSEDR FIDTSLGDEI WISSGESTFS
YEAPEGYEKE VSIESFQLKL NYLRYDEEYT DISFRLTFED GTTDFLTKEH MRIENGILKA
EKEVKYGKKI TLDVLKNGLE EDYQGVSFST AKIDEESKLE MYWMQGTGTI SPKADFIKRS
KEIESALITS MKEITVKLSV PCRVDDIKQD GFKLSPKLAV SKVEATSTRD SEYKTIKEGY
ADTFIITMEE PLDMSKKYAL SKTDYGSRNL TLDSGLYTSE EFEAAYTYEG NDLGATYSKE
KTVFKVWSPS AESISVLFYP HGEAKDGEKP EITYPMKQTG AGVWQAEIEG DLKNKYYVYQ
VTVDGKTKLV VDPYAKAAGV NGERGMVIDL SETDPDGFRE HSSPEFKNPV DAVIYEIHVR
DLSMNENSGI ENKGKFLGFT ETGTTNSAGL STGLDHMKEL GVTHVHLLPS FDYKTIDESK
LGENKFNWGY DPQNYNLPEG SYTTDPYQGE VRVREYKEMV QALHENGLHV VMDVVYNHTY
TAGDSNFTSL VPGYYYRTDI NGNFTNGSGC GNETASERAM VRKFIVDSVV YWATEYKVDG
FRFDLMGLHD IETMNMVREA LDKIDPSILL YGEGWTGGST PLPDSKQAIK NNAVELNERI
ACFSDDIRDA IKGSVFDASD TGFINSGKRN VSNRDESIKF GIVASVSHPQ VNLSGVPYSS
RFWANEPSQT INYASAHDNL TLWDKLLETN KMASKEELVQ MNKLSAAIVL TSQGIPFFQA
GEEMARTKKG NDNSYQSPDS INMLNWDNKT EYKDLFEYYK GLIALRKTYD AFRMQTAEEI
QQKLEFVDSD SSVIAYRIHD AVKDGREIAL IFNGTLEEKE VVLSANAWDV LVNQDTAGTD
VIETITGGTI KVPAKSTLVL LENKDAVIKG DKDAVKGDEI QELPTNMQEV AEKESGNAWL
WVGIATVCVL AGGVLFWILK RKR