Gene Information |
Locus tag | Cphy_2304 |
Symbol | |
ID | 5745363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2836995 |
End bp | 2840126 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641293394 |
Product | pullulanase, type I |
Protein accession | YP_001559404 |
Protein GI | 160880436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02103] alpha-1,6-glucosidases, pullulanase-type; [TIGR02104] pullulanase, type I |
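The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2836995
end_bp = 2840126
gene_length_bp = 3132
protein_length_aa = 1043


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks hold for this record: the span covers 3132 bp, and 3132 / 3 = 1044 codons, one of which is the stop codon, leaving 1043 residues.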
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAGAA TTATAGTTGG AATGCTAGTT GTTGCCCTTT TGCTTTCGAT GATAGGTGTT CAGACAAAGA ATGCAAAAGG AGCTACAGAT ACCATTACGA TTCATTATCA TAGGGATGAT GGCGATTATG AAAAATGGAA TCTGTGGTTA TGGGCAGAAG GAAAAGATGG CGCGGCATAT TATTTTGATG GAGAAGATGC ATTTGGGCCG TATGTTTCGG TTTCTCTGGA TAAGAGTGCA GACAGGATAG GGTTTATTGT CCGTACGGAT TCTTGGGAAA AAGATGTTTC GGAGGACCGG TTTATTGATA CATCACTTGG GGATGAAATC TGGATATCCA GTGGAGAGAG CACATTTTCT TATGAAGCAC CAGAAGGGTA TGAAAAAGAG GTATCAATAG AATCTTTCCA GCTTAAGCTT AATTACTTAA GGTATGATGA AGAGTATACG GATATTTCAT TTCGATTAAC CTTTGAGGAT GGGACGACAG ATTTTCTTAC TAAAGAGCAT ATGCGTATTG AAAATGGTAT ATTAAAAGCA GAAAAAGAAG TCAAATATGG TAAAAAGATA ACACTTGATG TATTAAAAAA TGGGTTAGAA GAGGATTATC AAGGTGTTTC TTTTTCTACG GCCAAAATTG ATGAGGAAAG TAAGCTAGAG ATGTATTGGA TGCAGGGAAC AGGAACTATT TCACCGAAGG CTGACTTTAT CAAGAGAAGT AAGGAAATTG AATCGGCATT GATTACTTCC ATGAAGGAAA TAACAGTCAA GCTTTCCGTT CCTTGCAGAG TAGATGATAT CAAGCAGGAT GGATTTAAAC TTTCGCCGAA ATTAGCTGTT TCCAAGGTGG AAGCGACAAG TACAAGGGAC AGTGAATACA AAACAATCAA GGAAGGATAT GCAGATACTT TTATTATCAC GATGGAGGAA CCTTTGGATA TGTCAAAGAA ATATGCATTA TCAAAAACAG ACTATGGAAG CCGGAATTTA ACCCTTGATT CAGGGCTCTA TACCTCAGAA GAATTTGAAG CCGCCTATAC TTATGAGGGT AATGATTTAG GTGCAACGTA TAGCAAGGAA AAGACTGTAT TTAAGGTATG GTCACCTTCT GCGGAAAGTA TTTCTGTACT TTTTTATCCG CATGGAGAAG CCAAAGACGG GGAGAAACCG GAGATAACTT ACCCTATGAA GCAAACTGGC GCTGGTGTTT GGCAGGCAGA AATAGAAGGA GATTTAAAAA ATAAATATTA TGTATACCAA GTTACAGTAG ATGGAAAAAC AAAGCTTGTA GTTGATCCCT ATGCAAAAGC GGCAGGTGTA AACGGTGAGC GAGGTATGGT AATTGATTTA TCTGAAACGG ACCCAGATGG CTTTAGAGAA CATAGTTCAC CAGAATTTAA AAATCCGGTG GATGCGGTGA TATATGAAAT TCACGTTAGA GACCTTTCCA TGAATGAAAA CTCTGGAATT GAAAATAAGG GGAAGTTTCT TGGATTTACA GAAACAGGGA CAACGAATAG TGCTGGTCTT TCTACTGGTC TTGACCATAT GAAGGAACTT GGTGTCACCC ATGTCCACCT TCTTCCAAGC TTTGATTATA AAACCATTGA TGAGTCGAAA CTGGGTGAAA ATAAGTTTAA CTGGGGATAT GACCCACAGA ACTATAATCT TCCGGAGGGT TCTTATACTA CGGATCCTTA TCAAGGGGAA GTACGTGTCA GGGAATATAA GGAGATGGTT CAGGCTCTGC ATGAAAATGG CCTACATGTA GTTATGGACG TGGTTTATAA TCATACTTAT 
ACGGCAGGAG ATTCCAACTT TACATCCTTA GTGCCAGGAT ATTATTACAG AACTGACATA AACGGAAATT TCACTAATGG CTCTGGTTGT GGTAATGAGA CAGCTTCAGA ACGTGCCATG GTAAGAAAAT TTATTGTTGA TAGCGTAGTT TATTGGGCGA CAGAGTATAA AGTAGATGGC TTCCGGTTTG ACTTAATGGG GTTACATGAC ATAGAAACAA TGAACATGGT AAGAGAAGCT TTAGATAAAA TAGATCCATC TATTCTTCTG TACGGGGAAG GATGGACAGG AGGGTCTACT CCTCTACCAG ATTCAAAGCA GGCAATTAAG AATAATGCAG TAGAACTCAA TGAAAGAATT GCATGCTTTA GCGATGATAT ACGAGACGCC ATAAAGGGTA GTGTATTTGA TGCTTCTGAT ACAGGATTTA TTAACAGTGG AAAACGCAAT GTCTCCAATA GGGATGAATC CATAAAATTC GGTATCGTAG CATCTGTTTC TCATCCACAA GTGAATTTAA GCGGTGTGCC ATATTCCAGC CGTTTTTGGG CAAATGAGCC GTCACAGACC ATTAATTATG CATCTGCACA TGACAACCTG ACTTTATGGG ATAAGCTGTT AGAAACGAAT AAAATGGCGT CTAAAGAAGA GTTGGTACAG ATGAACAAAT TGTCTGCGGC AATTGTACTA ACTTCCCAAG GAATTCCGTT TTTCCAAGCA GGGGAAGAAA TGGCTAGGAC AAAGAAGGGA AATGATAATT CCTATCAGTC GCCAGACAGC ATTAATATGT TGAATTGGGA CAATAAGACA GAATACAAGG ATTTATTTGA ATATTACAAA GGATTAATCG CTCTTAGAAA AACTTACGAT GCATTCCGTA TGCAGACAGC AGAAGAAATA CAACAAAAGT TAGAATTTGT CGATTCTGAC TCTTCCGTGA TTGCTTACCG AATTCATGAT GCGGTAAAAG ATGGTAGAGA AATCGCATTA ATATTCAATG GAACATTAGA AGAAAAGGAA GTAGTACTTT CTGCAAATGC ATGGGATGTA TTAGTAAACC AAGATACTGC TGGAACCGAT GTCATAGAAA CCATAACAGG TGGAACAATT AAAGTGCCCG CAAAATCTAC ACTAGTTCTT CTAGAGAATA AAGATGCAGT AATAAAAGGC GATAAAGATG CAGTAAAGGG AGACGAAATC CAAGAGCTAC CTACGAATAT GCAGGAAGTA GCAGAAAAAG AGAGCGGGAA TGCATGGTTA TGGGTCGGCA TAGCTACAGT CTGTGTTCTT GCAGGAGGAG TCCTATTCTG GATTTTAAAA AGAAAACGCT AG
|
Protein sequence | MRRIIVGMLV VALLLSMIGV QTKNAKGATD TITIHYHRDD GDYEKWNLWL WAEGKDGAAY YFDGEDAFGP YVSVSLDKSA DRIGFIVRTD SWEKDVSEDR FIDTSLGDEI WISSGESTFS YEAPEGYEKE VSIESFQLKL NYLRYDEEYT DISFRLTFED GTTDFLTKEH MRIENGILKA EKEVKYGKKI TLDVLKNGLE EDYQGVSFST AKIDEESKLE MYWMQGTGTI SPKADFIKRS KEIESALITS MKEITVKLSV PCRVDDIKQD GFKLSPKLAV SKVEATSTRD SEYKTIKEGY ADTFIITMEE PLDMSKKYAL SKTDYGSRNL TLDSGLYTSE EFEAAYTYEG NDLGATYSKE KTVFKVWSPS AESISVLFYP HGEAKDGEKP EITYPMKQTG AGVWQAEIEG DLKNKYYVYQ VTVDGKTKLV VDPYAKAAGV NGERGMVIDL SETDPDGFRE HSSPEFKNPV DAVIYEIHVR DLSMNENSGI ENKGKFLGFT ETGTTNSAGL STGLDHMKEL GVTHVHLLPS FDYKTIDESK LGENKFNWGY DPQNYNLPEG SYTTDPYQGE VRVREYKEMV QALHENGLHV VMDVVYNHTY TAGDSNFTSL VPGYYYRTDI NGNFTNGSGC GNETASERAM VRKFIVDSVV YWATEYKVDG FRFDLMGLHD IETMNMVREA LDKIDPSILL YGEGWTGGST PLPDSKQAIK NNAVELNERI ACFSDDIRDA IKGSVFDASD TGFINSGKRN VSNRDESIKF GIVASVSHPQ VNLSGVPYSS RFWANEPSQT INYASAHDNL TLWDKLLETN KMASKEELVQ MNKLSAAIVL TSQGIPFFQA GEEMARTKKG NDNSYQSPDS INMLNWDNKT EYKDLFEYYK GLIALRKTYD AFRMQTAEEI QQKLEFVDSD SSVIAYRIHD AVKDGREIAL IFNGTLEEKE VVLSANAWDV LVNQDTAGTD VIETITGGTI KVPAKSTLVL LENKDAVIKG DKDAVKGDEI QELPTNMQEV AEKESGNAWL WVGIATVCVL AGGVLFWILK RKR
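The relationship between the two sequences above can be spot-checked by translating the first and last blocks of the gene sequence. This is a minimal sketch rather than a full translation tool. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. Alternative start codons, which table 11 permits, are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final blocks of the gene sequence in this record.
first_blocks = "ATGCGAAGAA TTATAGTTGG AATGCTAGTT"
last_blocks = "AGAAAACGCT AG"

print(translate(first_blocks))  # MRRIIVGMLV
print(translate(last_blocks))   # RKR
```

The two outputs match the first ten residues and the last three residues of the protein sequence listed in the record.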