Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1432 |
Symbol | |
ID | 4617686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 1298927 |
End bp | 1300015 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639784516 |
Product | cellulase |
Protein accession | YP_930932 |
Protein GI | 119872925 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.26952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.0787337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAGT TTATATCGTT GTTGAAAAAA CTTTCGGAGG CGAGAGGCCC TTCGGGTTTT GAAGATGAAG TTCGAGAGCT CGTGATTAAA GAAATGGAGC CGTATGTAGA TGAAGTAACA GTAGATAGAT GGGGGAATGT AATAGGTGTC AAAAGGGGTT CTTCTAGCTA CCGCGCTATG GTGGCGGCAC ATATGGACGA GATTGGGCTT GTCGTTGACC ACATAGATAA GGAGGGCTTC CTAAGGTTTA GGCCTATCGG CGGTTGGAAC GAGGTAACTC TCCTCGGCCA GCGGGTTTGG GTTAAAACTC TAGACGGTAG GTGGGTTAGG GGGGTTATCG GCGTTATGCC GCCGCACGTC ACTCCGTCTG GTAAAGAGAG GGAGGCCCCC GAGATGAAAG ATCTCTACAT AGACGTGGGG GCTAGAAACA GAGAGGAGGT CGAAAAGATG GGACTTTCTG TAGGGTCTGT AGCAGTTTTA GACAGAGAGT TCGCCATTCT TAACGAAAGG GTTGTCACTG GAAAGGCTTT TGACGATAGA GTAGGCTTGG CTGTTATGCT CTATACTCTG AGACAACTCG GCGATCTCCC CGCGACTCTA TACGCCGTGG CGACAGTACA AGAAGAGGTA GGGCTTCGCG GTGCGCAAAT CGCCGCCGAG AGAATTAACC CTCATTACGC CATCGCCTTA GACACCACCA TAGCGGCCGA CGTGCCTGGC GTAGGCGAGA GACTACATGT GACAAAGGTG GGGGCGGGGC CCGCTATAAA GGTCATCGAC GGCGGACGCG GGGGTCTCTT CATAGCTCAC CCAGGTCTCA GAGACCACAT TGTGAGAATC GCCAGAGAGG CCGGCATCCC GCACCAGCTA GAGGTTCTAT ATGGCGGCAC TACAGACGCC ATGGCTATAG CCTTTAGGCG CGAGGGCGTG CCCGCCGCCG CTATCTCTAT CCCAACTCGC TATGTCCACT CGCCTGTAGA GCTAGTAGAT CTGTCAGACG CGTTGAACGC CTCTAAACTG TTAAAAAACG TTCTAGAGAA AACGCCGCCT GACATAATAG ACAAATTCCT AGATAGGAGA GTAAAGTGA
|
Protein sequence | MEEFISLLKK LSEARGPSGF EDEVRELVIK EMEPYVDEVT VDRWGNVIGV KRGSSSYRAM VAAHMDEIGL VVDHIDKEGF LRFRPIGGWN EVTLLGQRVW VKTLDGRWVR GVIGVMPPHV TPSGKEREAP EMKDLYIDVG ARNREEVEKM GLSVGSVAVL DREFAILNER VVTGKAFDDR VGLAVMLYTL RQLGDLPATL YAVATVQEEV GLRGAQIAAE RINPHYAIAL DTTIAADVPG VGERLHVTKV GAGPAIKVID GGRGGLFIAH PGLRDHIVRI AREAGIPHQL EVLYGGTTDA MAIAFRREGV PAAAISIPTR YVHSPVELVD LSDALNASKL LKNVLEKTPP DIIDKFLDRR VK
|
| |