Gene Pisl_1432 details

Gene Information

Locus tag: Pisl_1432
Symbol:
ID: 4617686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1298927
End bp: 1300015
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 52%
IMG OID: 639784516
Product: cellulase
Protein accession: YP_930932
Protein GI: 119872925
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
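
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that reading) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1298927
END_BP = 1300015
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1300015 - 1298927 + 1 gives 1089 bp, and 1089 / 3 - 1 gives 362 aa, which matches the listed values.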


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.26952
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0787337
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAGT TTATATCGTT GTTGAAAAAA CTTTCGGAGG CGAGAGGCCC TTCGGGTTTT 
GAAGATGAAG TTCGAGAGCT CGTGATTAAA GAAATGGAGC CGTATGTAGA TGAAGTAACA
GTAGATAGAT GGGGGAATGT AATAGGTGTC AAAAGGGGTT CTTCTAGCTA CCGCGCTATG
GTGGCGGCAC ATATGGACGA GATTGGGCTT GTCGTTGACC ACATAGATAA GGAGGGCTTC
CTAAGGTTTA GGCCTATCGG CGGTTGGAAC GAGGTAACTC TCCTCGGCCA GCGGGTTTGG
GTTAAAACTC TAGACGGTAG GTGGGTTAGG GGGGTTATCG GCGTTATGCC GCCGCACGTC
ACTCCGTCTG GTAAAGAGAG GGAGGCCCCC GAGATGAAAG ATCTCTACAT AGACGTGGGG
GCTAGAAACA GAGAGGAGGT CGAAAAGATG GGACTTTCTG TAGGGTCTGT AGCAGTTTTA
GACAGAGAGT TCGCCATTCT TAACGAAAGG GTTGTCACTG GAAAGGCTTT TGACGATAGA
GTAGGCTTGG CTGTTATGCT CTATACTCTG AGACAACTCG GCGATCTCCC CGCGACTCTA
TACGCCGTGG CGACAGTACA AGAAGAGGTA GGGCTTCGCG GTGCGCAAAT CGCCGCCGAG
AGAATTAACC CTCATTACGC CATCGCCTTA GACACCACCA TAGCGGCCGA CGTGCCTGGC
GTAGGCGAGA GACTACATGT GACAAAGGTG GGGGCGGGGC CCGCTATAAA GGTCATCGAC
GGCGGACGCG GGGGTCTCTT CATAGCTCAC CCAGGTCTCA GAGACCACAT TGTGAGAATC
GCCAGAGAGG CCGGCATCCC GCACCAGCTA GAGGTTCTAT ATGGCGGCAC TACAGACGCC
ATGGCTATAG CCTTTAGGCG CGAGGGCGTG CCCGCCGCCG CTATCTCTAT CCCAACTCGC
TATGTCCACT CGCCTGTAGA GCTAGTAGAT CTGTCAGACG CGTTGAACGC CTCTAAACTG
TTAAAAAACG TTCTAGAGAA AACGCCGCCT GACATAATAG ACAAATTCCT AGATAGGAGA
GTAAAGTGA
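
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction that can be compared with the GC content field of the record. The helper names are hypothetical, and the short string in the usage lines is just the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small example.
    sample = "ATGGAAGAGT TTATATCGTT"
    print(clean_sequence(sample))
    print(round(gc_fraction(sample), 2))
```

Passing the full sequence text to `gc_fraction` and multiplying by 100 gives a percentage directly comparable with the reported value.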
 
Protein sequence
MEEFISLLKK LSEARGPSGF EDEVRELVIK EMEPYVDEVT VDRWGNVIGV KRGSSSYRAM 
VAAHMDEIGL VVDHIDKEGF LRFRPIGGWN EVTLLGQRVW VKTLDGRWVR GVIGVMPPHV
TPSGKEREAP EMKDLYIDVG ARNREEVEKM GLSVGSVAVL DREFAILNER VVTGKAFDDR
VGLAVMLYTL RQLGDLPATL YAVATVQEEV GLRGAQIAAE RINPHYAIAL DTTIAADVPG
VGERLHVTKV GAGPAIKVID GGRGGLFIAH PGLRDHIVRI AREAGIPHQL EVLYGGTTDA
MAIAFRREGV PAAAISIPTR YVHSPVELVD LSDALNASKL LKNVLEKTPP DIIDKFLDRR
VK
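
The protein sequence can be re-derived from the gene sequence. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). It is an illustration only: it does not handle alternative start codons such as GTG or TTG, which is not a problem here because this gene begins with ATG. All names in the code are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()  # drop block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGGAAGAGT TTATATCGTT GTTGAAAAAA"))  # MEEFISLLKK
```

The first thirty bases translate to MEEFISLLKK, the opening block of the protein sequence above, and the final nine bases (GTAAAGTGA) translate to VK followed by a stop.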