Gene Pisl_0985 details


Gene Information

Locus tag: Pisl_0985
Symbol:
ID: 4618334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 882100
End bp: 885084
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 44%
IMG OID: 639784083
Product: glycoside hydrolase family protein
Protein accession: YP_930503
Protein GI: 119872496
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID:
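
The start, end, gene length and protein length values above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(882100, 885084)
print(length, expected_protein_length(length))  # 2985 994
```

Under these assumptions the computed values agree with the reported 2985 bp and 994 aa.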


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.369995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.376092
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTACT ACATGAGGCG GATAATTTTG TTTCTAGTCT TAGCTTTATT TCTGTTAGCC 
CAGCCAATGA ATATAATATT TGTATTTCAT AACCACCAGC CTTGGTATAT AGACCTAGAA
AAAGGCGAGT TACTACTGCC ATGGGTGAGA ATACATTCTG TTGGAAATTA TCTAAAAGTG
CCCCTTCTTG TAAACCAAAG CGGCGTTTCC GTTGCATATA CTCTTTCTGG GAGTTTAATT
GAACAGATAA ATTGGTACGC TAACAGAACT TACGTCGACG CTAGATATAA GATATCGCAA
AAGATCGCAG AGGGGAAGCC TTTGACTCTG GAGGAGAAAT ACTCCATGTT GTTAGTGCCC
GGGGGATTCT TTGATATAAA TTGGCAAAAT ATTGTGTATA AACACCCAAG ATACACTGTA
TTGTTAGGGA TAAGAAATGA TGCATTTAGT AAATGTCCAC CTGGAAATAT AACATGCATC
GTATCTAGGT TCAGCGAGCA AGACTTTATT GACCTAGCCA CGCTTTTTAA CCTTCTCTGG
ATAGACCCAT ACATAGCACG ACAATACCCA GACGTTTGGG CTATGAGAAA TAAAACCTCC
TTTACGCGCA ACGACCTTAA GAGAGTCTTA GAGGTACACA TGGACCTAAT CTCAAAAGTT
TTGCCTTTAT ATAGAGCTTT GGCACAACAG AAGAGGGTAG AGCTCGTGCT AGTGCCATAC
TCACATCCTC TAATGCCTCT TTTGGCAGAC ATGGGAGCTT TAGAAGACTT AAAAGTTCAC
ATAAGACTCT CGCAGAATCT CTTTGAAAGA TATTTAGGAG TTTCGCCCAC GGGTGTTTGG
CCTCCTGAAC AAGCAGTTAA CGATGATGTG CTCAGACTTT TTACAGAATC CGGCTTTTTA
TGGACTATAA CAGACGAGGA CGTATTGAAG GCAACCATGC CCGGAGCTAG CCACTTCGGT
CTTTACTATG TAGATTATGG AGGACGACGT ATATACGTTT TCTTTAGAGA TAAAACACTG
TCTGACGGCC TAGGTTTTAG ATACGCCTCT ATGAAGCCAG AAGAGGCCTT GGCCGACTTT
ATGAATTATC TAAAGAGGGT GCCACGAGAC GAGTGTTCAG TAGTCGTAGT TGCATTAGAT
GGCGAAAATC CGTGGGAAAA CTATCCAAAT TTTGGAGATG ATTTTCTTAT TAAATTCTTT
GGAGGACTCG CACAGTTGGA GAAAAACGGC ACCATCAAGC TATGGAAACC TACAGACTTT
GTAAAGAGAT GTAGCGAAAA AGCTACGCCA TTACCACAAC GTGAATTTGA ATATTTCAAC
CTAAAAGTAG ATATATCCGT ATATACCTCT ATACGTGATC TACCAACCCG TATTGTCCAG
GGTAGAATTG CAGAAGGCTC GTGGTCTAGT GGGGGTAGTT TAGCCATTTG GATTGGCGAT
GTAGACGAAA ATGTTTGGTG GATGTGGCTA AAGAAGGCTA GAGAAGATGT AGGTCTAAAT
TTGAAGTGGG ATGTACTTTT CCCATTGTTA GTAGCTGAAG CCAGTGACTG GCCGTTTTGG
TACGGCGGCG AAATGGGTTC ACCACAGACC TTTGACCCTG TTGCAAAATC TGCACTGATA
GCATTTTACC GACGGGCTGG TTTACAGCCG CCTATGTATC TTTTCTCCTT GGCGTATCCC
GGCGGGACTC CCCGCGAAAT AGTTGGAAGG GGCGATGGAA AAGTAGCTCT CTACGAAGGG
CTTACGGTTT ATGTAAATAC GACACATATA TGGATAGAGG GCGCGGGGTG TGGCGTAGTC
TATTTCTCAA ATCCAACACT TCCGAGGTCG CCATATTTCT TCAGAGGCGC TGTATATGGA
ATACATGGCG AGAAATTACA CATATATGCC GATATGGCTA TCGATAGCTG TAATAACACA
GTATATCTCT CAGACGGCGG TAAGTTCTAT CCAGTTGGGA AAGCGGCTAG ATCGTACTTT
ATAGGCGCGC AACCTGGCGA CAAACTCTAC GTAGAGTTTA ATGGCCTTGT ATATGTGCTC
ACAATACCTG AAAGCCCAGT ACAACAAAAA TTGTTAATGG AAGTAGCTGA TCCGCCTGGA
GACGACTTTG GCACTGGTAA ATACCGCTAT CCTAAGAACC CCGTCTTCAA GCCCGGGGTT
TTTGACCTAT TAGGATTTAC GCTATACGAC CTGGGCGATA GGCTAAGGTT CATGTTTAGA
GTGAGAGAAT TTGGTGGAAA CCCATGGAGC GGGCCTGCAG GATTTTCGTT ACAGTTTTTC
CACGTCTATA TTAATAGAGG ACGTGGAGAG AGAAATGACA CACTTGGTCT AGGAGTGACT
CTTTGTAAAG AGGCGATGTG GGATGTGGCC TTATTAATAG GCCCTGGTTG GAGCGGCGGT
AATCGTATAG TTTATTCAGA TGGCTCATTT ATAGATGATG CTATGGCTAT AAGGCCAGGC
CCTAATAATA CAATTGTCGC AGATGTTCCC AAGAAGTACA TTGGCGAGTT TGAAAAAAGT
TGGAAATTAA CTGTGTTTCT CACATCGTGG GACGGCTACG GCCCAGACAA TATACGGAGA
TTTGGCGTAG TAGAAGATGA GTGGACTGTC GGCGGCGCAG ATGCCGCCGC GGTGTTGGCA
GGAGTGGCCC CGAGAGTGTT TGATGTGTTA GCGCCTACTG TAGATGCCCA GGTTAGGGCA
CTAACTTCCT ACAAAGTGTC CAGACTGCCT AATGGCACAT ACGTAGGCGC ACCAGCCTCT
GTGTGTATAT TTTACACACA AGAAAAGCCA GCCGCGACAA TTACTGCAAC AATCACCACT
ACAGTAACAC AGACAAGCCG TGAGACAGTT ACTACAACAC AGTACTTAAC GGAAACTCAG
AAAGAAACTG TAAGAGAAGT TAATTGGGTA GCTACAGCTA TAGTCTCAGT GTTAGCTTTT
ATACTTGGTT TAACACCTAG CCTATTGGCA AAAAGAGAGC GGTAA
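
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be flattened before any computation. The following is a minimal sketch of how that might be done and how a GC fraction could be computed from the result. The helper names are hypothetical, and only the first row of the sequence is used here for illustration, so the result is not comparable to the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGCCTACT ACATGAGGCG GATAATTTTG TTTCTAGTCT TAGCTTTATT TCTGTTAGCC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"GC fraction of first row: {gc_fraction(seq):.2f}")
```

Passing the full sequence block through `clean_sequence` in the same way should give a string whose length can be compared with the reported gene length.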
 
Protein sequence
MAYYMRRIIL FLVLALFLLA QPMNIIFVFH NHQPWYIDLE KGELLLPWVR IHSVGNYLKV 
PLLVNQSGVS VAYTLSGSLI EQINWYANRT YVDARYKISQ KIAEGKPLTL EEKYSMLLVP
GGFFDINWQN IVYKHPRYTV LLGIRNDAFS KCPPGNITCI VSRFSEQDFI DLATLFNLLW
IDPYIARQYP DVWAMRNKTS FTRNDLKRVL EVHMDLISKV LPLYRALAQQ KRVELVLVPY
SHPLMPLLAD MGALEDLKVH IRLSQNLFER YLGVSPTGVW PPEQAVNDDV LRLFTESGFL
WTITDEDVLK ATMPGASHFG LYYVDYGGRR IYVFFRDKTL SDGLGFRYAS MKPEEALADF
MNYLKRVPRD ECSVVVVALD GENPWENYPN FGDDFLIKFF GGLAQLEKNG TIKLWKPTDF
VKRCSEKATP LPQREFEYFN LKVDISVYTS IRDLPTRIVQ GRIAEGSWSS GGSLAIWIGD
VDENVWWMWL KKAREDVGLN LKWDVLFPLL VAEASDWPFW YGGEMGSPQT FDPVAKSALI
AFYRRAGLQP PMYLFSLAYP GGTPREIVGR GDGKVALYEG LTVYVNTTHI WIEGAGCGVV
YFSNPTLPRS PYFFRGAVYG IHGEKLHIYA DMAIDSCNNT VYLSDGGKFY PVGKAARSYF
IGAQPGDKLY VEFNGLVYVL TIPESPVQQK LLMEVADPPG DDFGTGKYRY PKNPVFKPGV
FDLLGFTLYD LGDRLRFMFR VREFGGNPWS GPAGFSLQFF HVYINRGRGE RNDTLGLGVT
LCKEAMWDVA LLIGPGWSGG NRIVYSDGSF IDDAMAIRPG PNNTIVADVP KKYIGEFEKS
WKLTVFLTSW DGYGPDNIRR FGVVEDEWTV GGADAAAVLA GVAPRVFDVL APTVDAQVRA
LTSYKVSRLP NGTYVGAPAS VCIFYTQEKP AATITATITT TVTQTSRETV TTTQYLTETQ
KETVREVNWV ATAIVSVLAF ILGLTPSLLA KRER