Gene Information |
Locus tag | Pisl_0985 |
Symbol | |
ID | 4618334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 882100 |
End bp | 885084 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639784083 |
Product | glycoside hydrolase family protein |
Protein accession | YP_930503 |
Protein GI | 119872496 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
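
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for Pisl_0985.
START_BP = 882100
END_BP = 885084


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Protein length for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which is not counted as an amino acid.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 2985, matches "Gene Length"
print(protein_length_aa(gene_len))  # 994, matches "Protein Length"
```

Under these assumptions, 885084 - 882100 + 1 = 2985 bp and 2985 / 3 - 1 = 994 aa, in agreement with the listed values.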
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.369995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.376092 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACT ACATGAGGCG GATAATTTTG TTTCTAGTCT TAGCTTTATT TCTGTTAGCC CAGCCAATGA ATATAATATT TGTATTTCAT AACCACCAGC CTTGGTATAT AGACCTAGAA AAAGGCGAGT TACTACTGCC ATGGGTGAGA ATACATTCTG TTGGAAATTA TCTAAAAGTG CCCCTTCTTG TAAACCAAAG CGGCGTTTCC GTTGCATATA CTCTTTCTGG GAGTTTAATT GAACAGATAA ATTGGTACGC TAACAGAACT TACGTCGACG CTAGATATAA GATATCGCAA AAGATCGCAG AGGGGAAGCC TTTGACTCTG GAGGAGAAAT ACTCCATGTT GTTAGTGCCC GGGGGATTCT TTGATATAAA TTGGCAAAAT ATTGTGTATA AACACCCAAG ATACACTGTA TTGTTAGGGA TAAGAAATGA TGCATTTAGT AAATGTCCAC CTGGAAATAT AACATGCATC GTATCTAGGT TCAGCGAGCA AGACTTTATT GACCTAGCCA CGCTTTTTAA CCTTCTCTGG ATAGACCCAT ACATAGCACG ACAATACCCA GACGTTTGGG CTATGAGAAA TAAAACCTCC TTTACGCGCA ACGACCTTAA GAGAGTCTTA GAGGTACACA TGGACCTAAT CTCAAAAGTT TTGCCTTTAT ATAGAGCTTT GGCACAACAG AAGAGGGTAG AGCTCGTGCT AGTGCCATAC TCACATCCTC TAATGCCTCT TTTGGCAGAC ATGGGAGCTT TAGAAGACTT AAAAGTTCAC ATAAGACTCT CGCAGAATCT CTTTGAAAGA TATTTAGGAG TTTCGCCCAC GGGTGTTTGG CCTCCTGAAC AAGCAGTTAA CGATGATGTG CTCAGACTTT TTACAGAATC CGGCTTTTTA TGGACTATAA CAGACGAGGA CGTATTGAAG GCAACCATGC CCGGAGCTAG CCACTTCGGT CTTTACTATG TAGATTATGG AGGACGACGT ATATACGTTT TCTTTAGAGA TAAAACACTG TCTGACGGCC TAGGTTTTAG ATACGCCTCT ATGAAGCCAG AAGAGGCCTT GGCCGACTTT ATGAATTATC TAAAGAGGGT GCCACGAGAC GAGTGTTCAG TAGTCGTAGT TGCATTAGAT GGCGAAAATC CGTGGGAAAA CTATCCAAAT TTTGGAGATG ATTTTCTTAT TAAATTCTTT GGAGGACTCG CACAGTTGGA GAAAAACGGC ACCATCAAGC TATGGAAACC TACAGACTTT GTAAAGAGAT GTAGCGAAAA AGCTACGCCA TTACCACAAC GTGAATTTGA ATATTTCAAC CTAAAAGTAG ATATATCCGT ATATACCTCT ATACGTGATC TACCAACCCG TATTGTCCAG GGTAGAATTG CAGAAGGCTC GTGGTCTAGT GGGGGTAGTT TAGCCATTTG GATTGGCGAT GTAGACGAAA ATGTTTGGTG GATGTGGCTA AAGAAGGCTA GAGAAGATGT AGGTCTAAAT TTGAAGTGGG ATGTACTTTT CCCATTGTTA GTAGCTGAAG CCAGTGACTG GCCGTTTTGG TACGGCGGCG AAATGGGTTC ACCACAGACC TTTGACCCTG TTGCAAAATC TGCACTGATA GCATTTTACC GACGGGCTGG TTTACAGCCG CCTATGTATC TTTTCTCCTT GGCGTATCCC GGCGGGACTC CCCGCGAAAT AGTTGGAAGG GGCGATGGAA AAGTAGCTCT CTACGAAGGG CTTACGGTTT ATGTAAATAC GACACATATA TGGATAGAGG GCGCGGGGTG TGGCGTAGTC 
TATTTCTCAA ATCCAACACT TCCGAGGTCG CCATATTTCT TCAGAGGCGC TGTATATGGA ATACATGGCG AGAAATTACA CATATATGCC GATATGGCTA TCGATAGCTG TAATAACACA GTATATCTCT CAGACGGCGG TAAGTTCTAT CCAGTTGGGA AAGCGGCTAG ATCGTACTTT ATAGGCGCGC AACCTGGCGA CAAACTCTAC GTAGAGTTTA ATGGCCTTGT ATATGTGCTC ACAATACCTG AAAGCCCAGT ACAACAAAAA TTGTTAATGG AAGTAGCTGA TCCGCCTGGA GACGACTTTG GCACTGGTAA ATACCGCTAT CCTAAGAACC CCGTCTTCAA GCCCGGGGTT TTTGACCTAT TAGGATTTAC GCTATACGAC CTGGGCGATA GGCTAAGGTT CATGTTTAGA GTGAGAGAAT TTGGTGGAAA CCCATGGAGC GGGCCTGCAG GATTTTCGTT ACAGTTTTTC CACGTCTATA TTAATAGAGG ACGTGGAGAG AGAAATGACA CACTTGGTCT AGGAGTGACT CTTTGTAAAG AGGCGATGTG GGATGTGGCC TTATTAATAG GCCCTGGTTG GAGCGGCGGT AATCGTATAG TTTATTCAGA TGGCTCATTT ATAGATGATG CTATGGCTAT AAGGCCAGGC CCTAATAATA CAATTGTCGC AGATGTTCCC AAGAAGTACA TTGGCGAGTT TGAAAAAAGT TGGAAATTAA CTGTGTTTCT CACATCGTGG GACGGCTACG GCCCAGACAA TATACGGAGA TTTGGCGTAG TAGAAGATGA GTGGACTGTC GGCGGCGCAG ATGCCGCCGC GGTGTTGGCA GGAGTGGCCC CGAGAGTGTT TGATGTGTTA GCGCCTACTG TAGATGCCCA GGTTAGGGCA CTAACTTCCT ACAAAGTGTC CAGACTGCCT AATGGCACAT ACGTAGGCGC ACCAGCCTCT GTGTGTATAT TTTACACACA AGAAAAGCCA GCCGCGACAA TTACTGCAAC AATCACCACT ACAGTAACAC AGACAAGCCG TGAGACAGTT ACTACAACAC AGTACTTAAC GGAAACTCAG AAAGAAACTG TAAGAGAAGT TAATTGGGTA GCTACAGCTA TAGTCTCAGT GTTAGCTTTT ATACTTGGTT TAACACCTAG CCTATTGGCA AAAAGAGAGC GGTAA
|
Protein sequence | MAYYMRRIIL FLVLALFLLA QPMNIIFVFH NHQPWYIDLE KGELLLPWVR IHSVGNYLKV PLLVNQSGVS VAYTLSGSLI EQINWYANRT YVDARYKISQ KIAEGKPLTL EEKYSMLLVP GGFFDINWQN IVYKHPRYTV LLGIRNDAFS KCPPGNITCI VSRFSEQDFI DLATLFNLLW IDPYIARQYP DVWAMRNKTS FTRNDLKRVL EVHMDLISKV LPLYRALAQQ KRVELVLVPY SHPLMPLLAD MGALEDLKVH IRLSQNLFER YLGVSPTGVW PPEQAVNDDV LRLFTESGFL WTITDEDVLK ATMPGASHFG LYYVDYGGRR IYVFFRDKTL SDGLGFRYAS MKPEEALADF MNYLKRVPRD ECSVVVVALD GENPWENYPN FGDDFLIKFF GGLAQLEKNG TIKLWKPTDF VKRCSEKATP LPQREFEYFN LKVDISVYTS IRDLPTRIVQ GRIAEGSWSS GGSLAIWIGD VDENVWWMWL KKAREDVGLN LKWDVLFPLL VAEASDWPFW YGGEMGSPQT FDPVAKSALI AFYRRAGLQP PMYLFSLAYP GGTPREIVGR GDGKVALYEG LTVYVNTTHI WIEGAGCGVV YFSNPTLPRS PYFFRGAVYG IHGEKLHIYA DMAIDSCNNT VYLSDGGKFY PVGKAARSYF IGAQPGDKLY VEFNGLVYVL TIPESPVQQK LLMEVADPPG DDFGTGKYRY PKNPVFKPGV FDLLGFTLYD LGDRLRFMFR VREFGGNPWS GPAGFSLQFF HVYINRGRGE RNDTLGLGVT LCKEAMWDVA LLIGPGWSGG NRIVYSDGSF IDDAMAIRPG PNNTIVADVP KKYIGEFEKS WKLTVFLTSW DGYGPDNIRR FGVVEDEWTV GGADAAAVLA GVAPRVFDVL APTVDAQVRA LTSYKVSRLP NGTYVGAPAS VCIFYTQEKP AATITATITT TVTQTSRETV TTTQYLTETQ KETVREVNWV ATAIVSVLAF ILGLTPSLLA KRER
|
| |