Gene Pisl_1967 details

Gene Information

Locus tag: Pisl_1967
Symbol:
ID: 4617273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1783932
End bp: 1785542
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 51%
IMG OID: 639785058
Product: alpha amylase, catalytic region
Protein accession: YP_931457
Protein GI: 119873450
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
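
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is a minimal illustration, not part of the original record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields above.
START_BP = 1783932
END_BP = 1785542
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the values reported in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1785542 - 1783932 + 1 gives 1611 bp, and 1611 / 3 - 1 gives 536 aa, matching the reported values.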


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000720647
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.000000000127887
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTTGCG TAGTTGAGGG CTGGAGGGCG CATCCCTACT ACGGCGAGGT TGCTAAAGTA 
CGAATAGCCG GCGGCGAGAC TATAGGCGAT TTCTCAGGCT GGATCCGCGG GGCTTTTAGA
GAGTACGTAG AGCTCCCGCC GGGGGTCTAT GAGGTTGTTG TAGACGACAG ACGAGAGGAG
TGTGTAGTGG CGCCGCCTGA ATACCCATGG CATTTCGTAG TTCCCTACAT GGCGGTGGAC
TGGGGAGATG TTGTAGAAAT CCGTATATAC GCCCCAGAGG AGCCCGAGGT GGGGCGTGGC
AGAGTTGTAA AGCTTGCAGA CCTAGGGCCC TTCTCTATCT ACCTCGGGTT AGTCAAAGGG
AGGAGATATA CACTCCGTTG TTGTGGTAAA ACTAGGCGTT TTAAATCCCC ACCGGTGGCC
AATGCGCCTG GAGTGACTGC GATGTATGAA GTTCTGCCAG ACCGAGCCGC CGATAGACTC
GGCTGTAGAG ACCTCAGGCG GGGTTACTGT GGCGGGACGT TAAGAGATGT GGCTAAGCTT
GCAATTACGG CGCTTGATAT AGCAGATACG CTCTATCTCC ACCCGATATA CCCCGCAATG
AGCTATCACC GCTACGACGT AGTTGACCAC CTACAGGTAG ACGAAAAACT AGGCGGCTGG
AGCGCCTTTG TGACGATGAG AGAGACATTG AGACGGCGGG GGATGAAGCT TGTGTTAGAT
ATCGTGCTTT ACCACGTGGG GTTGCGAAAC GCCATATTTC CAGAGGGCCC CTTTGTTTTT
AAAAGCCTAG ACTATACACA TCTTGTCAAG AAGATAGCAG AGAAGATGCC TATGGAGGCG
CTTCGCGAAT TATTTAGGGG GGAGCCGCCA TATGAGACTT TTCTAGACGT CTGGCTAATG
CCAAAGCTCG ATTATTCAAA GCCTAGCGCT GTGGAATACG GGAGGAAAGT CGTCGAGTTT
TGGAAAAGCC ACGTAGATGG ATTTAGGCTA GATGTAGCTC ACGGAATTCC GCCTGGCGCC
TGGGCGGAGA TTTTAAAACC GGCGGAAGGG CTTTATATAT TTGGAGAACA CATGGGAAAC
CCAGCGCCTT TCTATTGGGC AGTACCAGGG TTTACAGCGT ATTTACTCTA CAAGGCGGCG
GTAGATTGGC TGGCTAAAGA CCTAGATAGA TTTGTGAAGT GGACGAACTT ATACATAGCG
CTTACGCCAC CGGCGGCGTT GCCATATATG AACACCTTTC TTGAAAACCA TGATACAGAC
AGAGCCGCCT CTATTTTTGA CATAAATACG CTATATAGAG GGTATGCCCT GATTTTCTCA
CTGCCGGGAG TCCCCTCTAT TTACGCAGGT GGCGAATGTG GAGAAGTCGG CAGAGCAGAA
GACCACACTA ATAGAAGGCC TTACAGCCCC TGTCCCGACT CCCCACTTAG GGAGTTTTTA
AAGAGATTAT ACACGACGAG GAGAGAACTA GCTCTTTACA AAGGGCCTGC GTGGGTTGAG
ACAAGGCGCT CTGAGCTCGT TATACACAGG GGGGCTGTAG ACGTCGCCGT TGGCATAAAT
AAACTTATAG TGAGTAATAC CGAGAGATAT ATAGAACACA AGTTTTTATA G
 
Protein sequence
MTCVVEGWRA HPYYGEVAKV RIAGGETIGD FSGWIRGAFR EYVELPPGVY EVVVDDRREE 
CVVAPPEYPW HFVVPYMAVD WGDVVEIRIY APEEPEVGRG RVVKLADLGP FSIYLGLVKG
RRYTLRCCGK TRRFKSPPVA NAPGVTAMYE VLPDRAADRL GCRDLRRGYC GGTLRDVAKL
AITALDIADT LYLHPIYPAM SYHRYDVVDH LQVDEKLGGW SAFVTMRETL RRRGMKLVLD
IVLYHVGLRN AIFPEGPFVF KSLDYTHLVK KIAEKMPMEA LRELFRGEPP YETFLDVWLM
PKLDYSKPSA VEYGRKVVEF WKSHVDGFRL DVAHGIPPGA WAEILKPAEG LYIFGEHMGN
PAPFYWAVPG FTAYLLYKAA VDWLAKDLDR FVKWTNLYIA LTPPAALPYM NTFLENHDTD
RAASIFDINT LYRGYALIFS LPGVPSIYAG GECGEVGRAE DHTNRRPYSP CPDSPLREFL
KRLYTTRREL ALYKGPAWVE TRRSELVIHR GAVDVAVGIN KLIVSNTERY IEHKFL
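
The relationship between the gene sequence and the protein sequence above can be illustrated with a short Python sketch that translates a CDS using the codon assignments of translation table 11 (identical to the standard code for elongation codons) and computes GC content. This is a hedged example rather than the method used to generate the record: the helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical, the first codon is simply forced to methionine on the assumption that the sequence begins at its start codon (here GTG), and only the first 60 bases of the gene sequence are included as sample input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display groups."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence starts at its start codon, so the first
    residue is reported as M even for alternative starts such as GTG.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues and residues[0] != "*":
        residues[0] = "M"
    return "".join(residues).split("*")[0]


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence shown above (60 bases).
    prefix = "GTGACTTGCG TAGTTGAGGG CTGGAGGGCG CATCCCTACT ACGGCGAGGT TGCTAAAGTA"
    print(translate_cds(prefix))
    print(round(gc_percent(prefix)))
```

Pasting the full gene sequence in place of `prefix` allows the translation to be compared against the listed protein sequence, and the GC percentage against the reported GC content.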