Gene Pisl_1056 details

Gene Information

Locus tag: Pisl_1056
Symbol:
ID: 4618111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 949333
End bp: 950649
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 43%
IMG OID: 639784152
Product: hypothetical protein
Protein accession: YP_930572
Protein GI: 119872565
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1030] Membrane-bound serine protease (ClpP class)
TIGRFAM ID:
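
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 949333
end_bp = 950649

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not translated.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1317
print(leftover_bases)     # 0
print(protein_length_aa)  # 438
```

Under these assumptions the derived values agree with the listed gene length (1317 bp) and protein length (438 aa).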


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.919893
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGT GGATACTTCT ACTTCTTTTG ACATGTTTTT CTAACGCTTA TATAGTACAC 
ACTATATATA CGGTTGAAAT AAACGGTATA ATAGGCCCAT ATACGCTCTC GCAAATAGAG
AGAGCGATTT CTCTTGCCGA GCAAAACAAC GGCCTTGTCC TACTGTTGTT GTCTACGCCA
GGCGGTTTGG CTGATGTAAC ACTTCAAATT ATGAAAGAGG TAGGAAACTC GCCGGTTCCA
GTTGTGGGTT TCGTATATCC AGATTATAGT TACGCTTGGT CTGCGGGCAC TTATGTATTA
ATGTCGACGC ATATAGCGGC TATGGCGCCA CATACAGTAA TAGGGTCTTG TCAGCCTATT
GCTGGAGGGA CGCCGGTTAA CGAGTCGAAG ACTCTTAACG CATTGATAGG GTATCTAGAA
ACTGTTGCCA AGTCTTATGG GCGAAATGGT ACTTTTGCCA GGCTATGTAT AACTAAAAAT
GTGAATCTAG GCGCAGACGA GGCATTAAGA TATAAGGTTA TTGATATAAT TGCAAACGAT
ATAGACGATT TGTTAAGGAA AATTAATGGG AGTTCTATTG TCCTGAGAAA CCAGAGAGTT
GAAATCGTGG TTATTGGAAG TGCTATACAG CCGGTTACGC CCACGCCTAC AGAAACCTTA
CAGATGTGGT TGAGCGACCC CGTGGTGTCA AGTATTTTAT CTCTACTCGC ATTTATACTT
CTATTAACAG CCTTTTTGAC AGGCCACCCA GCGGCCATAG TAGCCGCGAT TATATTACTA
GTTGTCTCTA TGTTTTCTAT TCTCCCAACT GCATGGCTTA GTCTCGTTTT GATAATTATG
GGCGCGTCTC TTATCGTATT TGAAATTTTA GCAGGTATGG CGGCTCATGG AGTTGTCGCT
GGCGTCGGCG CAGTTTTAGT TATAATTGGC TTCTTATCGG CATACCCAGC TAGTGTGTTT
GGCAGAGAGC TTATACATAT CAAAGATTGG TGGTTGATAC AGCTCGGGCT TTATATAAAC
ATAGCAGTAC TAGTCGGCTT TATTGGCCTA ATAATTTACA AGGCTGTGGC AGTTCATAAA
GTAAAACCCC CCTCTGAGTT TTTGACAAAC CTAAGAGGTA TGGAGGGCGT AGCATTAGAC
GACATTGAAC CCGGCGTGCC GGGTTTTGTA AAAGTTTTTG GAGAGTATTG GAAGGCCGTA
TCAGATGTGT CTATAAAAAG GGGTTGTAAA ATAAGGGTTC TTGAGGTACA AGGTGATAGA
TTAAAGGTTG AGCCGGCGGG TATCAACACG CCGGAGGGGA CGGAGGGAGA AAAGTGA
 
Protein sequence
MKKWILLLLL TCFSNAYIVH TIYTVEINGI IGPYTLSQIE RAISLAEQNN GLVLLLLSTP 
GGLADVTLQI MKEVGNSPVP VVGFVYPDYS YAWSAGTYVL MSTHIAAMAP HTVIGSCQPI
AGGTPVNESK TLNALIGYLE TVAKSYGRNG TFARLCITKN VNLGADEALR YKVIDIIAND
IDDLLRKING SSIVLRNQRV EIVVIGSAIQ PVTPTPTETL QMWLSDPVVS SILSLLAFIL
LLTAFLTGHP AAIVAAIILL VVSMFSILPT AWLSLVLIIM GASLIVFEIL AGMAAHGVVA
GVGAVLVIIG FLSAYPASVF GRELIHIKDW WLIQLGLYIN IAVLVGFIGL IIYKAVAVHK
VKPPSEFLTN LRGMEGVALD DIEPGVPGFV KVFGEYWKAV SDVSIKRGCK IRVLEVQGDR
LKVEPAGINT PEGTEGEK
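
As a rough way to relate the two sequence blocks, the following sketch translates a nucleotide string codon by codon and computes its GC fraction. It uses the standard codon assignments, which translation table 11 shares; table 11 differs from the standard code in which start codons it permits, and that does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, as printed above.
first_line = "ATGAAAAAGT GGATACTTCT ACTTCTTTTG ACATGTTTTT CTAACGCTTA TATAGTACAC"
print(translate(first_line))  # MKKWILLLLLTCFSNAYIVH
```

The translated prefix matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.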