Gene Pisl_1146 details


Gene Information

Locus tag: Pisl_1146
Symbol:
ID: 4616987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1036106
End bp: 1037416
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 47%
IMG OID: 639784242
Product: hypothetical protein
Protein accession: YP_930660
Protein GI: 119872653
COG category: [R] General function prediction only
COG ID: [COG4882] Predicted aminopeptidase, Iap family
TIGRFAM ID:
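
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the record itself does not state either convention. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


length = gene_length_bp(1036106, 1037416)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

Both values agree with the Gene Length (1311 bp) and Protein Length (436 aa) listed in the record.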


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000000092529
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTGAGG TTTATACAAA ATGCGTCTCA TATAGAGATT TAGTAGCAGG TTCTCCGCTA 
GAGAGGGAGT TTCTACAGTG GCTCTTGACG TTTTTAGATA CGCCAACTGT ATGGTTTCAC
TTATCTCCCG TGGAGGTTCT CTACTGGGAA GACTCGGGGA CGAGACTGGA GGTGGGTGAG
ACAGCTGTCA AAGGTTTGGC GTTGCCATAT TCCAGCTCTA TAGATGTAGA GGGGAGACTT
GTGCCCATTG ACGGAGATGT AGAGGGGAAT ATTGCTGTTG CTAAATTCCC CGAGGATGTC
GACGACGCTA AATATATAGT TATCGACGCC GCACGTCGCG GCGCTTCTGC CGTGGTTTTC
ACAGGCAGAC CCCCCCGGCG TATTGTAATA ACTGGGGAAT ATGGCTATAA ATTCGACGCG
GCGCCTCCCC CAATTCCTGC CGCCAGCTTT GAAAATATAG ACTCATATAT AAATAAGAGA
GTTAGGCTTG AGATAGAGGT AAAGAGCAGA ATTACATATA GCTATAGTCT CATAGCCTTC
AATAGTTTTG AAGATACGCC TATGATCTCG GCACATTGGG ACCACTGGCT TGTCGGCTCT
ACAGATAACT GCGCCGGCGT AGAAGCCGCA GTTCTCGCCT TTAGCGAGCT CGTGGCTGAA
GAGTTCCCAA TAGCTCTCGG CCTTTTCACC GCCGAGGAGG GCGTGGCGCC ACATGTGCCG
TCTTTTTACT GGGCGTGGGG CTCTCTCAAT TATTTAAAAC GATGGAGGCC TACTCTATTG
ATTAATATCG ACGCCATAGG TCTAGGAACT CCTCGTATGT ACGCAATGCC ATATCTACAC
GAGACGCTCA AGGGGCTGGG CCCTGTTGAA ATGCCAGAGG CTTATTTCGA CAGCGTTCAC
TACGAGAGGT GGGGTCTCCC CTCTATAACT ATCTCATCGC TAAAAGACAC TTGGGACATT
TACCACAGCC CCCTTGATAT AAATGTCGAC GCCGACAATA TTCTATACGT CGCCGAGTTG
GCAAAACGGT TGGCCAAAAT AAAGCCGCAG ATTCCGTCGA TTAGCCTTGA GGAGTATGGC
CTCCCGCCAA TTAATAACCA TTATATAGCT TGGTCATTGA TATATAACTA TCTCGTTATC
TTTAAAGATT TTACACATTC TGACATTATA TATACTAATG TCTTTAGATT TCTAAAGAGA
GACAGTAGGA GTTACCGACG TATAGACTTA ATGGGAGGCC CCACGCTATG TATAAACAAT
TGCGAAAACG CTTTTGAGAC ATACCGCGAG CTAGCGTTGC TCAGGCTCTA G
 
Protein sequence
MTEVYTKCVS YRDLVAGSPL EREFLQWLLT FLDTPTVWFH LSPVEVLYWE DSGTRLEVGE 
TAVKGLALPY SSSIDVEGRL VPIDGDVEGN IAVAKFPEDV DDAKYIVIDA ARRGASAVVF
TGRPPRRIVI TGEYGYKFDA APPPIPAASF ENIDSYINKR VRLEIEVKSR ITYSYSLIAF
NSFEDTPMIS AHWDHWLVGS TDNCAGVEAA VLAFSELVAE EFPIALGLFT AEEGVAPHVP
SFYWAWGSLN YLKRWRPTLL INIDAIGLGT PRMYAMPYLH ETLKGLGPVE MPEAYFDSVH
YERWGLPSIT ISSLKDTWDI YHSPLDINVD ADNILYVAEL AKRLAKIKPQ IPSISLEEYG
LPPINNHYIA WSLIYNYLVI FKDFTHSDII YTNVFRFLKR DSRSYRRIDL MGGPTLCINN
CENAFETYRE LALLRL
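
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the input is a complete coding sequence read in frame from its first base, and that the first codon (GTG here) is read as methionine because it acts as the initiator, as translation table 11 permits. The sense-codon assignments used are those of the standard genetic code, which table 11 shares. `translate_cds` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; spaces and line breaks in the input are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein:
        protein[0] = "M"   # assumption: the first codon is the initiator
    if protein and protein[-1] == "*":
        protein.pop()      # drop the terminal stop codon
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGACTGAGG TTTATACAAA ATGCGTCTCA"))  # MTEVYTKCVS
```

The output matches the first ten residues of the listed protein sequence. Passing the full 1311 bp gene sequence should, under the same assumptions, reproduce the 436-residue protein.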