Gene Pisl_0134 details


Gene Information

Locus tag: Pisl_0134
Symbol:
ID: 4616787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 125938
End bp: 127149
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 58%
IMG OID: 639783217
Product: IS605 family transposase OrfB
Protein accession: YP_929660
Protein GI: 119871653
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
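
The start, end, gene length, and protein length reported above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop. Both readings are assumptions, since the record does not state its coordinate convention. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(125938, 127149)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```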


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTATGTTG TAACACACAC CGTCGCCGTC CCCAGCCCGC GGCTGAGTTG GAGGAAGTTC 
AATACGCTGA AGGAGCTTGT GGAGAAGTAT AGGGAGCTGG TCGTCCACCT CGTCGACTAC
GGCTTTAAAC ACGGCGTGAC GGGACAGATC TCGCTCCGCA AGGCCCTGTA TGAAGAACTG
CGCAGAAGGC ACCCCGACCT CCCATCCCAC TACGTCTACA CGGCGGCGCA GGACGCGGCC
GCCCGCGTAA AGAGCTTTAT GGCGCTGAAG CGGGAGGGAA AGGCGAAGAC GGAAAAGCCA
GAGATACGAA GGATCAGCAT TTGGCTTGAC GACCACCTCT GGATCCGCGA GGGCTTCACG
GCGGTAAGGG TGTCGACGCA CAGGGGGTGG GTCACGATTC CGCTTTGGCC CACCAGGCAG
TTCTGGCGCC ACATCAACGA GGGGTGGAGA CTGAAGACAC AGCCGAGGCT AAAGCTGGAC
GAGAAGAGGC GCATCGCCTA CGTCTACTTC GTCTTTGAGA AGGTTGTGGA GGAGAAGCCG
GCGAAGGGCG TTGTCTCCGT CGACCTAAAC GAGAACAACG TGGCTGTGAA GGCCGGCGGC
AGGGTGTACA TCCTTGAGAC TGGGATCAGG GACATAACGC TCGGCTACAA CAGCAGGAGA
GAGGTCATGC AGTCTCTGAA AGGAAATAGG TACGTGAGCC GGGCGCTGAA GAAAAACGAA
CTGAACAAAA AGAACGACAT CCGCAGAAAG GTAGCCAATT TCGTAGTCAG AGAGGCGGAG
AGATTAGGCG CCGCAATAGC CGTCGAAAAT CTGCCAAGGG AAGTGCCAAA AAACATGATC
AAAAATGTAG ATGATCCAAA GCTCAGAGAT AGAATCTACA AAGCTGGATT TAGAAGTATG
TTAAGAGAAA TTATACAAAA GGCAAGGGAA CACGGCATCC CGGTGATAAA GGTCGACCCG
AGGGGTACCT CCTCCACCTG CCCGCGGTGT GGGGGGAGGT TGGTGAGGGG CCCTGCCCCG
AGGCTCCTCC TCTGCCCCCA CTGCGGGTGG GAAGGGGGGA GGGACGTCGC CGCCGTCATA
AACATCGAAA GGAGGGCACT CGAGGAGGGC CGCGTGCCGC CCGGCCCCAT GCCCAATGAC
CCCACGCCCG AGGTATCTTG GATACCCATG ACGGCGTGGG CGAGGAGAAA GTCCCTAGGC
GCAATAGCCT AG
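
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before computing length or GC content. The following Python sketch shows one way to do that. The function names are illustrative, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "GTGTATGTTG TAACACACAC CGTCGCCGTC CCCAGCCCGC GGCTGAGTTG GAGGAAGTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full sequence, `len` should reproduce the reported gene length and `gc_fraction` the reported GC content, up to rounding.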
 
Protein sequence
MYVVTHTVAV PSPRLSWRKF NTLKELVEKY RELVVHLVDY GFKHGVTGQI SLRKALYEEL 
RRRHPDLPSH YVYTAAQDAA ARVKSFMALK REGKAKTEKP EIRRISIWLD DHLWIREGFT
AVRVSTHRGW VTIPLWPTRQ FWRHINEGWR LKTQPRLKLD EKRRIAYVYF VFEKVVEEKP
AKGVVSVDLN ENNVAVKAGG RVYILETGIR DITLGYNSRR EVMQSLKGNR YVSRALKKNE
LNKKNDIRRK VANFVVREAE RLGAAIAVEN LPREVPKNMI KNVDDPKLRD RIYKAGFRSM
LREIIQKARE HGIPVIKVDP RGTSSTCPRC GGRLVRGPAP RLLLCPHCGW EGGRDVAAVI
NIERRALEEG RVPPGPMPND PTPEVSWIPM TAWARRKSLG AIA
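
The protein sequence can be checked against the gene sequence by translating it with translation table 11 (the bacterial, archaeal and plant plastid code). That table assigns the same amino acids as the standard code but accepts alternative start codons such as GTG, which is read as methionine in the initiator position. The sketch below is a hand-rolled illustration, not the database's own pipeline. It assumes the printed sequence is the coding strand in frame 1, and its names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGTATGTTG TAACACACAC CGTCGCCGTC"))  # MYVVTHTVAV
```

The output matches the first ten residues of the protein sequence. The final four codons of the gene, GCA ATA GCC TAG, translate to AIA followed by a stop, which matches the C-terminus.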