Gene Pisl_1033 details

Gene Information

Locus tag: Pisl_1033
Symbol:
ID: 4616756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 931388
End bp: 932710
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 54%
IMG OID: 639784130
Product: transposase, IS605 OrfB
Protein accession: YP_930550
Protein GI: 119872543
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID:
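
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon (the function name `cds_lengths` is hypothetical, introduced only for illustration):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(931388, 932710))  # (1323, 440)
```

With the values from this record, the result agrees with the listed Gene Length (1323 bp) and Protein Length (440 aa).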


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000000000086107
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAACTTT TGGCTTGCCC CCGTGTGTCT TCTATAACCA ACAGAGATTT CAGCCCCACC 
ACATATACAC ACATGTACGT ATACAGAACG CTGAGGATAG AGATACCTTG GCGCCTTGTC
GAAGAGAGAC CGGACGTCCT CGGCCTCGCC GTAAGGATGC GCCTAGCGGT GGAGGAGTAC
GCCAGAAGGC TGTTAAAGGA GTTGACGGGG CAAGAGGAGC CCAAGCTCGC GCCGGAGGAG
CTTGTCTGCT TGCTTACTCC CGACAGACGG GAGCTGGCAC GGCGGATTAT CGAGGAGGTG
TTTCCCAAGT ACGGACTTAA GAGATATATC GCAGAGTGGG CTAAGTTCTT CTGGCGCGAC
GTAGTGTTCC ACAGGGCGGT TCCGCTCAAC GCCCAACTTA GAGTTGGGAA CGAAAGAGAC
ATAAGTATGG CGGTCTTTGT CGACCTAAAG AGCGGTATTG TTAGAGTAAG AAAACTCGGC
ATACCGCCTT TCGCCGTCGA GTTGAAGAAG AACAACATAG TTTGGATAAG GGAGAGACTA
GAAGAGGGCG CCAAATTGAA GTTGGCGTTC CTCGGCATAG AGAGACAGAG GGGCAAGGAG
CCGACCTACG GCAAGCTCTA CGTCGCCCTC GTCTTCGCCC GCGAAGTTCA ACAGATAAAG
CCCAGGGCCG TTGTTGTCGT TGACGTGAAC CGTCTCGACA ACGGCGTCAC GGCGGGTCTC
CTCGTGGATG GAAAACTGAG ACAGACGTTG AGACTTCCCG ACGAGAGCGC GGTAAGAGAA
CTGAGGAGAC TCCACGAGGA GATAAGCCGT CTTGACGGAA AAGCCGCTAG GGAGGCGGAT
CCCGTCAGAA GGAGACGTCT CGAAGACAGA GCACGTTATC TCGCGTCTAA GCGGTTTAGG
AAGATAAGGG GCATTGTGGC GCATATCGCT AGAGAAATAA TCGAGCTCGC CAAGACGTAC
AGCGCCGCCG TCGTGGTGGA CACAATGGAG GACGAAACAT ATCGAGAGCT CAAAGAAAGG
AACAGTAGCG GAGTGAAGAA ACACTTTCTA GACGGGTTGG GCCAACTGAG GAGGCGCCTA
CAACACTTGG CACAGTGGTA CGGCTTGCCG TATTTGGAGG AGCGGCTGTA TTCGACCATC
TGCCCCCGCT GTGGCGCGAA GATGAAAGAG CTGAATAACA GGCGAATGCG GTGTCCCGTC
TGCGGCTTCA ACAACAACCG AGACAACGTG CCGCTGATAT GGGCAAAGAG AAGGTACTGG
GAAATCCTCC AAAAGACAAA ACAACCCGCT TTTTCAGCGA CCACCACACT TTTAACCTCG
TAA
 
Protein sequence
MKLLACPRVS SITNRDFSPT TYTHMYVYRT LRIEIPWRLV EERPDVLGLA VRMRLAVEEY 
ARRLLKELTG QEEPKLAPEE LVCLLTPDRR ELARRIIEEV FPKYGLKRYI AEWAKFFWRD
VVFHRAVPLN AQLRVGNERD ISMAVFVDLK SGIVRVRKLG IPPFAVELKK NNIVWIRERL
EEGAKLKLAF LGIERQRGKE PTYGKLYVAL VFAREVQQIK PRAVVVVDVN RLDNGVTAGL
LVDGKLRQTL RLPDESAVRE LRRLHEEISR LDGKAAREAD PVRRRRLEDR ARYLASKRFR
KIRGIVAHIA REIIELAKTY SAAVVVDTME DETYRELKER NSSGVKKHFL DGLGQLRRRL
QHLAQWYGLP YLEERLYSTI CPRCGAKMKE LNNRRMRCPV CGFNNNRDNV PLIWAKRRYW
EILQKTKQPA FSATTTLLTS
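
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. This is consistent with translation table 11, where GTG can serve as an initiator codon. The sketch below illustrates the idea under stated assumptions: the input starts at the annotated start codon, the first codon is always reported as M (a simplification; it does not check that the codon is a valid table 11 initiator), and translation stops at the first stop codon. The names `BASES`, `AMINO_ACIDS`, and `translate_table11` are hypothetical and not part of the source page.

```python
# Codon-to-amino-acid string in TCAG order (first, second, third base).
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate_table11(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        idx = (16 * BASES.index(codon[0])
               + 4 * BASES.index(codon[1])
               + BASES.index(codon[2]))
        aa = AMINO_ACIDS[idx]
        if aa == "*":
            break
        # Assumption: the first codon is the annotated initiator, read as M.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_table11("GTGAAACTTT TGGCTTGCCC CCGTGTGTCT"))  # MKLLACPRVS
```

The output matches the first ten residues of the protein sequence listed above.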