Gene Pisl_1907 details

Gene Information

Locus tag: Pisl_1907
Symbol:
ID: 4617453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1722782
End bp: 1724083
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 58%
IMG OID: 639784998
Product: IS605 family transposase OrfB
Protein accession: YP_931397
Protein GI: 119873390
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
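
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1722782
end_bp = 1724083
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # True
print(gene_length_bp % 3 == 0)                        # True
print(expected_protein_length == protein_length_aa)   # True
```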


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.128172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCAG GGGATATAGA AAAGGGGAGC GGACAGAGGA GGCGGAGCCG GAGGCGCAAC 
AAGAGCAACG CTGAGAAGAA AGACGAGAAG AAAGACGTCT TGACGCACGC CGTCGCCGTC
CCCAGCCCGC GGCTGAGCTG GAGGAAGTTC AACGCGTTGA AGGAGCTGGA GGAGAGGTAT
AAGGAGTTCG TCGTCGAGTT CGTTGAATAC GGCTTTAAGC GCGGCGTGAC GGGGCAGGTC
TCACTCCGCA AGGCTCTGTA CAGCGAGCTG AGGGGGAGGT ACCCCGACCT CCCATCCCAC
TACGTCTACA CGGCTGCGCA GGACGCAGCC ACACGCGTAA AGAGCTTTAT GGCGCTGAAG
CGCGAGGGCA AGGCGAAGAC GGAAAAGCCA GAAATACGAA GGATCAGCAT CTGGCTTGAC
GACCGCCTCT GGAAGCCAGA GGGCTATACC GCCATAAGGG TGTCAACGCA CAGAGGTCGG
ATAACGATAC CGCTTTGGCC GACCAAGCAG TTCTGGAAGC ACCTCAACGG CGGCTGGAGG
CTGAAGTCGC AGCCGAGGCT AAAGCTGGAC GAAAAGAGGA GGGCGGTCTA CGTCTACTTC
GTCTTCGAGA AGGTTGTAGA GGAGAGGCCG GCGAAGGGCA TCATCGCCGT TGACCTCAAC
GAGAACAACG TGGCTGTGAA GGCCGGCGGC AGGGTGTACA TCCTTGAGAC TGGGATTAGA
GATATCACAC TCGGCTACCA CAGCCGGAGA GAGGTTATGC AGTCTCTCAA GGGCAACCGC
TACACAAGCC GTGCGCTGAA GAAAAACGAA CTGAACAAGA AGAGCGACAT CCGGAGGAAG
GCGGCCAATT TCGTAGTCAG AGAGGCGGAG AGGTTAGGTG CCGCAATAGC CGTCGAAAAT
TTGCCAAAGG AAGTGCCAAA AAACATGATA TCGAGAGTTG ATGATCCTGT ATTAAGAGAT
AGAATCTACA AAGCTGGCTT TAGAAGTATG TTGAGGGAAA TTATACGTAA GGCAAGGGAG
AGGGGGATCC CCGTGGTGAA GGTCAATCCG AGGAGAACCT CCTCCACCTG CCCGCGGTGT
GGCGGGGGGC TTGCGAGGGG CTCTGCCCCG AGGCTCCTCC GGTGCCCCCA CTGCGGGCGG
GAGTGGGGGA GGGACGTCGC CGCCGTCATA AACACCGAAA GGAGGGCACT CGAGGAGGGC
CGCGTGCCGC CCGGCCCCAT GCCCGATGAC CCCACGCCCG AGGTATCTTG GATACCCATG
AAGGCGTGGG CGAGGAGAAA GTCCCTAGGC ATAACAGCCT AG
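
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed row is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, exactly as displayed in the record.
first_row = "ATGGAAGCAG GGGATATAGA AAAGGGGAGC GGACAGAGGA GGCGGAGCCG GAGGCGCAAC"
seq = clean_sequence(first_row)

print(len(seq))                     # number of bases in the row
print(f"{gc_fraction(seq):.0%}")    # GC fraction of this row only
```

Passing the full 1302 bp sequence through the same two functions gives a value that can be compared with the GC content listed in the record.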
 
Protein sequence
MEAGDIEKGS GQRRRSRRRN KSNAEKKDEK KDVLTHAVAV PSPRLSWRKF NALKELEERY 
KEFVVEFVEY GFKRGVTGQV SLRKALYSEL RGRYPDLPSH YVYTAAQDAA TRVKSFMALK
REGKAKTEKP EIRRISIWLD DRLWKPEGYT AIRVSTHRGR ITIPLWPTKQ FWKHLNGGWR
LKSQPRLKLD EKRRAVYVYF VFEKVVEERP AKGIIAVDLN ENNVAVKAGG RVYILETGIR
DITLGYHSRR EVMQSLKGNR YTSRALKKNE LNKKSDIRRK AANFVVREAE RLGAAIAVEN
LPKEVPKNMI SRVDDPVLRD RIYKAGFRSM LREIIRKARE RGIPVVKVNP RRTSSTCPRC
GGGLARGSAP RLLRCPHCGR EWGRDVAAVI NTERRALEEG RVPPGPMPDD PTPEVSWIPM
KAWARRKSLG ITA
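
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hand-rolled translator, not the tool the database used. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # remove the display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")   # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final twelve bases of the gene sequence.
print(translate("ATGGAAGCAG GGGATATAGA AAAGGGGAGC"))   # MEAGDIEKGS
print(translate("ATAACAGCCT AG"))                      # ITA
```

The two outputs match the first ten and last three residues of the protein sequence shown in the record.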