Gene Pisl_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1067 
Symbol 
ID4616370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp961619 
End bp962869 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID639784163 
ProductIS605 family transposase OrfB 
Protein accessionYP_930583 
Protein GI119872576 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGCAG AGGAGGGCGA GCCGAAGAAG AGGGGCGGAA AGAGCGGCGG GAAAAGCGCC 
GAGAGAGGCA AGAAGAAAGA CGCGGAGAAG AAGAGGGATC ACGTCCTCAC TCGCGCCGTC
GTAATCCCCA GCGCCCGCCT CAGCTGGAGG AAGTTCAACG CGTTGAAGGA GCTTGAAAAG
AAGTACAGAG AGCTGAGGAG GAGGCACCCC GACCTCCCAT CCCACTACGT CTACACGGCG
GCGCAGGACG CGGCCACACG CGTAAAGAGC TTTATGGCGC TGAAGCGCGA GGGCAAGGCG
AAGACGGAAA AGCCAGAAAT ACGAAGGATC AGCATCTGGC TTGACGACCG CCTCTGGAAG
CCAGAGGGCT ATACCGCCAT AAGGGTGTCA ACGCACAGAG GTCGGATAAC GATACCGCTT
TGGCCGACCA AGCAGTTCTG GAAGCACCTC AACGGCGGCT GGAGGCTGAA GTCGCAGCCG
AGGCTAAAGC TGGACGAAAA GAGGAGGGCG GTCTACGTCT ACTTCGTCTT CGAGAAGGTT
GTAGAGGAGA GGCCGGCGAA GGGCATCATC GCCGTTGACC TCAACGAGAA CAACGTGGCT
GTGAAGGCCG GCGGCAGGGT ATACATCCTT GAGACCGGGA TTAGGAACAT CACTGTGGGA
TACCACAGCC GTAGGGAGGT CATGCAGTCT CTCAAGGGCA ACCGCTATAC AAGCCGCGCG
CTGAAGAGAA ACGAGCTGAA CAAGAAGAGC GACATTAGGA GGAAGGCGGC CAATTTCGTA
GTCAGAGAGG CGGAGAGGTT AGGTGCCGCA ATAGCCGTCG AAAATTTGCC AAAGGAAGCG
CCAAAAAACA TGATATCGAG AGTTGATGAT CCTGTATTAA GAGATAGAAT CTACAAAGCT
GGCTTTAGAA GTATGTTGAG GAAAATTATA CGTAAGGCAA GGGAGAGGGG GATCCCCGTG
GTGAAGGTCA ATCCGAGGAG AACCTCCTCC ACCTGCCCGC GGTGTGGCGG GGGGCTTGCG
AGGGGCTCTG CCCCGAGGCT CCTCCGGTGC CCCCACTGCG GGCGGGAATG GGGGAGGGAC
GTCGCCGCCG TCATAAACAT CGAAAGGAGG GCACTCGAGG AGGGCCGCGT GCCGCCCGGC
CCCATGCCCG ATGACCCCAT GCCCGAGGTA GCCTGGCTAC CAATGGGGGC GTGGGCGAGG
AGAAAGTCCC TAGGCGCGAT TAGTCAAGAA TTGTCAGCTA TGACCGCCTA G
 
Protein sequence
MGAEEGEPKK RGGKSGGKSA ERGKKKDAEK KRDHVLTRAV VIPSARLSWR KFNALKELEK 
KYRELRRRHP DLPSHYVYTA AQDAATRVKS FMALKREGKA KTEKPEIRRI SIWLDDRLWK
PEGYTAIRVS THRGRITIPL WPTKQFWKHL NGGWRLKSQP RLKLDEKRRA VYVYFVFEKV
VEERPAKGII AVDLNENNVA VKAGGRVYIL ETGIRNITVG YHSRREVMQS LKGNRYTSRA
LKRNELNKKS DIRRKAANFV VREAERLGAA IAVENLPKEA PKNMISRVDD PVLRDRIYKA
GFRSMLRKII RKARERGIPV VKVNPRRTSS TCPRCGGGLA RGSAPRLLRC PHCGREWGRD
VAAVINIERR ALEEGRVPPG PMPDDPMPEV AWLPMGAWAR RKSLGAISQE LSAMTA