Gene Pisl_1153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1153 
Symbol 
ID4617463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1045554 
End bp1046621 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content56% 
IMG OID639784249 
Producttransposase, IS605 OrfB 
Protein accessionYP_930667 
Protein GI119872660 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00000318591 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAGCT ATAGAGCGAT TGCGCTTAAA CTGCCGGAGC TGGACTGTGG CGTGGAGCGG 
CTTATGGCGT TGGCGAATCT GGCGCACCGC GGATACCGCG TTGAGCCGCC GGATCTGCCC
AAGACGGTGT CAATAATGCT GTACAGAAGA AGGCATGAGC TCGCGTTTGG CACAGAGCCG
AAGAGGTGGC TTGCCAGAAC GTGGTTTCCT CTCACAACCC TTAGGATTGG GAACGGTCAA
AAAATCGGCG ACGGTGGGGC CCCCGTCGTG TTGGACTTCG ACAGAGGAGT TGTGAAGCTG
AGGTTTATCT GCCACGCCGA GGTGCCCATG CCGAAGTGGG CCTACGACAG GGTCTCCGAG
GGCGGCGACG TGAAGTTCGC CCTCCTTGGT CTCAAGAGGG GGAAGCCGCA CCTAGCGCTG
GTGGCGGAGC GTGAGGTCGA GCTGATACAG ACGAATAGCG TCCTAGTGGT AGACGTCAAT
TCGTGGAGAC ACGGCGTTGT GTGGGCTCTG ATCAGAGACG GAAAAACAAC AAAGTGGGCG
CGAGTGAGAC CGGACTTGGG ATACATAGAG CGGCTATACA GCGAGGTCGT CAGACTTGAG
CACAAATACG GAAAGCTGGA GAGACTTGGT CTCCACGAGG GCAGAGACAG CAAGAAGCTG
TGGAGACAGA TAAAGCAGAA GAGGAGACGG CTTTACGCAT ACCTCAGAGA CTTTGCGCAA
AAGGCGGCGC ACAGATTGGC ACTGAAGGCC GTGAAGCGCC GAGCGGAGGT TTGGATCGAC
GACATGTTGG AGGAGTCCAG GAGGGAGCTG ATTGAGGAGA AACTGCCCAG CGACCTTGTC
AAGCTCTACA TGCTCTACCT CCGTCGCTTT ATCAACTTGT TGACGAACCA ATTGGCGTGG
TACGGCATTC CGTACAGATT TAAGCGTCTG CCGTCCACCG TGTGTCCAGT ATGCGGTTCC
GAGCTGACAC AACTGCCCGA CAGAACAATG GTATGTCAAT GTGGATTCAG AGAAAAGAGA
GACCTAGTGC CGATTAGGTG GGCACTGAAG TACACATCCC CGCCCTAA
 
Protein sequence
MKSYRAIALK LPELDCGVER LMALANLAHR GYRVEPPDLP KTVSIMLYRR RHELAFGTEP 
KRWLARTWFP LTTLRIGNGQ KIGDGGAPVV LDFDRGVVKL RFICHAEVPM PKWAYDRVSE
GGDVKFALLG LKRGKPHLAL VAEREVELIQ TNSVLVVDVN SWRHGVVWAL IRDGKTTKWA
RVRPDLGYIE RLYSEVVRLE HKYGKLERLG LHEGRDSKKL WRQIKQKRRR LYAYLRDFAQ
KAAHRLALKA VKRRAEVWID DMLEESRREL IEEKLPSDLV KLYMLYLRRF INLLTNQLAW
YGIPYRFKRL PSTVCPVCGS ELTQLPDRTM VCQCGFREKR DLVPIRWALK YTSPP