Gene Pisl_0665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_0665 
Symbol 
ID4617137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp604560 
End bp605690 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content44% 
IMG OID639783759 
ProductPUA domain-containing protein 
Protein accessionYP_930186 
Protein GI119872179 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00451] uncharacterized domain 2 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.438806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.11367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTAT ACGGTATAGA AATAGATTAT GGGCTCTATA AACACTTTCT CGAATATCTC 
TCCGATAACG AAATAGATGC GCTTTTTCAC TCTGTCACAA AGCCCCCCTC TAGATATTAT
ATAAGGGTAA ATACTACAAA GATAAGCCGT AGAGACTTAA TGAAGAGATT AAACTCCAGG
GGAGTTCAGG CATACCCGGA CGAACATTTT GACGATGCCC TATGGCTTCC TGTAGAGGGG
CCTTTTGACA TACCGACAGC CAGGAAACAA GTCATAGTTG ATAAAAAAGC TGCAGAAAGC
GTAATGCTAG GTGCGGATCT TTACGCACCT GGTATCGTAA AGACAGATCA CGTCAAGGAA
GGCGACGAAG TAAATATAGT GTCAGACAAC GGCGTAGTCG TTGCCTTTGG AACTGCTGTT
GTAGATAGTG ATGAAATCTT AAAGACCCGG AGGGGGTTGT ATATAAAGGT AGAGAAATCT
CTTTACAAAG CACCAAAGAT AAGGGACTTA CCTGAGTATA AAGAAGGCTT GTTGTATAGC
CAAAGTCTTC CGGCAATAGC TGTGGGACAT GTGGCAAAGA GGGCAAGAGC TTCTACAGCG
GTAGATCTCA ACGCTGCGCC AGGCGGAAAG GCTACACATT TAGCTCAGAT AGGGCTACGT
ATAATAGCTG TAGATAGATC TTGGCCAAAA ATAGAGAAGC TAAAAGAAGA GGTTAAGAGA
CTAGGTCTAG CCGATAGAAT TGACGTCGTT TTACATGACA GTAGATATTT AGATAGAGAC
TTTCCCCGCT TGGCGGCCGA TTTGGCTCTA GTAGACCCGC CTTGTACAGA CATAGGGGTG
CGCCCCAAGA TTTATCATAA GGTGACTATA GAGATGGCTA AGACGTTATC TAGATATCAG
ATCCAGTTTC TCAAGACAGC ACTTAAGATA GCACCGACGG TCATATACTC CACCTGCACA
CTTACGTATA TAGAAAATGA GGATGTTATA AGGAAAGTAG GCGCAGAGCC CGTTGATACA
GGATTAGAAA TAGGCGCCCC TGGGTGGGGA TGTCCAGAAT GTAGAAGATT CTTACCTCAC
ATACATAATA CGCCTGGTTT CTTCATTGCG CTTTTACGCC GCCGGCGTTA G
 
Protein sequence
MNLYGIEIDY GLYKHFLEYL SDNEIDALFH SVTKPPSRYY IRVNTTKISR RDLMKRLNSR 
GVQAYPDEHF DDALWLPVEG PFDIPTARKQ VIVDKKAAES VMLGADLYAP GIVKTDHVKE
GDEVNIVSDN GVVVAFGTAV VDSDEILKTR RGLYIKVEKS LYKAPKIRDL PEYKEGLLYS
QSLPAIAVGH VAKRARASTA VDLNAAPGGK ATHLAQIGLR IIAVDRSWPK IEKLKEEVKR
LGLADRIDVV LHDSRYLDRD FPRLAADLAL VDPPCTDIGV RPKIYHKVTI EMAKTLSRYQ
IQFLKTALKI APTVIYSTCT LTYIENEDVI RKVGAEPVDT GLEIGAPGWG CPECRRFLPH
IHNTPGFFIA LLRRRR