Gene Pisl_0972 details


Gene Information

Locus tag: Pisl_0972
Symbol:
ID: 4617292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 868938
End bp: 870356
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 47%
IMG OID: 639784071
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_930491
Protein GI: 119872484
COG category: [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase; [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
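
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(868938, 870356)
print(length, protein_length(length))
```

With the values from this record, the two results agree with the listed Gene Length (1419 bp) and Protein Length (472 aa).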


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.493614
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAACGTG TCTTAATAGT GAGGGTAGGC GAGCTAACCA TCAAGAGAGG CAAGACACGT 
GTAGAAATGG AAAGACTTTT GCTAAGAGCC GCCAAAGAGG CGGCAACAGA ATGCGGCAAC
GTTAGATTTG TCAGAGAACC CGGGAGGATA TATGCACTCG GCGATATCGA CTGTTTAAGA
AATAAGCTCT CGAGAGTATT TGGCGTAAAG TCTGTAAGTC CGGCGTATGT AATTATTTTC
GAAAAGATAG AGGATATTGT AGATGCAGCG TTAAAACTCT GGGGTAGCGC GGTGGCGGGG
AGAAAGTTCG CCGTGAGAGT GCATAGAGTT GGTGAGCATA GCTTTACGTC TAGAGATATA
GCTATTGCAG TAGGCGCCGT GTTAACAAAA GCAGGAGGCA AAGTGGATTT GGAAAATCCA
GAAGTTGAGC TTTTTATAGA AGTGCGTAAT AACCGCGCGT TTCTATACAC AGAAGTTATA
GAAGGGCCTG GCGGCCTTCC CATAGGCTCA GAGGGAAAGG TCTTGGCGTT GGTATCAGGC
GGAATAGACT CGCCTGTCGC CGCGTGGATG ATGATGAGGA GGGGGGCTCA TGTCGATGTC
TTTTACTGTA ACCTAGGCGG TACATTTGTC GCCAGGCTTG TCGTAGAAGT TATAAAGAGA
CTTCTCTCTT GGTCCTACGG CTATAACGCC CGCGTAGTTA TTACAGACTG TGCACCTATA
GGCCGCACGA TACGTAGAAA TGTTAAAGAA GAGCTGTGGA ATATCGCTTT TAAGCGCGCG
TTATATCTCA CAGCCCGCAA AGTGGCAGAC ATAGTTAGGG CAACTGCACT AGTGACGGGC
GAATCGTTAG GGCAAGTATC GTCGCAAACT TTACAAGCCC TAGCTGCGGT TGAGAGAGGG
CTAGATATTC CAGTTATGAG ACCCCTAGTA GGGATGGATA AAGATGAAAT TATACAACTG
GCCAGGAAGA TAGGCACATA CGAGCTGTCT ATAAAGATAC CGGAGTATTG TGCCATATTT
AGCAAGAGAC CTAAGAAGTG GGCCACTAGA GAGGAGATAG AGGAGATAGA CTTAGCGATA
TACGACGCAG TAATAGAGGT TGTGAAAAAT ATAAAAATTG TCAAGAAACG AGAGCTAGAT
AGCTATATAG CGTCTCTATC CCCCCCGCAA GACATAGAGA TAGAAAGCTT GCCTCCAGAC
GCCGTTTTGA TAGACTTACG AGACCAAAAG TCTTTTCAAA AATGGCATCT CCCAGGAGCT
TTAAGAGCCG ACCCAGACGA CGTACTAACA TTAGTAGATA GACTGGGGCA CGATAAAACC
TACGTCTTTT ACTGCTACAG CGGAGGTCTT AGCCTAGACG TCGCCGAGAG TCTCCGCAAA
TTTGGAGTAA AGGCGTATTC GCTTAAACTA CGTAAATAG
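
A GC content figure like the one listed in the Gene Information section can be recomputed from a sequence printed in blocks of ten bases. This is a minimal sketch under stated assumptions: the function name `gc_percent` is hypothetical, whitespace and any non-ACGT characters are ignored, and how the database rounds its reported percentage is not known from this record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq.

    Spaces and line breaks from the block layout are skipped, as are any
    characters other than A, C, G, T (an assumption of this sketch).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the gene sequence (with its spaces and line breaks) as a string.
first_block = "GTGGAACGTG TCTTAATAGT"
print(round(gc_percent(first_block), 1))
```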
 
Protein sequence
MERVLIVRVG ELTIKRGKTR VEMERLLLRA AKEAATECGN VRFVREPGRI YALGDIDCLR 
NKLSRVFGVK SVSPAYVIIF EKIEDIVDAA LKLWGSAVAG RKFAVRVHRV GEHSFTSRDI
AIAVGAVLTK AGGKVDLENP EVELFIEVRN NRAFLYTEVI EGPGGLPIGS EGKVLALVSG
GIDSPVAAWM MMRRGAHVDV FYCNLGGTFV ARLVVEVIKR LLSWSYGYNA RVVITDCAPI
GRTIRRNVKE ELWNIAFKRA LYLTARKVAD IVRATALVTG ESLGQVSSQT LQALAAVERG
LDIPVMRPLV GMDKDEIIQL ARKIGTYELS IKIPEYCAIF SKRPKKWATR EEIEEIDLAI
YDAVIEVVKN IKIVKKRELD SYIASLSPPQ DIEIESLPPD AVLIDLRDQK SFQKWHLPGA
LRADPDDVLT LVDRLGHDKT YVFYCYSGGL SLDVAESLRK FGVKAYSLKL RK
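
The protein sequence above corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship; it is not the database's own tooling. The function name `translate` is hypothetical, the codon table is the standard one (which table 11 shares for internal codons), and only ATG, GTG, and TTG are treated as initiation codons here, which is a simplifying assumption.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments for internal codons.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}

# Assumption: a reduced set of alternative start codons, each read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("GTGGAACGTG TCTTAATAGT GAGGGTAGGC"))  # MERVLIVRVG
```

The GTG at the start of this gene is an alternative initiation codon, which is why the protein begins with M rather than V.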