Gene Pisl_1324 details

Gene Information

Locus tag: Pisl_1324
Symbol:
ID: 4617207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1198657
End bp: 1199703
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 48%
IMG OID: 639784413
Product: radical SAM domain-containing protein
Protein accession: YP_930830
Protein GI: 119872823
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
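
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1198657
end_bp = 1199703
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
remainder = gene_length_bp % 3

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1047
print(remainder)                # 0
print(expected_protein_length)  # 348
```

Under these assumptions the computed values agree with the listed Gene Length (1047 bp) and Protein Length (348 aa).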


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.161004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 87
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTTTGT TTAAAAGAGA CGACATAGAA GAGCTCCTAA AGGCAGATCT CTGGGAGCTC 
GGCCGTAGGG CCTATGAAAT AAGGCTGAGG ACGTATGGCA AAACCACTAC CTTTATTTCA
AATATGGTTT TGAACTATAC AAACGTATGT GTAGTTGGAT GCTCTTTCTG CGCTTTCTAC
CGGCCGCCGG GCCACCCCGA GAGTTATGTA TATACGGTGG AGGAGGCGGT CAAGAGAGTG
TTGGCTATAG ACGCTAAATA CGGCATTAGA CAAGTCTTGA TACAAGGCGG GGTTAACCCC
GACGTAGGCA TTGAATACTT TGAGGCGCTT TTCCGCGCCA TAAAGACAAA GGCTCCGCAC
ATAGCTATAC ATGCACTTTC GCCACTAGAG ATAGACTATC TATCGCGGAG AGAACGCGCC
ACATATCGGG AGGTGTTAGA GAGGCTGAGG GAGGCTGGCA TGGACTCTAT GCCAGGTGGC
GGCGGTGAAA TACTTGTCGA CAGAGTGAGG AAGGAGGTCG CGCCCAGGAA GATAGATAGC
TCGACTTGGC TTAGAATTAT GGAGGAGGCA CATAAAATCG GCATCCCAAC CTCAGCTACG
ATGATGTACG GCCATGTTGA AACTATAAGC GACATCGCAG AGCACATGTA CAAAATTGCA
GAACTCCAAG AAAAAACAAA AGGCTTCCTG GCTTTTATCG CTTGGAATTT TGAACCTGGA
ACAAGCGAAC TTGGCAAACG TATAAAATAC CCAAAGACAT CGGCTACGTT GTTGAGAATT
ATAGCTGTGG CGAGGATAGT TTTCGACGGC ATTATCCCCC ATATACAAAG CGGCTGGCTC
ACCACGGGGC CGGAGACTGC GCAATTGGCT ATGTACTTTG GCGCAGACGA CTTCGGAGGC
ACATTGTACG AGGAGAAAGT CCTCGAGTGG AAACGCGTTG AAACGCCGAT AGATAAGAGA
GAAGATGTAA TACAAATCAT AAGATCGGCT AGTTTTACGC CAGCAGAGCG GGATAATATG
TACAACGTCG TCAAGGTATA TGGTTAG
 
Protein sequence
MPLFKRDDIE ELLKADLWEL GRRAYEIRLR TYGKTTTFIS NMVLNYTNVC VVGCSFCAFY 
RPPGHPESYV YTVEEAVKRV LAIDAKYGIR QVLIQGGVNP DVGIEYFEAL FRAIKTKAPH
IAIHALSPLE IDYLSRRERA TYREVLERLR EAGMDSMPGG GGEILVDRVR KEVAPRKIDS
STWLRIMEEA HKIGIPTSAT MMYGHVETIS DIAEHMYKIA ELQEKTKGFL AFIAWNFEPG
TSELGKRIKY PKTSATLLRI IAVARIVFDG IIPHIQSGWL TTGPETAQLA MYFGADDFGG
TLYEEKVLEW KRVETPIDKR EDVIQIIRSA SFTPAERDNM YNVVKVYG
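
The gene and protein sequences above are related through translation table 11, which the record lists for this gene. The following is a minimal sketch of that relationship, not the method used to produce the record. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical. It assumes that table 11 uses the standard codon assignments, that an alternative start codon such as GTG is read as methionine at the first position, and that the sequence is pasted with the whitespace grouping shown in the record.

```python
# Codon table in the usual NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: start codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; a trailing stop codon is dropped."""
    dna = clean(dna)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence must be a non-empty multiple of 3 bases")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon is read as methionine at position 1.
    if codons[0] in START_CODONS:
        residues[0] = "M"
    if residues[-1] == "*":
        residues.pop()
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
prefix = "GTGCCTTTGT TTAAAAGAGA CGACATAGAA"
print(translate_cds(prefix))  # MPLFKRDDIE
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.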