Gene Pars_1882 details

Gene Information

Locus tag: Pars_1882
Symbol:
ID: 5055695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1687623
End bp: 1689047
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 58%
IMG OID: 640469428
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_001154085
Protein GI: 145592083
COG category: [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase; [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
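
The start and end coordinates, gene length, and protein length in this record are mutually consistent. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp copied from the record above.
    length = gene_length(1687623, 1689047)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values listed here, the sketch gives 1425 bp and 474 aa, matching the Gene Length and Protein Length fields.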


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.379133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATGAGG TGGTTGTGGT TAGGCCTGGC GAGTTTACCA TTAAGAGGGG GGCCACCAGA 
GCGGAGATGG AAAAACTATT GCTAAAAGCG GCTAGAGAAG CCGCAGAGGA GTGCGGCGGG
GCCAAATTCG AGAAGGAGCC CGGTAGGATC TATGCGCGTG GCGACACCCA GTGTCTAAAG
AAGGCACTAG CGAGAGTCTT CGGCGTGAAG TCGGTTAGTC CGGCCTATGT CATGAAATTT
GAGGGTGTTG CAGACATCGC CAGGGAGGCG GCGAGGCTAT GGGTGGGTCT GGCGGCTGGG
AGGAGGTTCG CGGTGAGGGT GCACAGGGTG GGGAACCACC CCTTTACTTC TAGAGACGTG
GCGGCGGCGG TGGGCTCCGC CTTGGTGGCC GCCGGCGCGA GGGTCGATCT TGAAAACCCA
GAGGTGGAGT TTTTTGTAGA GGTGAGGGGG GACAGGGCCT ATTTCTACAC TGAGGTGGTG
GAGGGTCCGG GCGGGCTTCC CTTGGGATCT GAGGGCAAGG TGCTGGCCTT AGTGTCGGGA
GGTATAGACT CGCCGGTAGC GGCCTGGTTG TTAATGAGGA GGGGGGCTCA TGTTGATGTT
CTCCACTGCA ACCTGGGGGG CACAGTGGCG CTCAGGCATA CGCTCGAGGT GATCAAAAGA
CTTCTGGCGT GGTCGTACGG CTACAACGCC CGTGTGATAA TAGGTGACTG TAGCCCTGTG
GCAAAGGCGT TGCGTAGTGG AGTGAGAGAG GAGTTGTGGA ATATCGCTTT CAAAAGGGCT
CTCTACCGCA TAGGCGCTGA GGTTGCAAAA ACTGTACGTG CCGCCGCCCT GGTCACCGGG
GAATCCCTTG GCCAAGTCTC GTCACAGACG TTGCAGGCCT TGGCCGCTGC CGAGATGGGA
GTGGGGATAC CCATACTCCG GCCGTTGATA GGCATGGACA AAGATGAGAT AACTAAACTG
GCGCAGAGGA TAGGGACTTA CGAAATCTCC GCAAAAACGC CTGAATACTG CGCGGTTTTC
AGCAGAAGGC CTAAGAAGTG GGCTACAAGA GAGGAGATAG AGGAGATAGA CTTTGCACTG
CACGATGCGG TGGCTGAGGT GGCGAGCAAC GTGAAGGTGG TGAGGAAGTG GCAACTCGCC
GAATTTATCA AAACCCTATC ACCGCCGGAG GACATTGAAG TGGAGACACC GCCGGAGGGG
GCCGTGGTGG TTGACCTGAG AGATGAGGAA TCGTACAGAA AATGGCACCT CCCAGGCGCG
GTTAGGGCCG ATTTCGACGA GGTGCTCTCG CTGGTGGATA AGCTAGGCAG GGATAAGACC
TACGTCTTCT ACTGCTACAG CGGAGGCCTC AGTCTCGACG TCGCAGAAAG TTTGCGCAAG
CTTGGCATTA AGGCATACTC GCTGAGGCTC CGTCGCGGCA CCTAG
 
Protein sequence
MDEVVVVRPG EFTIKRGATR AEMEKLLLKA AREAAEECGG AKFEKEPGRI YARGDTQCLK 
KALARVFGVK SVSPAYVMKF EGVADIAREA ARLWVGLAAG RRFAVRVHRV GNHPFTSRDV
AAAVGSALVA AGARVDLENP EVEFFVEVRG DRAYFYTEVV EGPGGLPLGS EGKVLALVSG
GIDSPVAAWL LMRRGAHVDV LHCNLGGTVA LRHTLEVIKR LLAWSYGYNA RVIIGDCSPV
AKALRSGVRE ELWNIAFKRA LYRIGAEVAK TVRAAALVTG ESLGQVSSQT LQALAAAEMG
VGIPILRPLI GMDKDEITKL AQRIGTYEIS AKTPEYCAVF SRRPKKWATR EEIEEIDFAL
HDAVAEVASN VKVVRKWQLA EFIKTLSPPE DIEVETPPEG AVVVDLRDEE SYRKWHLPGA
VRADFDEVLS LVDKLGRDKT YVFYCYSGGL SLDVAESLRK LGIKAYSLRL RRGT
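
The sequences above are printed in space-separated blocks of 10 characters, so they need the whitespace removed before any analysis. A minimal sketch of that cleanup, plus a simple GC fraction calculation, is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only the letters A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = (
        "GTGGATGAGG TGGTTGTGGT TAGGCCTGGC "
        "GAGTTTACCA TTAAGAGGGG GGCCACCAGA"
    )
    seq = clean_sequence(first_line)
    print(len(seq))          # number of bases in this line
    print(seq[:3])           # first codon
    print(gc_fraction(seq))  # GC fraction of this line only
```

Running the same two functions over the full gene sequence should allow a comparison with the GC content field in the record. The first codon is GTG, which translation table 11 accepts as a start codon, consistent with the protein sequence beginning with M.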