Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1882 |
Symbol | |
ID | 5055695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1687623 |
End bp | 1689047 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469428 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001154085 |
Protein GI | 145592083 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.379133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGAGG TGGTTGTGGT TAGGCCTGGC GAGTTTACCA TTAAGAGGGG GGCCACCAGA GCGGAGATGG AAAAACTATT GCTAAAAGCG GCTAGAGAAG CCGCAGAGGA GTGCGGCGGG GCCAAATTCG AGAAGGAGCC CGGTAGGATC TATGCGCGTG GCGACACCCA GTGTCTAAAG AAGGCACTAG CGAGAGTCTT CGGCGTGAAG TCGGTTAGTC CGGCCTATGT CATGAAATTT GAGGGTGTTG CAGACATCGC CAGGGAGGCG GCGAGGCTAT GGGTGGGTCT GGCGGCTGGG AGGAGGTTCG CGGTGAGGGT GCACAGGGTG GGGAACCACC CCTTTACTTC TAGAGACGTG GCGGCGGCGG TGGGCTCCGC CTTGGTGGCC GCCGGCGCGA GGGTCGATCT TGAAAACCCA GAGGTGGAGT TTTTTGTAGA GGTGAGGGGG GACAGGGCCT ATTTCTACAC TGAGGTGGTG GAGGGTCCGG GCGGGCTTCC CTTGGGATCT GAGGGCAAGG TGCTGGCCTT AGTGTCGGGA GGTATAGACT CGCCGGTAGC GGCCTGGTTG TTAATGAGGA GGGGGGCTCA TGTTGATGTT CTCCACTGCA ACCTGGGGGG CACAGTGGCG CTCAGGCATA CGCTCGAGGT GATCAAAAGA CTTCTGGCGT GGTCGTACGG CTACAACGCC CGTGTGATAA TAGGTGACTG TAGCCCTGTG GCAAAGGCGT TGCGTAGTGG AGTGAGAGAG GAGTTGTGGA ATATCGCTTT CAAAAGGGCT CTCTACCGCA TAGGCGCTGA GGTTGCAAAA ACTGTACGTG CCGCCGCCCT GGTCACCGGG GAATCCCTTG GCCAAGTCTC GTCACAGACG TTGCAGGCCT TGGCCGCTGC CGAGATGGGA GTGGGGATAC CCATACTCCG GCCGTTGATA GGCATGGACA AAGATGAGAT AACTAAACTG GCGCAGAGGA TAGGGACTTA CGAAATCTCC GCAAAAACGC CTGAATACTG CGCGGTTTTC AGCAGAAGGC CTAAGAAGTG GGCTACAAGA GAGGAGATAG AGGAGATAGA CTTTGCACTG CACGATGCGG TGGCTGAGGT GGCGAGCAAC GTGAAGGTGG TGAGGAAGTG GCAACTCGCC GAATTTATCA AAACCCTATC ACCGCCGGAG GACATTGAAG TGGAGACACC GCCGGAGGGG GCCGTGGTGG TTGACCTGAG AGATGAGGAA TCGTACAGAA AATGGCACCT CCCAGGCGCG GTTAGGGCCG ATTTCGACGA GGTGCTCTCG CTGGTGGATA AGCTAGGCAG GGATAAGACC TACGTCTTCT ACTGCTACAG CGGAGGCCTC AGTCTCGACG TCGCAGAAAG TTTGCGCAAG CTTGGCATTA AGGCATACTC GCTGAGGCTC CGTCGCGGCA CCTAG
|
Protein sequence | MDEVVVVRPG EFTIKRGATR AEMEKLLLKA AREAAEECGG AKFEKEPGRI YARGDTQCLK KALARVFGVK SVSPAYVMKF EGVADIAREA ARLWVGLAAG RRFAVRVHRV GNHPFTSRDV AAAVGSALVA AGARVDLENP EVEFFVEVRG DRAYFYTEVV EGPGGLPLGS EGKVLALVSG GIDSPVAAWL LMRRGAHVDV LHCNLGGTVA LRHTLEVIKR LLAWSYGYNA RVIIGDCSPV AKALRSGVRE ELWNIAFKRA LYRIGAEVAK TVRAAALVTG ESLGQVSSQT LQALAAAEMG VGIPILRPLI GMDKDEITKL AQRIGTYEIS AKTPEYCAVF SRRPKKWATR EEIEEIDFAL HDAVAEVASN VKVVRKWQLA EFIKTLSPPE DIEVETPPEG AVVVDLRDEE SYRKWHLPGA VRADFDEVLS LVDKLGRDKT YVFYCYSGGL SLDVAESLRK LGIKAYSLRL RRGT
|
| |