Gene Information |
Locus tag | Tneu_1312 |
Symbol | |
ID | 6166297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1171616 |
End bp | 1173043 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641668467 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001794685 |
Protein GI | 171185766 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
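The start, end, gene length, and protein length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a single terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1171616, 1173043)
print(length, protein_length(length))
```

With the coordinates from this record, the sketch reproduces the listed 1428 bp and 475 aa.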
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00281014 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.313151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGAGAGGG TTCTCCTCGT GAGGGTGGGG GAGCTCACCG TAAAGAGGGG CTGGACCCGC GTCGAGATGG AGAGGCTGTT GCTACGGGCG GCGAAGGAGG CGGCTGGCGA ATGCGGAGGG GCGAGGTTCG CGAGGGAGCC GGGGAGGATA TACGCATATG GCGACGTCAA TTGCCTCAAA AAGGCGCTGT CTAGAGTCTT CGGGGTTAAG TCGGTGAGTC CTGCGTACGT CCTCCAGTTT AAGGATCTGG CGGAGGTGGC GGCCGCCGCC GCGGAGCTCT GGGGCGGGGA GGTGGCCGGG AGGCGGTTCG CGGTTAGGGT CCACAGGGTG GGGACGCACG GCTTCACCTC AAGAGACGTG GCCGCCGCCG TGGGCGCGGC GTTGGTTAAA GCAGGAGGTT CGGTGGATCT CGAGACCCCG GAGGTGGAGC TTTATGTGGA GGTGCGGGGG GACCGGGCCT TTCTATATAG GGAGGTGCTG GAGGGGCCGG GGGGCCTCCC CCTGGGGTCT GAGGGGAAGG TGCTGGCGTT GGTCTCCGGC GGCATCGACT CGCCGGTTGC AGCGTGGATG CTCATGCGTA GGGGGGCACA CGTGGACGTC TTCTACTGCC ACCTCGGCGG GACCTACGCG CTAAGGCTCG TTGTGGAGGT GATAAAGAGG CTACTGTCTT GGTCCTACGG CTACAACGCG AGGGTGGCCG TGGCGGACTG CTCCCCAGTG GTGCGGGCCT TACGGAGGGG GGTGAGGGAG GAGCTCTGGA ACATAGCCTT TAAGAGGGCG CTCTACCTCG CCGCTTCCAA GGTGGCTGAG GCCGTGAAGG CGGCCGCCTT GGTCACGGGG GAGTCGCTTG GCCAGGTGTC TTCGCAGACG TTGCAGGCGC TTGCGGCGGC TGAACGCGGG CTCGATATGC CCATCTTTAG GCCTCTGGTG GGCATGGACA AGGACGAGAT CGTGCATCTC GCCGAGAGGA TCGGGACGTA CGAGGTTTCG GCTAGGCTTC CCGAGTACTG CGCCCTCTTG AGCAGAAGGC CTAGGAAGTG GGCAACGCGT CAGGAGGTGG AGGAGATAGA TCTGGCGATC CACGACGCCG TGGCGGAGGT CGTAAACGGC GTTAAGGTAA TTAGGAAGAG CGAGCTGGAA AGCTTCGCGT CTTCTCTAAA GCCGCCGCAC GACCTAGAGC TGGAGACCCC GCCCCCGGAC TCCGTGTTGG TTGATCTACG AAGCGCGGAG GACTACAGAA GGTGGCACCT CCCAGGCGCT CTCAGGGCGG ACCCAGACGA CGTTTTAACG CTGGTCGACC GCCTAGGCAG AGACAAGACC TACGTCTTCT ACTGCTACGG AGGAGGCACA AGCTTAGACG TGGCGGAGAG CCTCCGGAGG CTTGGGATCA AGGCCTACTC CCTCAAGCTT AAACCGCAGG GCGGTTGA
Protein sequence | MERVLLVRVG ELTVKRGWTR VEMERLLLRA AKEAAGECGG ARFAREPGRI YAYGDVNCLK KALSRVFGVK SVSPAYVLQF KDLAEVAAAA AELWGGEVAG RRFAVRVHRV GTHGFTSRDV AAAVGAALVK AGGSVDLETP EVELYVEVRG DRAFLYREVL EGPGGLPLGS EGKVLALVSG GIDSPVAAWM LMRRGAHVDV FYCHLGGTYA LRLVVEVIKR LLSWSYGYNA RVAVADCSPV VRALRRGVRE ELWNIAFKRA LYLAASKVAE AVKAAALVTG ESLGQVSSQT LQALAAAERG LDMPIFRPLV GMDKDEIVHL AERIGTYEVS ARLPEYCALL SRRPRKWATR QEVEEIDLAI HDAVAEVVNG VKVIRKSELE SFASSLKPPH DLELETPPPD SVLVDLRSAE DYRRWHLPGA LRADPDDVLT LVDRLGRDKT YVFYCYGGGT SLDVAESLRR LGIKAYSLKL KPQGG
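As a sanity check on the two sequences above, the following sketch translates a nucleotide fragment using the standard codon assignments (which translation table 11 shares) and computes a GC fraction. The helper names `clean`, `translate`, and `gc_content` are illustrative. The start-codon set is an assumption covering only a subset of the starts that table 11 permits, and a start codon in the first position is read as methionine.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # an initiating GTG/TTG is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 18 nucleotides of the gene sequence in this record.
print(translate("GTGGAGAGGG TTCTCCTCGT GAGGGTGGGG"))
print(translate("AAACCGCAGG GCGGTTGA", is_cds_start=False))
```

The two fragments translate to MERVLLVRVG and KPQGG, matching the first and last residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.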