Gene Tneu_1312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1312 
Symbol 
ID6166297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1171616 
End bp1173043 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content64% 
IMG OID641668467 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_001794685 
Protein GI171185766 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00281014 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.313151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGAGGG TTCTCCTCGT GAGGGTGGGG GAGCTCACCG TAAAGAGGGG CTGGACCCGC 
GTCGAGATGG AGAGGCTGTT GCTACGGGCG GCGAAGGAGG CGGCTGGCGA ATGCGGAGGG
GCGAGGTTCG CGAGGGAGCC GGGGAGGATA TACGCATATG GCGACGTCAA TTGCCTCAAA
AAGGCGCTGT CTAGAGTCTT CGGGGTTAAG TCGGTGAGTC CTGCGTACGT CCTCCAGTTT
AAGGATCTGG CGGAGGTGGC GGCCGCCGCC GCGGAGCTCT GGGGCGGGGA GGTGGCCGGG
AGGCGGTTCG CGGTTAGGGT CCACAGGGTG GGGACGCACG GCTTCACCTC AAGAGACGTG
GCCGCCGCCG TGGGCGCGGC GTTGGTTAAA GCAGGAGGTT CGGTGGATCT CGAGACCCCG
GAGGTGGAGC TTTATGTGGA GGTGCGGGGG GACCGGGCCT TTCTATATAG GGAGGTGCTG
GAGGGGCCGG GGGGCCTCCC CCTGGGGTCT GAGGGGAAGG TGCTGGCGTT GGTCTCCGGC
GGCATCGACT CGCCGGTTGC AGCGTGGATG CTCATGCGTA GGGGGGCACA CGTGGACGTC
TTCTACTGCC ACCTCGGCGG GACCTACGCG CTAAGGCTCG TTGTGGAGGT GATAAAGAGG
CTACTGTCTT GGTCCTACGG CTACAACGCG AGGGTGGCCG TGGCGGACTG CTCCCCAGTG
GTGCGGGCCT TACGGAGGGG GGTGAGGGAG GAGCTCTGGA ACATAGCCTT TAAGAGGGCG
CTCTACCTCG CCGCTTCCAA GGTGGCTGAG GCCGTGAAGG CGGCCGCCTT GGTCACGGGG
GAGTCGCTTG GCCAGGTGTC TTCGCAGACG TTGCAGGCGC TTGCGGCGGC TGAACGCGGG
CTCGATATGC CCATCTTTAG GCCTCTGGTG GGCATGGACA AGGACGAGAT CGTGCATCTC
GCCGAGAGGA TCGGGACGTA CGAGGTTTCG GCTAGGCTTC CCGAGTACTG CGCCCTCTTG
AGCAGAAGGC CTAGGAAGTG GGCAACGCGT CAGGAGGTGG AGGAGATAGA TCTGGCGATC
CACGACGCCG TGGCGGAGGT CGTAAACGGC GTTAAGGTAA TTAGGAAGAG CGAGCTGGAA
AGCTTCGCGT CTTCTCTAAA GCCGCCGCAC GACCTAGAGC TGGAGACCCC GCCCCCGGAC
TCCGTGTTGG TTGATCTACG AAGCGCGGAG GACTACAGAA GGTGGCACCT CCCAGGCGCT
CTCAGGGCGG ACCCAGACGA CGTTTTAACG CTGGTCGACC GCCTAGGCAG AGACAAGACC
TACGTCTTCT ACTGCTACGG AGGAGGCACA AGCTTAGACG TGGCGGAGAG CCTCCGGAGG
CTTGGGATCA AGGCCTACTC CCTCAAGCTT AAACCGCAGG GCGGTTGA
 
Protein sequence
MERVLLVRVG ELTVKRGWTR VEMERLLLRA AKEAAGECGG ARFAREPGRI YAYGDVNCLK 
KALSRVFGVK SVSPAYVLQF KDLAEVAAAA AELWGGEVAG RRFAVRVHRV GTHGFTSRDV
AAAVGAALVK AGGSVDLETP EVELYVEVRG DRAFLYREVL EGPGGLPLGS EGKVLALVSG
GIDSPVAAWM LMRRGAHVDV FYCHLGGTYA LRLVVEVIKR LLSWSYGYNA RVAVADCSPV
VRALRRGVRE ELWNIAFKRA LYLAASKVAE AVKAAALVTG ESLGQVSSQT LQALAAAERG
LDMPIFRPLV GMDKDEIVHL AERIGTYEVS ARLPEYCALL SRRPRKWATR QEVEEIDLAI
HDAVAEVVNG VKVIRKSELE SFASSLKPPH DLELETPPPD SVLVDLRSAE DYRRWHLPGA
LRADPDDVLT LVDRLGRDKT YVFYCYGGGT SLDVAESLRR LGIKAYSLKL KPQGG