Gene Teth514_2024 details

Gene Information

Locus tag: Teth514_2024
Symbol:
ID: 5876497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 2033480
End bp: 2034622
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 33%
IMG OID: 641542369
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_001663632
Protein GI: 167040647
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
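
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the only reading that reproduces the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2033480
end_bp = 2034622

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1143
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 380
```

Both results match the Gene Length (1143 bp) and Protein Length (380 aa) fields.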


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCAAGACA TATTATTGAT TAAATATGGA GAATTAGCTT TAAAAGGAGA TAATAGGTCT 
TTTTTTGAAA ATAAATTGAT AAAAAATATA AAACATGCTC TTTCTGACTT TAAGGAAGTT
AAAGTTGAAA AAACTCATGG CAGAATTTAT GTAGAATGTG ATGGAGATAT TGAAGAGGTA
ATAGAAAGAT TAAAAAAAGT CTTTGGTATT GTAGGAATAA CAAAAGCTAA AAAAACCGAT
TTAAACTTGG ATGAAATATT TAAAGCTGCA GTAGAACTTA TGAAAGGACA CGAAGGAAAG
ACTTTTAAAG TAGAGACTAA GAGGCCAAAT AAGTCTTTTC CTTATAACAG CATGGAGGTC
AGCCGCAGAG TAGGAGCAGC AGTATTGAAA AATGTCAAAA ACTTAAAAGT AGATGTTCAT
AATCCTGATG TGCTTTTAAA TGTAGAGATA AGAGAAATGG CTTTTGTATA CGCGGGAGTG
ATTGAGGGAA TAGGAGGACT TCCTCTTGGG ACAAACGGTA AAGCGACTGT ACTTTTGTCA
GGAGGAATTG ACAGTCCTGT AGCTGCTTGG ATGATGATGA AAAGAGGCGT AGAAGTAGAA
GCAGTTTATT TTCACAGCCC TCCTTATACT TCTGAAAGGG CTAAAGACAA AGTTGTAGAT
TTGTGCAAAG TCCTTTCTCA ATATGGACAA AGGATAAAAT TACACGTAGT TCACTTTACT
GATTTGCAAT TAGAAATTTA TGAGAAATGT CCACCTAAAT TTACTACTAT AATTATGAGA
AGAATGATGA TGAAGATAGC AGAAAAAATT GCTCAAAAAA ATGGTTCTAT GGCTCTAATC
ACAGGGGAAA GTTTAGGACA AGTTGCAAGC CAAACGATTG AAAGTTTATA TGTAACCAAT
GCTTCTGTCT CTATGCCAAT ATTTAGACCT CTTATTGGGA TGGATAAGAC AGAGATTATA
GATTTAGCTC AAAAGATTAG TACGTTTGAG ATCTCTATAA GACCCTATGA AGATTGTTGC
ACTATCTTTG TGCCAAAACA TCCTGCTACA AAGCCAAAAT TAGAAAAAGT AATAGAAGCA
GAACAAAAAA TGGAGTATCA AAAATACATT GATAATTTTG AAGAAGAGGT TATAGAAGTT
TAA
 
Protein sequence
MQDILLIKYG ELALKGDNRS FFENKLIKNI KHALSDFKEV KVEKTHGRIY VECDGDIEEV 
IERLKKVFGI VGITKAKKTD LNLDEIFKAA VELMKGHEGK TFKVETKRPN KSFPYNSMEV
SRRVGAAVLK NVKNLKVDVH NPDVLLNVEI REMAFVYAGV IEGIGGLPLG TNGKATVLLS
GGIDSPVAAW MMMKRGVEVE AVYFHSPPYT SERAKDKVVD LCKVLSQYGQ RIKLHVVHFT
DLQLEIYEKC PPKFTTIIMR RMMMKIAEKI AQKNGSMALI TGESLGQVAS QTIESLYVTN
ASVSMPIFRP LIGMDKTEII DLAQKISTFE ISIRPYEDCC TIFVPKHPAT KPKLEKVIEA
EQKMEYQKYI DNFEEEVIEV
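
The gene and protein sequences above are related through translation table 11, and the GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose codon-to-amino-acid assignments table 11 shares. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG, so that does not matter here.

```python
# Standard codon assignments listed in TCAG order. NCBI translation
# table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases, 20 codons).
first_line = "ATGCAAGACA TATTATTGAT TAAATATGGA GAATTAGCTT TAAAAGGAGA TAATAGGTCT"
print(translate(first_line))  # MQDILLIKYGELALKGDNRS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.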