Gene TRQ2_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1140 
Symbol 
ID6092573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1182503 
End bp1183660 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content46% 
IMG OID642488334 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_001739168 
Protein GI170288930 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000091055 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAGTTT ACATAGTGAG ATATTCCGAG ATAGGTCTCA AAGGAAAGAA CAGAAAAGAT 
TTTGAAGAAG CCCTCAAAAG AAACATCGAG AGAGTAACCG GAATGAAGGT GAAGAGACAG
TGGGGAAGAT TTCTCATTCC AATAGATGAA AACGTAACAC TCGATGACAA GCTGAAGAAA
ATCTTTGGAA TTCAGAATTT CAGCAAAGGA TTTCTGGTGA GTCACGATTT CGAGGAAGTG
AAGAAATATT CACTGATCGC GGTGAAAGAA AAGCTGGAAA AAGGAAATTA CAGAACTTTC
AAGGTGCAGG CCAAAAAAGC TTATAAGGAA TACAAAAAAA GTATATACGA AATAAACAGT
GAGCTCGGTG CCTTGATACT CAAAAACTTC AAGGAACTTT CCGTTGATGT ACACAATCCG
GATTTTGTTC TCGGGGTGGA AGTGAGACCT GAAGGGGTTC TGATTTTCAC AGACAGGGTG
GAGTGCTACG GTGGACTTCC CGTGGGAACG GGAGGAAAAG CGGTTCTTCT TCTCTCTGGA
GGAATAGACA GTCCTGTGGC AGGCTGGTAC GCACTGAAAA GAGGAGTTCT CATAGAGTCC
GTCACGTTCG TGTCTCCTCC TTTCACATCG GAGGGAGCCG TGGAAAAAGT GAGAGACATA
TTGAGAGTTC TCAGGGAGTT CAGTGGGGGT CATCCTTTGA GATTGCACAT TGTGAATCTC
ACAAAGCTGC AGCTTGAGGT CAAAAAGAAC GTACCGGACA AATACTCGCT GATCATGTAC
AGAAGGTCCA TGTTCAGAAT AGCGGAAAAA ATAGCGGAGG AAACCGGTGC GGTTGCTTTT
TACACGGGGG AGAACATAGG ACAGGTGGCG AGCCAGACCC TGGAAAACCT CTGGTCTATA
GAGAGCGTGA CCACAAGACC CGTGATAAGG CCTCTTTCTG GTTTCGACAA GACAGAGATC
GTCGAAAAGG CAAAAGAGAT CGGAACCTAC GAGATCTCTA TAAAGCCTTA CCAGGACAGC
TGCGTCTTCT TCGCTCCGAA AAATCCTGCA ACGAGATCTC ATCCCTCGAT CCTCGAGAAG
CTGGAACAGC AGGTTCCAGA TCTTCCCGTT CTCGAAGAAG AAGCGTTCAC CTTCAGAAAA
GTCGAGGTGA TAGAGTGA
 
Protein sequence
MRVYIVRYSE IGLKGKNRKD FEEALKRNIE RVTGMKVKRQ WGRFLIPIDE NVTLDDKLKK 
IFGIQNFSKG FLVSHDFEEV KKYSLIAVKE KLEKGNYRTF KVQAKKAYKE YKKSIYEINS
ELGALILKNF KELSVDVHNP DFVLGVEVRP EGVLIFTDRV ECYGGLPVGT GGKAVLLLSG
GIDSPVAGWY ALKRGVLIES VTFVSPPFTS EGAVEKVRDI LRVLREFSGG HPLRLHIVNL
TKLQLEVKKN VPDKYSLIMY RRSMFRIAEK IAEETGAVAF YTGENIGQVA SQTLENLWSI
ESVTTRPVIR PLSGFDKTEI VEKAKEIGTY EISIKPYQDS CVFFAPKNPA TRSHPSILEK
LEQQVPDLPV LEEEAFTFRK VEVIE