Gene Information |
Locus tag | TRQ2_1140 |
Symbol | |
ID | 6092573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1182503 |
End bp | 1183660 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488334 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001739168 |
Protein GI | 170288930 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
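The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state outright: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the Gene Information table for TRQ2_1140.
    length = gene_length_bp(1182503, 1183660)
    print(length, protein_length_aa(length))
```

Under those assumptions the result agrees with the listed Gene Length (1158 bp) and Protein Length (385 aa).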
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000091055 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAGTTT ACATAGTGAG ATATTCCGAG ATAGGTCTCA AAGGAAAGAA CAGAAAAGAT TTTGAAGAAG CCCTCAAAAG AAACATCGAG AGAGTAACCG GAATGAAGGT GAAGAGACAG TGGGGAAGAT TTCTCATTCC AATAGATGAA AACGTAACAC TCGATGACAA GCTGAAGAAA ATCTTTGGAA TTCAGAATTT CAGCAAAGGA TTTCTGGTGA GTCACGATTT CGAGGAAGTG AAGAAATATT CACTGATCGC GGTGAAAGAA AAGCTGGAAA AAGGAAATTA CAGAACTTTC AAGGTGCAGG CCAAAAAAGC TTATAAGGAA TACAAAAAAA GTATATACGA AATAAACAGT GAGCTCGGTG CCTTGATACT CAAAAACTTC AAGGAACTTT CCGTTGATGT ACACAATCCG GATTTTGTTC TCGGGGTGGA AGTGAGACCT GAAGGGGTTC TGATTTTCAC AGACAGGGTG GAGTGCTACG GTGGACTTCC CGTGGGAACG GGAGGAAAAG CGGTTCTTCT TCTCTCTGGA GGAATAGACA GTCCTGTGGC AGGCTGGTAC GCACTGAAAA GAGGAGTTCT CATAGAGTCC GTCACGTTCG TGTCTCCTCC TTTCACATCG GAGGGAGCCG TGGAAAAAGT GAGAGACATA TTGAGAGTTC TCAGGGAGTT CAGTGGGGGT CATCCTTTGA GATTGCACAT TGTGAATCTC ACAAAGCTGC AGCTTGAGGT CAAAAAGAAC GTACCGGACA AATACTCGCT GATCATGTAC AGAAGGTCCA TGTTCAGAAT AGCGGAAAAA ATAGCGGAGG AAACCGGTGC GGTTGCTTTT TACACGGGGG AGAACATAGG ACAGGTGGCG AGCCAGACCC TGGAAAACCT CTGGTCTATA GAGAGCGTGA CCACAAGACC CGTGATAAGG CCTCTTTCTG GTTTCGACAA GACAGAGATC GTCGAAAAGG CAAAAGAGAT CGGAACCTAC GAGATCTCTA TAAAGCCTTA CCAGGACAGC TGCGTCTTCT TCGCTCCGAA AAATCCTGCA ACGAGATCTC ATCCCTCGAT CCTCGAGAAG CTGGAACAGC AGGTTCCAGA TCTTCCCGTT CTCGAAGAAG AAGCGTTCAC CTTCAGAAAA GTCGAGGTGA TAGAGTGA
|
Protein sequence | MRVYIVRYSE IGLKGKNRKD FEEALKRNIE RVTGMKVKRQ WGRFLIPIDE NVTLDDKLKK IFGIQNFSKG FLVSHDFEEV KKYSLIAVKE KLEKGNYRTF KVQAKKAYKE YKKSIYEINS ELGALILKNF KELSVDVHNP DFVLGVEVRP EGVLIFTDRV ECYGGLPVGT GGKAVLLLSG GIDSPVAGWY ALKRGVLIES VTFVSPPFTS EGAVEKVRDI LRVLREFSGG HPLRLHIVNL TKLQLEVKKN VPDKYSLIMY RRSMFRIAEK IAEETGAVAF YTGENIGQVA SQTLENLWSI ESVTTRPVIR PLSGFDKTEI VEKAKEIGTY EISIKPYQDS CVFFAPKNPA TRSHPSILEK LEQQVPDLPV LEEEAFTFRK VEVIE
|
| |
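The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a field could be cleaned up and summarised, the helpers below strip the whitespace and compute a GC percentage. The names are hypothetical, and this simple count is not necessarily the method the database used for its rounded GC content value.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo input;
    # paste the full Gene sequence field to summarise the whole gene.
    fragment = "TTGAGAGTTT ACATAGTGAG"
    print(len(clean_sequence(fragment)), round(gc_percent(fragment), 1))
```

Running `len(clean_sequence(...))` on the full Gene sequence field is a quick way to confirm that the pasted text matches the listed gene length.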