Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0213 |
Symbol | |
ID | 6091617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 207295 |
End bp | 209886 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642487394 |
Product | CBS domain-containing protein |
Protein accession | YP_001738256 |
Protein GI | 170288018 |
COG category | [J] Translation, ribosomal structure and biogenesis [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase [COG0618] Exopolyphosphatase-related proteins [COG3448] CBS-domain-containing membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGTCA TCACCACACA CAGGTCACCC GATTTCGATG CCTTCGCTTC CTGTGTCGCG GCAAAGAAGC TGTTTGATGA CCACATTATA GTTCTGCCTT CCAATCCAGC GAGGAATCTT TCCGATTTCC TCAAGGTTTA CTCGGATCGG TTTGAATTCG TCTGGGATCA TGAGTTTGAA GGTGAGATCA CAGAACTTGT GATCGTCGAC GCACCTTCTC TGGACCGCAT ACCGGAGAGC ATCAGAGAGA GAAGCCAGGG GGCGAAAATA ACTGTTTACG ATCACCACGT AGATGAAAGT CCTTACGATG GAATGGTATC GAAGGTAGGA GCCACGATCA CGATACTCGT TGAGCTCATT CGGGAGAAAA ATATACCACT GGATCCCACC GAAGCCACTC TTTTCATGAT CGCTCTCTAC GACGACACGG GAAATCTCCT TTTTTCTTCA ACCACACCGC GGGATCTGGA GATAGCGAAA TTCCTTCTGG AGAACGGGGC AAATCTCGAT GAAGTGGCAC TTTACACAAG AGAAGAACTC ACCCCCAAGC AGATGGAACT GCTGGACAAA CTGATAGAGA ACGCACGTGA TTATGAAGTG AACGGTGTTC CGATAACGAT CTCGTTGATA GAGTGCGAGG ATTTCGTTGG GGGAATGGGG CTGATTGTGA GTAAAGCCTG GGAAATGATG GGAAAGGAAA CCTTCATAGC GATCGTGAAG ATGGGAAAGA AGATCTACGT TATTGGAAGA ACCGGTTCAC CAGATGTGGA TCTCGGCTCT CTCATGAAAG ACCTCGGGGG AGGCGGACAC ACGAGGGCAG CCTCTGCGAC TATCACCGGA AAAGAGATCG ACGAGGTTTT GAAAGAAGTT TTGAATAGAC TCCATGATCA CGTGGTGCCA CTCCTCCGCG CAAGAGATAT CATGTCCTCT CCCGTGAAGG TGGTCCTATC TAACATGACG ATAAAAGAAG TGGATAGGTT GATGAAGCAA ACCGGACACA GTGGATTTCC AGTCGTTGAA GGAAACAGGC TGGTAGGTAT TGTTACGAAG AAAGCGGTCG AAAAGGCGAT GAACCATGGT CTGGGAGACA GGCCTGTGAA ATCCATCATG TCGACGAACC TGGTGGTGGC GACCCCTGAT ACTTCTGTAA CCAGGTTAAG AGAACTCATG GTGGAGCACG CCATAGGACG AATCCCCATA CTGGAAAACG GTATTCTCGT GGGCATCGTG ACAAGAAGTG ACGTTCTGAG GGCGATATTC GGAAAACCCT TCAAAAAATA CGTAATGCCG GTGTTTCAGG CGAACGGACA GATATTCAGA GACGTTTCAA AGCTCCTCGT GGAACGGGTG GATCCGAAGA TTTTGAATCT TTTCAGACTC CTCGGGAAGT TCGGTGATGA GGTGAACATG CCCGTTTACG TTGTGGGAGG TTTCGTCCGC GATCTTCTAC TCGGTATAAA GAATCTCGAT GTAGACATCG TTGTTGAAGG CAACGCGCTG GAATTTGCCG AGTACGCTAA ACGTTTTCTA CCGGGAAAAC TGGTGAAACA CGACAAATTC ATGACCGCCT CTCTTTTTCT GAAGGGAGGC CTCAGAATAG ACATCGCAAC AGCAAGACTG GAGTACTACG AATCACCCGC CAAACTCCCC GATGTGGAGA TAAGTACGAT AAAGAAAGAT CTCTACAGGA GAGACTTCAC GATAAACGCG ATGGCTATAA AGCTGAATCC GAAAGATTTC GGATTGCTGA TCGATTTCTT TGGAGGATAC AGAGATCTGA AGGAAGGAGT AATACGGGTT CTTCACACCC TGAGTTTTGT AGACGATCCC ACAAGGATTC TACGTGCCAT TCGGTTTGAG CAACGTTTTG ACTTCAGAAT AGAAGAAACA ACAGAGAGGC TCCTGAAACA GGCCGTTGAA GAAGGTTACC TTGAGAGAAC AACTGGACCA CGTCTCAGGC AGGAACTGGA GAAAATACTC GAAGAGAAAA ACCCCCTGAA GTCGATCAGA AGGATGGCAC AGTTCGATGT GATAAAACAT CTGTTTCCAA AAACCTATTA CACACCTTCC ATGGACGGGA AGATGGAAAA TCTCTTCAGA AACATTCCGT GGGTGGAGGA GAACTTCGGA GAGGTTGACA AATTCTACGC GGTGCTCCAC GTGTTCCTTG AGTTCTACGA CGACGAGAGC TGGAAAGAAG TGAGGGATAG ATATTCTCTT CGCAGGGATT TGATAAATGA AATCAGGCAT GTAGAAAAGA GTGCCCCCGC TCTTTTAGAA ATGCTTTCAG AAAGGGTTCC TGCTTCCTTT GTTTATCCTC TCGTGAAGGG AGTTTCGAAC GAAACGATCT GTCACTTCTT AGCGTATCTC AGCGGTGAGA AAGAAGAGCT GTTCAAATCT TATCTTTTGA AGATAAAGAA CACAAAGCTC GAAAAGATAA ACGGGGAGTA TCTAATCAGA AAAGGAATAA CATCCGGTAA AATAATTGGA GAGGTTCTCG AGAAGATTCT CATGAAAAAA CTCGATGGAG ACACAAGAGA TGAAGAGGAG ATACTGGAGG AAGTCCTGGC ATCATTAGAA ACGGAGGGAT AA
|
Protein sequence | MRVITTHRSP DFDAFASCVA AKKLFDDHII VLPSNPARNL SDFLKVYSDR FEFVWDHEFE GEITELVIVD APSLDRIPES IRERSQGAKI TVYDHHVDES PYDGMVSKVG ATITILVELI REKNIPLDPT EATLFMIALY DDTGNLLFSS TTPRDLEIAK FLLENGANLD EVALYTREEL TPKQMELLDK LIENARDYEV NGVPITISLI ECEDFVGGMG LIVSKAWEMM GKETFIAIVK MGKKIYVIGR TGSPDVDLGS LMKDLGGGGH TRAASATITG KEIDEVLKEV LNRLHDHVVP LLRARDIMSS PVKVVLSNMT IKEVDRLMKQ TGHSGFPVVE GNRLVGIVTK KAVEKAMNHG LGDRPVKSIM STNLVVATPD TSVTRLRELM VEHAIGRIPI LENGILVGIV TRSDVLRAIF GKPFKKYVMP VFQANGQIFR DVSKLLVERV DPKILNLFRL LGKFGDEVNM PVYVVGGFVR DLLLGIKNLD VDIVVEGNAL EFAEYAKRFL PGKLVKHDKF MTASLFLKGG LRIDIATARL EYYESPAKLP DVEISTIKKD LYRRDFTINA MAIKLNPKDF GLLIDFFGGY RDLKEGVIRV LHTLSFVDDP TRILRAIRFE QRFDFRIEET TERLLKQAVE EGYLERTTGP RLRQELEKIL EEKNPLKSIR RMAQFDVIKH LFPKTYYTPS MDGKMENLFR NIPWVEENFG EVDKFYAVLH VFLEFYDDES WKEVRDRYSL RRDLINEIRH VEKSAPALLE MLSERVPASF VYPLVKGVSN ETICHFLAYL SGEKEELFKS YLLKIKNTKL EKINGEYLIR KGITSGKIIG EVLEKILMKK LDGDTRDEEE ILEEVLASLE TEG
|
| |