Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1804 |
Symbol | |
ID | 6093255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1821347 |
End bp | 1823101 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642489001 |
Product | hypothetical protein |
Protein accession | YP_001739818 |
Protein GI | 170289580 |
COG category | [S] Function unknown |
COG ID | [COG3472] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACAATC TTTTGTTTAA GAAAGTAGAT TACAGTGTGG GTGGACTGCT GGAGAGCATC GATAGCGGTG AAATAGGCCT TCCTGATATT CAACGGCCCT TTGTCTGGGA TACAACCCGT GTGCGTGATC TGTTTGATTC TATGTACCGT GGTTATCCCA TAGGAACTCT TCTTTTCTGG GAAAATGGTT TCCCTGGTGA ACACCGCACT ATCGGGACAG GCCCCAAGAA GAAAGTGCCT CGCCTGCTCG TTGTAGATGG TCAACAGCGA CTCACTGCCC TTTACTCCGT AATGAAGGGT GTTCCCATCG TAGACAAGAA CTTTCGACAA CGGCGCTTGA GAATAGCCTT CAATCCGTTG GAAGAAAAAT TCGAGGTCAC CAATACATCT ATTGAACGAG ATCCTACCTG GATATCTGAT ATCAGTATCC TGTGGCAGGA AGGTTTCGCG CTGTACGATT TTATCTCCAG TTTCATGAAG AGATTGGAAG AACGGCGTGG TCTAACCGAA GAGGAACGGC AGCGGATTCC TCGATCCATT CAAAAGCTGG TCAACCTCGT CAATTATCCA ATGACCGCTC TGGAAATTTC TGCCAGCGCC ACAGAAGAAC AGGTTTCTGA GATCTTTGTT CGCATAAACA GCAGAGGTCG TACACTCAAT CAGGCCGATT TCATCCTGAC GTTGATGTCC GTTTTCTGGG ATGAAGGGCG AAAACAACTG GAAGAATTCT GCCGGCGGGC TAAGAATCCA CCTTCGGATA ATCGTCCTTC ACCCTATAAC CCATACTTCA AACCTCAACC AGATCAGCTA CTGAGAGTCG ATGTGGCACT GGCTTTTCGT CGCGCCCGGC TGGAATACGT GTATTCCATT CTGCGAGGCA AAGATCTTCA GACCGGTGAA TTCTCTCCTG AACGTCGTGA TGCCCAGTTC GCTCTCCTGA GAAAAGCACA GGACGAAGTT CTCAACCTGC AAAACTGGCA CGACTTTTTG AAAGTTATAA AGCGTGCCGG ATATATTCAT CCCAGTCTTA TTACCTCTGA AATGGCGCTG GTTTACACTT ATTCCCTCTG GCTCATTGGC AAACAAGACT TTGGCCTGGA CCAGCACACT CTCCGCAATC TGATGGCACG ATGGTTCTTC ATGAGTTCAC TCACCAGCCG CTACTCCTCT TCCCCTGAAA CTCGTATGGA ACAGGATCTC GCATTGATAC GAGGCTGTAC CAATTCAGAA GAGTTCATTC GGACACTGGA ACAGGAAATA TCAGCGGTTC TGACAAATGA TTACTGGACT GTCACGCTCC CCAACGAACT CGCCACAGCT TCCGCTCGCA GTCCGGGACA ATTCGCCTTC TTTGCAGCTC TCTGTTTGCT TGATGCCCCG GTACTCTACT CTTCTATGAA GGTTCGCGAC CTGCTCGATC CCACATCACA ATCGGGAAGG TCAGCCCTGG AGAGACACCA TCTCTTTCCA CGCAAATACC TTCAAAAACT GGGCATCAAA GATAAACACG ACATAAACCA GGTTGCCAAT TTCGCACTGG TAGAATGGTA CGACAACGTT GATATAGGAG ATCGCCCTCC TTCAGATTAC GCGCCCGAGT ATGAAAGACG TTTTCCACCC GACAAACTTA AAGAAATGTA CTGGTACCAT GCACTGCCCG AAGGCTGGTA CAACATGGAT TACTGGACAT TTTTGGAGGA ACGCCGACGT CGAATGGCAG AAATCATTCG AAAAGGGTTT GAGAGTTTGA AGTGA
|
Protein sequence | MNNLLFKKVD YSVGGLLESI DSGEIGLPDI QRPFVWDTTR VRDLFDSMYR GYPIGTLLFW ENGFPGEHRT IGTGPKKKVP RLLVVDGQQR LTALYSVMKG VPIVDKNFRQ RRLRIAFNPL EEKFEVTNTS IERDPTWISD ISILWQEGFA LYDFISSFMK RLEERRGLTE EERQRIPRSI QKLVNLVNYP MTALEISASA TEEQVSEIFV RINSRGRTLN QADFILTLMS VFWDEGRKQL EEFCRRAKNP PSDNRPSPYN PYFKPQPDQL LRVDVALAFR RARLEYVYSI LRGKDLQTGE FSPERRDAQF ALLRKAQDEV LNLQNWHDFL KVIKRAGYIH PSLITSEMAL VYTYSLWLIG KQDFGLDQHT LRNLMARWFF MSSLTSRYSS SPETRMEQDL ALIRGCTNSE EFIRTLEQEI SAVLTNDYWT VTLPNELATA SARSPGQFAF FAALCLLDAP VLYSSMKVRD LLDPTSQSGR SALERHHLFP RKYLQKLGIK DKHDINQVAN FALVEWYDNV DIGDRPPSDY APEYERRFPP DKLKEMYWYH ALPEGWYNMD YWTFLEERRR RMAEIIRKGF ESLK
|
| |