Gene Information |
Locus tag | TRQ2_1697 |
Symbol | |
ID | 6093147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1718939 |
End bp | 1719988 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642488897 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001739714 |
Protein GI | 170289476 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
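The length fields in this record can be re-derived from the coordinates. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Coordinates taken from the record for TRQ2_1697.
length_bp = gene_length(1718939, 1719988)
print(length_bp, protein_length(length_bp))  # 1050 349
```

Both values agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields listed for this locus.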
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000546308 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTT TCGTGAAAAC CTTTGGCGGG ACAAGAGTTA TCAAGAACAA TGATATCGTG AACGCAAGAG ACTGGCCATC GCAAAAAGCG TTTGCCCTGT TCAGGTATCT CATTTTCAGA AGGAACGAAG AAGTGTCCGT CGAAGAGATC TACAACCTCT TTTGGGAAGA CATGGACGAC ACATTCGCGA AATCAAATCT GAACACCACA CTCCATATTA TAAGGAGAAC CACCGGAATA ACGAGCGAAG AACTCTTTGT GAAAGGAGAT CTCTGCTGTT TCTTCCCGGG AGACAAAATC ACCATAGACG CAGATATTTT CGAAGAGTGC CACAGAAATC TGATGAAAGC CACATCGGAT ATTGAACATG AAAAACTTCT TAAGAGGATG TTCGAGATTT ATGCAGGACC GTTTCTAATC GAAGACATTT TCGCAGAATG GGTGCAGGAA ATCAGAGAAA TCTACGAATC GTGGTACTCA GATGTTTTAA AAGAGCTCTT CAAATTGTAT CTGGCAAAAA AAGATTACGA CGCCGCCCTC GAGATGGTAA ACGCTTATTT TCAGAGAGAG CCTTACGACG AGGATATGTA CTACAAAGCA ATAGAGGTCC TTCTGAAAAA GGGTGATATC ACAAGGGCAA AACGCGTATA CGACAAGCTC TCGAGTCATC TTATGGAGAT AGGGATCAAA CCTCGATTGA AATTCGATGA TTTTCTCTCC AAAAGAGGCT CAGAATTCAT GCTGAACGGT AACAAGGCAG TGGTGGTTGA TGAAAAGCTT TTTGAAAGTT TCCTCTTTCT GGAGAGTCGA AGGAGAGAAA AGTCTTTTGT CCTCGTCGAG GTGAAACTGA TGAATAAGAG TATCAGCACT GAAGATGTCT CCCAAAGGGT AGCATCTCAT CTTCGAAAGG GAGACGTGAT GACCTTCTCA GGTGAAACTA TCCGAATTCT CTTCCACTGT CCCGAACAGC GTCGTCCAAC AATGGAAAAA CGTGTAGCAG ACGTCCTCGA GAAAGTTGGA GTGAAGAAAG GTCAGTACGA AATTTCCTGA
|
Protein sequence | MSIFVKTFGG TRVIKNNDIV NARDWPSQKA FALFRYLIFR RNEEVSVEEI YNLFWEDMDD TFAKSNLNTT LHIIRRTTGI TSEELFVKGD LCCFFPGDKI TIDADIFEEC HRNLMKATSD IEHEKLLKRM FEIYAGPFLI EDIFAEWVQE IREIYESWYS DVLKELFKLY LAKKDYDAAL EMVNAYFQRE PYDEDMYYKA IEVLLKKGDI TRAKRVYDKL SSHLMEIGIK PRLKFDDFLS KRGSEFMLNG NKAVVVDEKL FESFLFLESR RREKSFVLVE VKLMNKSIST EDVSQRVASH LRKGDVMTFS GETIRILFHC PEQRRPTMEK RVADVLEKVG VKKGQYEIS
|
| |
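The gene and protein sequences can be cross-checked against each other. The sketch below is illustrative only and makes two assumptions: the space-separated 10-character blocks are removed before processing, and codons are mapped with the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helpers `clean`, `gc_content`, and `translate` are hypothetical names, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGAGTATTT TCGTGAAAAC CTTTGGCGGG"
print(translate(first_blocks))  # MSIFVKTFGG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.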