Gene Information |
Locus tag | TRQ2_0510 |
Symbol | |
ID | 6091925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 510154 |
End bp | 512121 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642487697 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738549 |
Protein GI | 170288311 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
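The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 510154
end_bp = 512121

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the sketch gives 1968 bp and 655 aa, matching the Gene Length and Protein Length fields.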
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00108617 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACGGT TTGTGTTGGT CCTCGTTGCA CTTCTTGTTT TTCAAATTCT TGTGATGGGT TTTGGCCTCA AAGAGATCAG GCCGGGGGAA TATTACAACC TTTCCGACTA TGAACGTCTG ACGGGGAAGA AGATCACAAA GTTCAACGAG GCTCCTATGC TGAAAGAGAT GGTTGAGAAG GGATTGTTAC CACCTGTTGA AGAAAGACTT CCGAAAAACC CCCTTGTTGT GACTCCCGTA AAAGAGATCG GACAGTACGG TGGCACATGG AGGAGAGCCT GGTACGGTTT TTCTGACAAA TGGGGACCAA ACAAGATCTG CTTTGAGTAT CCAATATTCA GAAGTAATGA TGGAAGTGAA CTTGTGCCGA ATGTCTTTGA AAGTATGATA GTGTCGCCAG ATGGAAGATC CTTTACTTTC AAAATTAGAG AGGGTTTGAA GTGGTCTGAT GGAGTTCCTG TCACAACTGA GGATGTCAGG TTCTGGTATG AAGACATACT TTTGAACGAG GAAATAACAC CATCTATATC TGCCGATTTG AGATCTGGGG GAGAAGTTTT CAAACTGGAG ATAGTGGACG ACTATACTTT CAGAATGATC TTTAAAGAAC CAAATATGGT ACTGCCCTGG AGAATAACCA AGTCCTGGGT GGCTGGAACG AGTTTCGTCG TTCCCTCTCA CTACATCAAA CAATTTCATC CGAAATATGT AGGAGAGGAA AAAGCCCAGC AGATAGCAAA AGAAAACGGT TTCGATAACT GGTGGCAGTT TGTGCAAGCA AGATGTCTGA ACAGTGATTC CTGGTTGAGG AACCCCGATC TTCCAGTTCT TTTCCCATGG AAGTTGTCCA GAAGGACCAC CGACACCATG CTGGTTCTTG AAAGAAATCC GTACTACTTC AAAATCGATC CAGAAGGCAA TCAACTTCCC TATATCGATG AGATAGTTCA TTACCTCGTT CAGAACTCTC AAATGCTTGT CTTCAAGGCG ATAACCGGAG AGATAGATAT GCAGGGCAGG AATCTGTCAG TGGCGGATCT TCAGCTGCTG CTGGCTAATC AGGAAAAAGG AGGATACAGA GTCATTTTCG AAAGACAGGC TATAGGAAGT GATGTTACAC TCTGGTTCAA CCAGAATTAC GAGGAAGATG AAATACTTGC GTCTATTCTC AGGGATGTGA GATTCAGGCA GGCTATTTCT CTTGCGATCA ACAGGGAAGA GATATGGCAG CTTGTGTACC ACGGACTAGG AGAACCGCGC CAGGCGTCTC TGATCAAAGG AGTGAAATAC TACGATCCTG AATGGGAAAA GGCGTATGCC GAGTACGACC CCGAAAGAGC CAATAAACTT CTCGATGAAA TGGGACTCAC CAAACGCGAT TCGGAAGGTT ATCGTCTCAG ACCTGATGGA AAGAGGCTTG AAATAACCAT CGAGTATCCA ACCGGTGTAT TCACAGCGTG GGACGATGCT CTTCAGATGA TAAAGAATTA TGTGGAGAAA ATAGGTGTGA AAGTGCTTCT TAAATCAGAA GAGAGGAGCC TCTGGGATAC AAGGAATCAA ACGGGCCAGA TACAAATAGC CGCATGGTGG TTCGACAGAA ATTCCGACGT TTTTGGTGAT CCTTCACTTC TTCTGGGATA CAGAACCTGG GCTCCTCTCA GTTACATTTG GTACAACCAG GGCAGACAGG GTGGAAAGGC TCCTGAGGAA GGAACTGATA TGTGGAAGAT TTATGAACTG TATGACCTGG CAAGGAAGGA ACCTGACGAT CAGAAAAGAG ATGAGTACAT GAGGCAACTT 
CTGGAAATAC ACAAGAAAAA TCTGTGGGCG ATAGGCACCG TTGGAGCCTT ACCACAGCCA GTGGTTGTCA AGAACAATTT TGAAAACGTT CCCGAAGACT TCCTTTGGGA CGATCCTCTC AGGAGTCCCA AGAATTTGAG ACCGGAACAG TTTTTCTTCA AGAAATGA
|
Protein sequence | MKRFVLVLVA LLVFQILVMG FGLKEIRPGE YYNLSDYERL TGKKITKFNE APMLKEMVEK GLLPPVEERL PKNPLVVTPV KEIGQYGGTW RRAWYGFSDK WGPNKICFEY PIFRSNDGSE LVPNVFESMI VSPDGRSFTF KIREGLKWSD GVPVTTEDVR FWYEDILLNE EITPSISADL RSGGEVFKLE IVDDYTFRMI FKEPNMVLPW RITKSWVAGT SFVVPSHYIK QFHPKYVGEE KAQQIAKENG FDNWWQFVQA RCLNSDSWLR NPDLPVLFPW KLSRRTTDTM LVLERNPYYF KIDPEGNQLP YIDEIVHYLV QNSQMLVFKA ITGEIDMQGR NLSVADLQLL LANQEKGGYR VIFERQAIGS DVTLWFNQNY EEDEILASIL RDVRFRQAIS LAINREEIWQ LVYHGLGEPR QASLIKGVKY YDPEWEKAYA EYDPERANKL LDEMGLTKRD SEGYRLRPDG KRLEITIEYP TGVFTAWDDA LQMIKNYVEK IGVKVLLKSE ERSLWDTRNQ TGQIQIAAWW FDRNSDVFGD PSLLLGYRTW APLSYIWYNQ GRQGGKAPEE GTDMWKIYEL YDLARKEPDD QKRDEYMRQL LEIHKKNLWA IGTVGALPQP VVVKNNFENV PEDFLWDDPL RSPKNLRPEQ FFFKK