Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1583 |
Symbol | |
ID | 6093032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1593785 |
End bp | 1595047 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642488784 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739602 |
Protein GI | 170289364 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000175887 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAAAGT GGTTGTTTTT CATGGTTCTT CTGATCGTTG CGGGTCTCAT GTTCGGAAAG GTGAACTTCG CGTCCACACA GATGACACCC GCTGCTGAAA GGGAGTTCAT GCTCAACAAA CTCGCGGAAT TTTCGAAGAA GAGCGGTATC GATGTGGAGT TCCTCAACTT CGAGTATCCA CAGCTCTACA GCAGGCTCCA GGCGGAGATC AGAGCCGGTA AAAATACGCT GAACCTGATT GCAGACCTCC AGGGAAACCT CTACATAATG GCCTCTGAAG GATTCCTCAG TGATCTCAAG GATCTCAAAT TCGAAGGAAA AACCTTCATC GAGACGCTTG AGAAGTTCGC TTATGTGAAA GGTGAAAAGG TGTTCATTCC CTGGCTCCAG GCAACTTACG TGATGGCCGT TAACAAAAAG GCGTTTGACT ACCTGCCGCG CGGTCTTTCG AAAGAAGACG TCATCAGGGG GACGGAGAAG TGGACTTACG ACGCTCTGCT CGAGTGGGCA AAGAACATCT ATGAGAAGAC GAAACAACCC CTTCTTGGCT TCCCGATCGG ACCGAAGGGA CTCTGGCACA GGTTCCTCCA CGGCTACATC TATCCATCCT TCACGGGAGC GCAGGCTCTG AAGTTCGACA GTGTGAGGGC CGTTGAAATG TGGAACTATC TGAAGGAGCT CTTCAAATAC GTACATCCGG CAAGCTCCAC CTGGGACGGG ATGGCCGATC CTCTCCTGAG AGAAGAAGTC TGGATCGCCT GGGATCACAC TGCAAGACTC AAACCCGCGA TCGTTGAAAA GCCTAACGAT TTCGTTGTTG TACCGGTCCC AAGAGGGCCG ATGGGTAGAG GGTACATCAT AGTGCTCGTG GGCCTTGCCA TACCGAAGGA TGCGGACTTC GAAGAACCCG CGAAAGTGAT AGACTTCCTC ACTTCTCCAG AGATGCAGGT TGAAATCCTC AAGAACGTCG GTTTCTTCCC AGTGGTTCAG GAAGCCGTTG GTGCCGTGCC AGAAGGTGCC CTCAAAGTGC TCGCAGAAGG TGTGATAAAT CAGTCCGCCA CGAAGGATTC TATCGTTTCC TTCATACCGA GTCTTGGACC AAAGAGCGGA GAGTTCACCG AAACCTACAG AATGGCCTTC ACGAGGATCG TCTTCCAGGG TGAAGACCCA GCGAAGGTAG TGAAGGAACT CGGTGAGCGA ATCAGACAGC TGTTCAAAGA ATCCGGAGCG GAACTTCCAG AACCCGACGC AAGTCTCTTC TGA
|
Protein sequence | MRKWLFFMVL LIVAGLMFGK VNFASTQMTP AAEREFMLNK LAEFSKKSGI DVEFLNFEYP QLYSRLQAEI RAGKNTLNLI ADLQGNLYIM ASEGFLSDLK DLKFEGKTFI ETLEKFAYVK GEKVFIPWLQ ATYVMAVNKK AFDYLPRGLS KEDVIRGTEK WTYDALLEWA KNIYEKTKQP LLGFPIGPKG LWHRFLHGYI YPSFTGAQAL KFDSVRAVEM WNYLKELFKY VHPASSTWDG MADPLLREEV WIAWDHTARL KPAIVEKPND FVVVPVPRGP MGRGYIIVLV GLAIPKDADF EEPAKVIDFL TSPEMQVEIL KNVGFFPVVQ EAVGAVPEGA LKVLAEGVIN QSATKDSIVS FIPSLGPKSG EFTETYRMAF TRIVFQGEDP AKVVKELGER IRQLFKESGA ELPEPDASLF
|
| |