Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1619 |
Symbol | |
ID | 6093068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1635482 |
End bp | 1637383 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642488820 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739638 |
Protein GI | 170289400 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00276535 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TTCTTGTGTT CGTGTTTTTA GTCCTCAGCG CTATTTCAGC GGTAATGGCT CAGATGCTAC CACCTGGTAT CCCTAGGGAA AAGACGTTGA TACTGCCTTT CCTCTTTGCA CCACTTCCCG CACCAGGTAA CTGGAACCTC TGGGCAGGAT GGAGAGCTCA AAACTGCGGT CTTCACCAGT TCGTCACCGA ACCTCTCTGG ACCATCAACC CCAACCCTGA GGAAGGCGGG ATCATCAACG CACTCGCTGC GGAGCCTCCT ATCTACAACG AAGACTTCAC AAAGCTTACG ATCAAACTCA GAAAAGGAAT TTACTGGAGC GACGGGGTTG AATTCACAGC GGACGACTTT GTGTTCACGA TTAAAACGGT GAAAGACACA CCCGGTCTGG ATTATCACGG CCCGATGCAA GATGTGAAAG ATGTCTACGC CCTTGACAAG TACACGGTCG TTGTGGAACT TGAGAGACCA AACAGTAGGT TCCATGCCTA CTTTGTTGAA AGATGGAATG CATTAAGACC TATGCCAAAA CACATCTTCG AAAAGGTAAA AGATGTGGTA TCCTTCGACT TCAACCCACC TGTAAGCTTA GGACCATACG TTTTGAAAGA TTATGACCCC GCAGGATACT GGGTGCTCTG GGAGAAGAGA AAAGACTGGC AGAGAACAGT CACTGGTCAA CTCTTCGGTG AACCTGTTCC TGAGTACGTC CTCTTCATAA ACTACGGTAC TCCTGAGAAG AACACGATGG CTATGCTGAG GCACGAACTG GACGTTCTTC AGGGATCGGC AGAACAATTG ATTACACTTC TGAGAATGAG CAAAAATACC AGAAGTTACA GAAAAACATG GCCATACATA GATCCAAGAG ATATTTCCAC GAGAGGACCT GGTTTCAACT TCATGGTGTA TCCGTACAAC ATCAAAGACG TGAGATGGGC ACTAGCTTTG TCCATAGACA TAGTAAAACT CGCCATTTCA ACGTACGACG GAATGGTCGC TATGACTCCA GGACTTCCTC TGGTTGTCAA CAAAAACTTC TACGAATGGT ATTTCAAGAG ACTGGAACCG TGGCTGGGGG AACTAGCACT GGATCTTGGA AACGGTGAAA CCTTCAAACC GTGGGATCCA CAGGCTCCAT GGAAACTCCT GGAATGGGCT CAGAAAATGT ACAAAGTGGA TATCGATCCA AATAATGAAG AGGAAGTGCG TCTGACTCTG GGTTACGGCT GGTGGAAATA CGCTCCAGAT GCAGCGGAGA AACTGTTGAA AAAACATGGT TTCTACCGCG ATGAAAACGG AAAATGGCAT CTACCAAATG GTGACCTGTG GAAGATAACC ATACTCAGAG GCCCAGATCC TACAGATATG GCTAACATCA TCATAGAGGG AATCGCCGAA CAGTGGAAAG AATTCGGTAT AGATGTCGTC TTCAATGTCT CCTCTGCCGC TGCGACGCTC GCAGGTGAAG GACGGTTTGA GGTGGTCAAC ACAGCACACG GTGGTTTTGC TGGTGAACCA TGGGGATTCC ATCCGGATCT TTACAGATGT TTCAATGCGT TCAGAAGCGA TTTTGTAAAA CCCATTGGTG AACTGACACT TGGTAGTGCT CTTAGGTGGA GTGATCCCAG AATGGACAAA ATCATAGAGG AACTCGAAAA AACAGACTGG AACGATTACG AAAAAGTTAT AGATCTTGGA GTCGAAGGAT TGAAGCTCGA AATTGAAGAG ATGGTAGCAA TACCGGTGTT CAACTGTCCT ATAACGATCG TCTTCGATGA GTATTACTGG ACCAACTTCC CAAGTCCAGA AAACGATTAT GCGAGATGTG ACAACTTCAC CACCTGGCCC CAGCTGAAGT ACCTGCTCCA CATGGTCAAA CCTGCTAAAT GA
|
Protein sequence | MKKVLVFVFL VLSAISAVMA QMLPPGIPRE KTLILPFLFA PLPAPGNWNL WAGWRAQNCG LHQFVTEPLW TINPNPEEGG IINALAAEPP IYNEDFTKLT IKLRKGIYWS DGVEFTADDF VFTIKTVKDT PGLDYHGPMQ DVKDVYALDK YTVVVELERP NSRFHAYFVE RWNALRPMPK HIFEKVKDVV SFDFNPPVSL GPYVLKDYDP AGYWVLWEKR KDWQRTVTGQ LFGEPVPEYV LFINYGTPEK NTMAMLRHEL DVLQGSAEQL ITLLRMSKNT RSYRKTWPYI DPRDISTRGP GFNFMVYPYN IKDVRWALAL SIDIVKLAIS TYDGMVAMTP GLPLVVNKNF YEWYFKRLEP WLGELALDLG NGETFKPWDP QAPWKLLEWA QKMYKVDIDP NNEEEVRLTL GYGWWKYAPD AAEKLLKKHG FYRDENGKWH LPNGDLWKIT ILRGPDPTDM ANIIIEGIAE QWKEFGIDVV FNVSSAAATL AGEGRFEVVN TAHGGFAGEP WGFHPDLYRC FNAFRSDFVK PIGELTLGSA LRWSDPRMDK IIEELEKTDW NDYEKVIDLG VEGLKLEIEE MVAIPVFNCP ITIVFDEYYW TNFPSPENDY ARCDNFTTWP QLKYLLHMVK PAK
|
| |