Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1592 |
Symbol | |
ID | 6093041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1605679 |
End bp | 1607361 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642488793 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739611 |
Protein GI | 170289373 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.382923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAT TTTTACTGGT ATTCGTGGTT GTTTTTCTTT TCTCAAGTCT GTTCTCCCAA GTTTTAGAAC GAAACGAAAC TATGTACTAT GGAGGTTCTC TGTGGTCTCC TCCTTCTAAT TGGAATCCGT TCACTCCGTG GAATGCGGTA CCAGGAACAA CTGGACTTGT CTATGAAACA ATGTTCTTTT ACGATCCACT CACTGGAAAT TTTGATCCAT GGCTTGCAGA AAAAGGTGAA TGGTTAGACA GTAAGACTTA CAGGGTTGTA TTGAGAGAGG GTATATACTG GCATGATAAT GTTCCATTGA CATCAGAAGA CGTTCGATTT ACTTTCGAAA TAGCTAAGAA GTACAAGGGA ATACATTACA GTAGTGTTTG GGAATGGCTT GATCATATTG AAACACCCGA CAACAGAACC GTCATTTTTG TGTTCAAAGA TCCTCGATAT CATGAATGGA ATGAACTCCT CTATACACTT CCAATTGTTC CAAAACATAT TTGGGAAGAA AAAGATGAAA CCACTATACT TCAATCATCA AATGAGTATC CATTGGGATC AGGCCCATAT GTTGCTCACT CGTGGGACCA GAATAAAATG ATTTTCGAGC GTTTTGAGAA TTGGTGGGGA ACAAAAGTTA TGGGTGTGAA ACCTGCTCCG AAATACGTTG TTATAGTGAG AGTCCTCAGT AACAACGTGG CGCTCGGCAT GTTGATGAAA GGAGAACTGG ACTTCAGTAA TTTCATGCTC CCAGGTGTTC CCATTTTGAA AAAAGTTTAT AATCTCAATA CATGGTACGA CGAACCACCG TATCACCTCT CATCAACGGT TGTTGGTCTT TTCCTCAATG CACGAAAATA TCCTCTTAGC CTTCCCGAGT TCAGAAGAGC AATTGCTATG TCGATAAATG CAGATCCAAT AGTTCAAAGA GTCTATGAAG GAGCCGTCTT AAAAGCAGAT CCCCTTGGTT TTCTTCCGAA TTCTGTTTGG ATGAAGTACT ATCCAAAAGA AGTTGTAGAA AAGCATGGTT TCAAATACGA TCCTGAAGAG GCGAAAAGTA TTCTTGATAA GCTTGGATTC AGGGATGTAA ATGGAGATGG TTTCAGAGAA GCCCCAGATG GAAAACCCAT TAAGCTCACC ATCGAGTGTC CGTATGGATG GACCGACTGG ATGCAGGCAA TTCAGGTGAT AGTAGATCAA CTCAAGGTGG TTGGAATAAA CGCTGAACCA TACTTCCCGG ATTCTTCCAA ATACTATGAA AACATGTACA AAGGAGAATT CGATATAGAA ATGAATGCCA ATGGAACAGG TATAAGCAGC ACTCCCTGGA CATATTTCAA TACTATTTTC TATCCTGATG CTTTAGAATC TGAATTCTCT TACACAGGAA ATTATGGAAG ATACCAGAAT CCCGAGGTGG AAAGTTTACT TGAAGAACTC AACAGGACAC CACTTGACAA TGTTGAGAAA GTCACCGAAC TCTGTGGAAA ACTTGGAGAG ATCCTTTTAA AAGATCTACC TTTCATCCCT CTCTGGTATG GAGCGATGGC GTTTATAACA CAAGATAACG TTTGGACTAA CTGGCCCAAC GAACATAATC CATATGCCTG GCCATGTGGT TGGGCCAACT GGTGGCAAAC TGGTGCCTTG AAAATTCTAT TTAATCTCAA ACCGGCAAAA TAA
|
Protein sequence | MKRFLLVFVV VFLFSSLFSQ VLERNETMYY GGSLWSPPSN WNPFTPWNAV PGTTGLVYET MFFYDPLTGN FDPWLAEKGE WLDSKTYRVV LREGIYWHDN VPLTSEDVRF TFEIAKKYKG IHYSSVWEWL DHIETPDNRT VIFVFKDPRY HEWNELLYTL PIVPKHIWEE KDETTILQSS NEYPLGSGPY VAHSWDQNKM IFERFENWWG TKVMGVKPAP KYVVIVRVLS NNVALGMLMK GELDFSNFML PGVPILKKVY NLNTWYDEPP YHLSSTVVGL FLNARKYPLS LPEFRRAIAM SINADPIVQR VYEGAVLKAD PLGFLPNSVW MKYYPKEVVE KHGFKYDPEE AKSILDKLGF RDVNGDGFRE APDGKPIKLT IECPYGWTDW MQAIQVIVDQ LKVVGINAEP YFPDSSKYYE NMYKGEFDIE MNANGTGISS TPWTYFNTIF YPDALESEFS YTGNYGRYQN PEVESLLEEL NRTPLDNVEK VTELCGKLGE ILLKDLPFIP LWYGAMAFIT QDNVWTNWPN EHNPYAWPCG WANWWQTGAL KILFNLKPAK
|
| |