Gene Information |
Locus tag | TRQ2_1696 |
Symbol | |
ID | 6093146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1717396 |
End bp | 1718706 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488896 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739713 |
Protein GI | 170289475 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
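
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 1717396, End bp 1718706.
gene_bp, protein_aa = cds_lengths(1717396, 1718706)
print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the Gene Length (1311 bp) and Protein Length (436 aa) fields.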
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00506774 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTTTTGTTCT TCTGATGATT CTTCTCGTAG TTATCAGTTT AGCGAAGGTC AAGATTCAGT TCTGGCACGC CATGGGAGGA TGGAGGATCG AACTTCTTCA AAACATGGCG GAGGACTTCA TGAAGACCCA TCCAGACATT GAAGTAGAAG TTCAGTACAC CGGAAGCTAC AGAGATACTC TGAACAAACT CGTAGCAGCT GTACAGGGAG GAACCCCTCC ACACGTTGTT CAGATATACG AAATTGGCAC TCAGTTCATG ATTGACAGTG GTATAGCCGT CCCAATTGGT GATTTGATCG AGAAAGATCC CTCCTTCGAT GTTGGAAAAT TCCTTCCACA GGTTCTGGAC TACTACAGAG TGAAGGGGAA ACTCTACTCG ATGCCGTTCA ACTCTTCAAA TCCGATTCTC TACTACAACA AAACCCTCTT CAAAGAAGTG GGTCTCGATC CAAACAAACC TCCCAGGACA TTCAATGAGT TAATAGAATA CTGCAGAAAA CTCACGGTTA AAGATGAAAA AGGAAATATC GTTCGTGCTG GCATCACTTG GCCACTCCAC AGCTGGTTCT TCGAACAGTT CGTAGCTCTT CAGAATGCTC CTTTAGTTGA CAACGAAAAT GGAAGAGCGG GAAGAGCAAC AAAGGCAGTT TTCAACCACA AAGCGGCGCT CAGGTTCCTC AAACTCTGGA ATACACTTAC GAAAGAGGGT CTTATGATCA ACACAACAAA AGAAGACTGG ACAGGAGCAA GACAGCTCTT TATTTCTCAA AAAGTTGCCA TGCTTATCAC CTCTACCTCT GACGTGAAAC TGATGATGGA CGCTGCCAAG GAAAACGGAT TCGAGCTCGG AACAGCATTC CTTCCGAAAC CAGAGGGAGT TGAGCTTGGA GGAACACCAA TAGGTGGTGG AAGCTTGTGG ATCATAGGAG GCCATCCAGA AGAGGAGATA AAAGCGGCCT GGGAGTTCGT AAAATGGATG GCGGAGCCAG AACAACAGAT ACGCTGGCAC CTTGGAACAG GCTACTTCCC GGTAAGAAAA GACGCAGTAG AGACACTTCT CTACCAGGGT TACTACTCTG AATATCCTCA TCATCTCACT GCGCTTTTGC AGCTTCTGCT GTCGGTTCAA ACACCGAACA CCAGAGGAGC TGTTATAGGA CCGTTCCCAG AGGTGAGAGA CATAATAGAA ACCGCTATTG AAAAAATGAT TAATGGAGAA ATGACGCCCG AAGAAGCTCT CGCCTGGGCT GAAAAAGAAG CTACAAGGGC CATCAGAGAA TACAATGAAC TCTATGAGTG A
|
Protein sequence | MKKFFVLLMI LLVVISLAKV KIQFWHAMGG WRIELLQNMA EDFMKTHPDI EVEVQYTGSY RDTLNKLVAA VQGGTPPHVV QIYEIGTQFM IDSGIAVPIG DLIEKDPSFD VGKFLPQVLD YYRVKGKLYS MPFNSSNPIL YYNKTLFKEV GLDPNKPPRT FNELIEYCRK LTVKDEKGNI VRAGITWPLH SWFFEQFVAL QNAPLVDNEN GRAGRATKAV FNHKAALRFL KLWNTLTKEG LMINTTKEDW TGARQLFISQ KVAMLITSTS DVKLMMDAAK ENGFELGTAF LPKPEGVELG GTPIGGGSLW IIGGHPEEEI KAAWEFVKWM AEPEQQIRWH LGTGYFPVRK DAVETLLYQG YYSEYPHHLT ALLQLLLSVQ TPNTRGAVIG PFPEVRDIIE TAIEKMINGE MTPEEALAWA EKEATRAIRE YNELYE
|
| |
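
The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: it builds the standard codon assignments (which translation table 11 shares), strips the 10-base grouping spaces, and stops at the first stop codon. Alternative start codons permitted by table 11 are not special-cased, which does not matter here because the sequence begins with ATG. The gene sequence as listed starts with ATG even though the Strand field is "-", so it appears to be given in coding orientation already and no reverse complement is applied. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAAGAAGT TTTTTGTTCT TCTGATGATT"))  # MKKFFVLLMI
print(translate("CTCTATGAGTG A"))                     # LYE
```

The two fragments reproduce the first ten residues and the last three residues of the listed protein sequence. Passing the full Gene sequence field to `translate` is the natural next step for checking the whole record.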