Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1666 |
Symbol | |
ID | 6093116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1689305 |
End bp | 1690810 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642488867 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739684 |
Protein GI | 170289446 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGC TTGTCTGGTT GTTTCTGGTT CTGGCGGTGA CTCTTTCTTT TGCAGCAAAG GACATCATCG TGGTGGGTAC CACGGACAAG ATCAGAACTC TCGATCCTGC AAACTGCTAT GACTACTTCT CCTCGAACAT ACTCCAGAAC GTCCTGGTCG GTCTGGTTGA CTACGAGATA GGAACCAGCA ACCTGAAACC GGTGCTCGCA GAAAGATGGG AAGTCGATGA AACGGGAACG GTCTACACCT TCTATCTGAG AAAAGACGCA AAGTTCGAGG ATGGAACACC GATCGATGCA CACGTGTTCA AGTATTCCTT CGACAGGGTT ATGAGACTCA ACGGAGATCC CGCATTTTTG CTTTCGGACA TAGTCGAAAA AACGGAAGTG GTGGACGATT ACACGTTCCG TGTAACACTG AAGTACCCGT TCTCCGCGTT CGTCTCCGTT CTTGGCTACA CCGTGGCCTA TCCGGTGAAT CCAAAGGTTT ATCCAGCCGA TTCCTTCTAC GAAGGGATAC CTTCGGCCTC TGGGCCTTAC AGGATCAAAG AGTGGATCAG AGACGTGAGG ATCGTTCTTG AGGCCAATCC GAACTACTTC GGTGAAAAGC CAAAGACAAA GACCATCGTG ATCAACTTCT ACGAGAGTGC CTCCACCCTC AGACTGGCAC TCGAAACAGG AGAGATCGAC GTTGCATACA GGCATCTCGA TCCAAGGGAT ATCATCGATC TTGAGGGAAG AGAGGACATT GTTGTCTACA AAGGAAACAG CCCGCAAATA AGATATCTCG TGATAAACGT GACACAGCCT CCGTTCGACA ACGTGAAAGT GAGACAGGCA CTCGCCTATG CGGTCAACAG GTCTGTCATC GTCGAGGACG TGTTTGCAGG GCTTGCAAAA CCGCTGTACT CGATGATTCC AGAAGGCATG TGGGGACACA AGAGTGTCTT CCCTGAGAGA GATCTGGAAA AAGCAAAAGC ATTACTCAAA GAGGCTGGCT ACGACGAAAA CAACCCGCTC GTGATCGATC TCTGGTACAC ACCCACACAC TACGGAACAA CGGAGGCGGA CGTTGCACAG GTGTTGAAGG AATCGTTCGA AGAAACGGGT GTCATAAAAG TGAACCTGAA GTACGCCGAA TGGTCCACCT ACGTGGAATA TTTCCTGAAC GGTACCATGG GACTGTTCCT GCTTGGTTGG TATCCGGATT ATCTCGACCC AGATGACTAC GTGTGGCCCT TCCTGAGCGA AAGTGGTGCA AAATCTCTGG GAAGTTTCTA TTCGAATCCC GAAGTGGAAA ACCTCATGAT AGAAGCCAGA AAGCTCACCG ATCAGGAAAA GAGAGCCGAG ATCTACTACA AGGTCCAGGA GATCCTCGCC AGGGACGTTC CCTACATACC GCTCTGGCAG GGTGTTGCCA CCTGTGCAGC GAAAAAGCAG GTGAAGGGGA TCCTGCTTGA GCCCACACAG ATATTCAGAT ACTACATACT CTACTGGGAA GAGTGA
|
Protein sequence | MKRLVWLFLV LAVTLSFAAK DIIVVGTTDK IRTLDPANCY DYFSSNILQN VLVGLVDYEI GTSNLKPVLA ERWEVDETGT VYTFYLRKDA KFEDGTPIDA HVFKYSFDRV MRLNGDPAFL LSDIVEKTEV VDDYTFRVTL KYPFSAFVSV LGYTVAYPVN PKVYPADSFY EGIPSASGPY RIKEWIRDVR IVLEANPNYF GEKPKTKTIV INFYESASTL RLALETGEID VAYRHLDPRD IIDLEGREDI VVYKGNSPQI RYLVINVTQP PFDNVKVRQA LAYAVNRSVI VEDVFAGLAK PLYSMIPEGM WGHKSVFPER DLEKAKALLK EAGYDENNPL VIDLWYTPTH YGTTEADVAQ VLKESFEETG VIKVNLKYAE WSTYVEYFLN GTMGLFLLGW YPDYLDPDDY VWPFLSESGA KSLGSFYSNP EVENLMIEAR KLTDQEKRAE IYYKVQEILA RDVPYIPLWQ GVATCAAKKQ VKGILLEPTQ IFRYYILYWE E
|
| |