Gene Information |
Locus tag | TRQ2_0500 |
Symbol | |
ID | 6091915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 496773 |
End bp | 498704 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642487687 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738539 |
Protein GI | 170288301 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.464453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAT CGCTTGTAGT TGCTCTGGTT CTTTTTTCAT TGATGATGTT TGGCCTCGGC CTGAAAGAGG TTGTGCCCGG AGAACAGTAC AATCTTTCTG ACTATGAACG TCTGACAGGG AAGAAGATCA CAGAGTTTCA CGAAGCCCCT GCGCTATCAG AACTTGTAAA GCAGGGCAAA CTCCCACCTG TTGAAGAAAG ACTTCCGAAA AACCCACTTG TTGTAACTCC TCACGAGGAA ATTGGAGAGT ACGGGGGTGT TTTAAGAAGA GTTTGGTACG GCATAGCAGA TTGGTGGAAC ATAGGAAGAA TATGTTTCGA AGGATTAATT ATGGCGGATA AAACCGGAAG CGAGTTCTTA CCGAATATAT TCGAGGATTT GAGAATGGAA GAAAATGGTA AAGTGTTTAT AGCCAAAATA AGAGAAGGCT TGAAGTGGTC TGATGGAGTA CCAGTAACAA CAGAAGATGT TCTTTTCTGG TATCATGATA TCTTTCAAAA CAAAAACTTT ACATCTGCCA TTCCGGACTT TTTTAAGCCT GGCGGAGATT TCACAATAGA AGCTGTGGAT ACCTATACTT TTAAAATTAT CTTTCAGAAA CCCTATCCAC TTTTCCCACT ATTTCTAACT GACCAAATAT ATTCGGCTAT TCTGCTTCCT TCCCATTATG TCAGAAATTT CCTTCCAGAA TATATTGGTC AAGACAAAAT CGAGAAACAA GCTAAAGAAA AGGGATACTC TTCATGGCAA CAGTTTGTGC TTGATTATTG TTTATTAGAA AACGCTTGGG TGAGAAATCC TGACCTTCCT GTTCTTTTCC CTTGGAAACT TTCCGATCGT TCTACAGACG AGCTTACCAT TTTCGAAAGA AATCCTTATT ATTTCAAAGT TGATACTGAA GGAAATCAAT TGCCTTATAT TGACGAAATA CACCTCTACA GAGTAGCAAA TAAGCAAATA CTCCTTATGA AGGCTCTTGC GGGTGAAATT GACTTCCAAA TGAGGGGATT TACAACTGAT GATTATCAAA TTCTAACTTC GAATGCCGAA AAAGGAGGAT ACAGAGTTAT TCTGGTAAAG AAAACCACTG GTAGTGGGAC CTCTCTCTTC ATGAACCAGA CTTACACCGG GGATCCTATC ATAGCTGAAT TATTAAAGAA TCCAAAGTTT AGATATGCTA TCTCTTTAGC AATAAACCGT GAAGAAATAC TACATCTTGC CTATTTAGGT CTTGGAGAAC CACGTCAAGC TTCGCTTGTG ACAGGAGTTA AATACTACGA TCCTGAATGG GAAAAGGCAT TTGCTGAATA CGATCCAGAA AGAGCAAATA AACTTCTTGA TGAAATAGGA CTTGAAAAAC GTGATAAAGA AGGATACCGA CTAAGACCAG ATGGCAAAAG ACTAGAGCTC ATTATAGAAT ATACTGAACC ACGATCTGAA TTAGAACTTA TAAAATCCTA CATTGAATCC TTAGGAATAA AGGTTCTTCT AAAACAGGAA GAAAGATCAC TATGGTTTAC AAGGCTGGAA GCAGGAGAGA TGCAAATAGG TGTTTGGGTG TTTGGATCCA TTTCGATTTT CTTAGATCCA GATATGATGG GATATAAATG GGCTCCTTTA GCTTATCAAT GGTATGTTAA TGGAAAAAAA GGAGGTATTG AACCAGAAAA AGGAACTGAC CTCAGAAAAC TTTACGATTT GTGGGACCAA ATATTGTTTG AACCCAATGA GGAAAAAAGA GATGCACTCG TAAAAGAACT CATAAATCTC TTCAAAAAAA ACATATGGAT AATTGGTACA 
GTAGGTGAAT TCCCTCTTCC TGCAGTAGTG AAAGACAATC TTAAAAATGT CCCTGAAGGA TTACTTTTTG ATTTTCATTT GTTAAAAACA CCAAAAAACT TCAGGCCAGA ACAGTTCTTC TTCAAAAAAT AG
|
Protein sequence | MRRSLVVALV LFSLMMFGLG LKEVVPGEQY NLSDYERLTG KKITEFHEAP ALSELVKQGK LPPVEERLPK NPLVVTPHEE IGEYGGVLRR VWYGIADWWN IGRICFEGLI MADKTGSEFL PNIFEDLRME ENGKVFIAKI REGLKWSDGV PVTTEDVLFW YHDIFQNKNF TSAIPDFFKP GGDFTIEAVD TYTFKIIFQK PYPLFPLFLT DQIYSAILLP SHYVRNFLPE YIGQDKIEKQ AKEKGYSSWQ QFVLDYCLLE NAWVRNPDLP VLFPWKLSDR STDELTIFER NPYYFKVDTE GNQLPYIDEI HLYRVANKQI LLMKALAGEI DFQMRGFTTD DYQILTSNAE KGGYRVILVK KTTGSGTSLF MNQTYTGDPI IAELLKNPKF RYAISLAINR EEILHLAYLG LGEPRQASLV TGVKYYDPEW EKAFAEYDPE RANKLLDEIG LEKRDKEGYR LRPDGKRLEL IIEYTEPRSE LELIKSYIES LGIKVLLKQE ERSLWFTRLE AGEMQIGVWV FGSISIFLDP DMMGYKWAPL AYQWYVNGKK GGIEPEKGTD LRKLYDLWDQ ILFEPNEEKR DALVKELINL FKKNIWIIGT VGEFPLPAVV KDNLKNVPEG LLFDFHLLKT PKNFRPEQFF FKK
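The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon of a complete, unspliced CDS. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 496773
END_BP = 498704
GENE_LENGTH_BP = 1932
PROTEIN_LENGTH_AA = 643


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the record's Gene Length and Protein Length fields agree with its Start bp and End bp values.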