Gene Information |
Locus tag | TRQ2_0890 |
Symbol | |
ID | 6092320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 921899 |
End bp | 923872 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642488088 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738925 |
Protein GI | 170288687 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
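The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 921899
end_bp = 923872
gene_length_bp = 1974
protein_length_aa = 657


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length, computed_protein)  # prints: 1974 657
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.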
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAAGGT TGCTTGTTTT GCTGTCGCTT GTGTTCATGG TTGTTTTAGC TCTTGCTGCC AACGACACAT GGGTCTTCTA CGCAACACCG GAAGAGTACT ACAAGGCTAC AGGAAAGAAG ATTACCGAGT ACCATGAATC ACCGATGCTG ACCAAACTCG TCGAAGAAGG AAAGCTTCCA CCCGTCGAAC AGAGACTTCC GGAGGAACCG CTCGTGGTTC AGCCTGTTGA AAAAGTTGGA CAGTTCGGTG GTACCTGGAG AAGGGTCTGG AAAGGGCCTT CTGACAGGTG GGGTATTTCC AAACTCATCG AAGTGAAACT CGCGTTCTGG GACAAAGAGG GTGGAAAACT CGTTCCGGGG CTTGCGAAGA GCTGGGAAGT TCTGGAGAAC GGAAGGGTAT ACATCTTCCA TCTGAGAAAG GGTGTGAAGT GGTCCGATGG AGTACCGTAC ACGGCCCACG ATATCGTGTT CTGGGTTAAC GACATCGTAG GAAACGACGA TATCACACCT TCGAAACCTG ACTGGTACAA CATTGGTGTG AAAGTCGAGG CACTCGATGA TTACACGGTG AAGTTCGAAT TCAGCAAGCC TTATGGATTG TTCCTTCTGA AAGTTCCATA CGGTGGATTT ACCGGAGCAC CAGCACACTA TCTGAAACAG TTCCATCCAA AGTACACACC GATGGAAGAA ATAGAGAAGA AGATGGTGGA AGGTGTGCAC AACACCTGGG TGGACCTCTT CAACGATAAA AACGACTTCC TTGAAAACAC CGAGCTTCCA ACACTATCAC CGTGGAAGCC TATCACCGAT CCAACAGAAC AGTTCTACAT ACTCGAGAGA AACCCGTACT TCTGGGCGGT TGATATCGAA GGGAATCAGC TTCCATACAT CGATTACGTG AGGCACGAAT ACGTCAAGAA CGACGAAGTC ATACTCCTGA AAGCGATCTC CGGTGAAATC GATATGCAGT GGAGACATAT CGGAGGACTG GGAGCGGGAG CAGGAAACTT CACACTGCTC ATGGAGAACG CCCAGAGTGG AGGATACAGG GTGCTGAAAT GGATCGCTGC GAACGGTTCT GCCAGCAGAA TCTCATTGAA CTACGCTCAC TCCGACGAGG TGCTGAGGAA GGTCTTCAAC GATGTGAGGT TCAGGCAGGC TCTCTCACTC GCTATCAACA GGGAAGAAAT CAACGAGATT CTCTTCAACG GTCTCGCTGA GCCAAGGCAG GCATCTCTCG TGAGTGGATC CCCATACTTC GATCCCGAGT GGGAAAAAGC TTACGCAGAG TACGATCCAG ACAGAGCGAA CAAGCTTCTC GATGAGATGG GGCTGAAGTG GGATGACAAG CACGAATACA GACTCTTACC AGATGGCAGA CCACTCCGAT TCACCATCAC TGTGACTGGA CAGTTTCATG TTGACGTCTG GACGATGGTG AAGGAATACT GGAGACAGAT AGGGGTCTGG GTGGAGATCG AGAACGTTGA AAGGTCTCTC TTCTACGAAA GAGCCGATGC CGGTGACTTC GATGCGATGG TGTGGAACAT GGATAGGGCT GCTCAACCAC TCTCTTCACC GATGGTCATC TTCCCGGGTT CCGAGGACAT AGCAGACTTC TGGTACATAG GATGGAGTGA CTGGATCTCG TACTACATCG ACAAGAACAT AAGAGGCGTG GAACCCGAAG AAGTACCCGA AGGGCCTGAA CCACCAGAGG TCGTCTACAG ACTTGTCGAT CTGTACTACC AGATAGCCTC CACGCCGGAT CCTGATAAAA TCAAAGAGCT CATGGCAGAA 
GCAACGAAGA TCCATAGAGA AAATCTCTGG ATGATAGGAA CCGTCGGAGA AGACCTTTCG CCTGCCATAG CGAAGAACAA CTTCAGAAAC GTACCAGAAT TTCTCGTAAC GGACGATGTG TTGAGAACTC CTCTGAATGC CATGCCGATG CAGTTCTTCA TCGAACAGAA ATGA
|
Protein sequence | MRRLLVLLSL VFMVVLALAA NDTWVFYATP EEYYKATGKK ITEYHESPML TKLVEEGKLP PVEQRLPEEP LVVQPVEKVG QFGGTWRRVW KGPSDRWGIS KLIEVKLAFW DKEGGKLVPG LAKSWEVLEN GRVYIFHLRK GVKWSDGVPY TAHDIVFWVN DIVGNDDITP SKPDWYNIGV KVEALDDYTV KFEFSKPYGL FLLKVPYGGF TGAPAHYLKQ FHPKYTPMEE IEKKMVEGVH NTWVDLFNDK NDFLENTELP TLSPWKPITD PTEQFYILER NPYFWAVDIE GNQLPYIDYV RHEYVKNDEV ILLKAISGEI DMQWRHIGGL GAGAGNFTLL MENAQSGGYR VLKWIAANGS ASRISLNYAH SDEVLRKVFN DVRFRQALSL AINREEINEI LFNGLAEPRQ ASLVSGSPYF DPEWEKAYAE YDPDRANKLL DEMGLKWDDK HEYRLLPDGR PLRFTITVTG QFHVDVWTMV KEYWRQIGVW VEIENVERSL FYERADAGDF DAMVWNMDRA AQPLSSPMVI FPGSEDIADF WYIGWSDWIS YYIDKNIRGV EPEEVPEGPE PPEVVYRLVD LYYQIASTPD PDKIKELMAE ATKIHRENLW MIGTVGEDLS PAIAKNNFRN VPEFLVTDDV LRTPLNAMPM QFFIEQK
|
| |
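The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG is an accepted initiation codon and the initiator is read as methionine. The sketch below illustrates this on the first and last few codons of the sequence above. It is a minimal illustration, not the pipeline that produced this record: `translate_cds` and its `initiator` parameter are hypothetical names, and the function simply assumes that the first codon it is given is a valid start codon when `initiator=True`. The amino acid assignments used are those of the standard genetic code, which table 11 shares for elongation codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG ordering (third base cycles fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna, initiator=True):
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    If initiator is True, the first codon is ASSUMED to be a valid start
    codon and is reported as M regardless of its usual meaning.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if (initiator and i == 0) else aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence in the record.
head = "GTGAGAAGGT TGCTTGTTTT GCTGTCGCTT"
tail = "CAGTTCTTCA TCGAACAGAA ATGA"

print(translate_cds(head))                   # MRRLLVLLSL
print(translate_cds(tail, initiator=False))  # QFFIEQK
```

Both fragments match the corresponding ends of the protein sequence listed above. The tail fragment is in frame because the gene length and the fragment length are both multiples of 3.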