Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1614 |
Symbol | |
ID | 6093063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1627037 |
End bp | 1628212 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642488815 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739633 |
Protein GI | 170289395 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000159886 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTCTGGTGAT CGCTTTGCTT GTCGTTTCTC TAGTTGTCCT CGCCCAACCG AAACTCACCA TCTGGTGCTC TGAGAAGCAG GTTGACATCC TTCAGAAACT CGGAGAGGAG TTCAAGGCGA AGTACGGCGT AGAGGTTGAA GTGCAGTACG TGAACTTCCA AGACATCAAG TCTAAGTTCC TAACAGCAGC TCCTGAGGGA CAGGGTGCAG ATATCATCGT TGGAGCACAC GACTGGGTAG GCGAACTCGC AGTCAACGGT TTGATCGAAC CCATTCCAAA CTTCAGTGAC CTGAAAAACT TCTATGAAAC CGCTCTCAAC GCGTTCTCTT ACGGTGGAAA ACTCTACGGT ATTCCTTACG CCATGGAAGC GATCGCACTC ATCTACAACA AGGACTATGT TCCTGAACCC CCAAAGACCA TGGACGAGCT TATAGAAATA GCAAAACAGA TCGATGAAGA ATTTGGAGGA GAAGTGAGAG GTTTCATCAC CTCAGCGGCC GAGTTTTACT ACATTGCTCC TTTCATTTTC GGATACGGTG GATACGTATT CAAACAGACA GAAAAAGGAC TGGACGTCAA CGATATCGGA CTGGCCAACG AAGGAGCCAT CAAGGGTGTG AAACTCCTCA AAAGATTGGT TGATGAGGGA ATACTGGATC CCAGTGACAA TTATCAGATC ATGGATTCCA TGTTCAGGGA AGGCCAGGCG GCGATGATCA TCAACGGACC GTGGGCCATT AAGGCGTACA AGGATGCAGG AATAGACTAT GGTGTAGCCC CAATCCCCGA TCTGGAACCT GGCGTTCCTG CAAGACCTTT CGTTGGGGTC CAGGGCTTCA TGGTGAACGC AAAATCCCCA AACAAACTCC TTGCCATCGA ATTCCTGACC AGTTTCATTG CAAAAAAGGA AACGATGTAC AGAATCTACC TTGGAGATCC AAGACTTCCC TCCAGAAAGG ACGTGCTCGA ACTTGTGAAA GATAACCCAG ACGTAGTTGG CTTCACACTG AGCGCAGCCA ACGGTATTCC AATGCCCAAC GTTCCACAGA TGGCCGCTGT CTGGGCCGCT ATGAACGATG CGCTCAATCT CGTTGTGAAC GGAAAAGCAA CGGTCGAAGA AGCGCTCAAA AACGCCGTTG AAAGAATCAA AGCTCAGATT CAGTAA
|
Protein sequence | MKKFLVIALL VVSLVVLAQP KLTIWCSEKQ VDILQKLGEE FKAKYGVEVE VQYVNFQDIK SKFLTAAPEG QGADIIVGAH DWVGELAVNG LIEPIPNFSD LKNFYETALN AFSYGGKLYG IPYAMEAIAL IYNKDYVPEP PKTMDELIEI AKQIDEEFGG EVRGFITSAA EFYYIAPFIF GYGGYVFKQT EKGLDVNDIG LANEGAIKGV KLLKRLVDEG ILDPSDNYQI MDSMFREGQA AMIINGPWAI KAYKDAGIDY GVAPIPDLEP GVPARPFVGV QGFMVNAKSP NKLLAIEFLT SFIAKKETMY RIYLGDPRLP SRKDVLELVK DNPDVVGFTL SAANGIPMPN VPQMAAVWAA MNDALNLVVN GKATVEEALK NAVERIKAQI Q
|
| |