Gene Information |
Locus tag | TRQ2_0914 |
Symbol | |
ID | 6092344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 946290 |
End bp | 948104 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642488111 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738948 |
Protein GI | 170288710 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
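The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 946290
END_BP = 948104
GENE_LENGTH_BP = 1815
PROTEIN_LENGTH_AA = 604


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the CDS minus one, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold for this record: 948104 - 946290 + 1 = 1815 bp, and 1815 / 3 - 1 = 604 aa.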
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGT CACTTGTACT GTTACTGGCT CTTTTGGTTC TTTCCAGTTT GATGGCGCAG GTGTCTCTGC CACGTGAAGA CACAGTCTAC ATCGGAGGAG CCCTCTGGGG TCCTGCAACC ACCTGGAACC TCTATGCACC GCAGTCCACG TGGGGTACTG ATCAGTTCAT GTACCTTCCG GCGTTCCAGT ACGACCTTGG AAGAGACGCT TGGATTCCTG TCATCGCAGA AAGATACGAA TTCGTGGACG ACAAAACTCT GAGGATCTAC ATCAGACCTG AAGCAAGATG GAGCGATGGG GTGTCTATCA CCGCAGAGGA TTTTGTCTAC GCTCTGGAGC TCACCAAAGA ACTCGGAATA GGCCCCGGCG GTGGATGGGA TACCTACATC GAATACGTGA AAGCTGTTGA CACAAAAGTG GTTGAATTCA AGGCGAAAGA AGAGAATCTC AATTACTTCC AGTTCCTTTC CTACTCCCTC GGTGCACAAC CGATGCCCAA ACACGTCTAC GAAAGGATCA GAGCACAGAT GAACATAAAA GACTGGATCA ACGACAAACC TGAAGAACAG GTTGTTTCTG GTCCTTACAA ACTCTACTAC TACGACCCGA ACATCGTTGT GTACCAGAGA GTTGACGACT GGTGGGGTAA AGACATTTTC GGACTTCCAA GACCCAAGTA TCTGGCTCAC GTCATTTACA AGGACAACCC GAGTGCCAGT CTCGCGTTCG AAAGAGGCGA CATTGACTGG AACGGACTCT TCATTCCGAG TGTCTGGGAA CTGTGGGAGA AGAAAGGCCT TCCGGTTGGA ACGTGGTACA AAAAGGAACC TTACTTCATT CCCGACGGTG TGGGATTCGT GTACGTAAAC AATACCAAAC CTGGTTTGAG CGACCCAGCT GTGAGAAAAG CGATCGCTTA CGCTATTCCG TACAACGAAA TGCTCAAAAA GGCTTACTTC GGTTATGGAA GCCAGGCTCA CCCGTCCATG GTGATCGATC TCTTCGAACC GTACAAGCAG TACATCGATT ACGACCTTGC AAAGAAAACC TTTGGAACTG AAGATGGAAG AATCCCGTTC GATCTCGATA TGGCAAACAA GATCTTGGAC GAGGCAGGGT ACAAAAAAGG ACCTGATGGT GTAAGGGTTG GCCCCGATGG CACGAAACTT GGTCCGTACA CGATATCTGT TCCGTACGGC TGGACTGACT GGATGATGAT GTGTGAGATG ATCGCAAAGA ATCTGAGAAG CATAGGTATC GATGTAAGAA CTGAATTTCC AGATTACTCT GTATGGGCAG ACAGAATGAC GAAAGGAACG TTCGACCTCA TCATATCCTG GAGTGTTGGT CCGAGCTTCG ATCATCCGTT CAACATATAC AGGTTTGTGC TCGATAAGAG GCTGTCTGCT CCTGTAGGTG AAGTCACGTG GGCTGGAGAC TGGGAAAGGT ACGATAATGA TGAGGTAGTC GAACTCCTCG ACAAAGCAGT TTCTACACTC GATCCTGAGG TGAGAAAACA GGCGTACTTC AGAATCCAGC AGATCATCTA CAGAGATATG CCGAGCATAC CCGCGTTCTA CACGGCTCAC TGGTACGAAT ACTCGACGAA GTACTGGATC AACTGGCCGA GCGAGGACAA TCCAGCCTGG TTCAGACCTT CTCCATGGCA CGCGGACACC TGGCCGACTC TCTTCATCAT CTCCAAGAAG AGCGATCCAC AGCCCGTACC GTCCTGGCTT GGAACGGTTG ATGAAGGAGG AATCGAGATA CCCACCGCGA AGATCTTCGA AGATCTCCAG 
AAAGCGGCCA TGTGA
|
Protein sequence | MRKSLVLLLA LLVLSSLMAQ VSLPREDTVY IGGALWGPAT TWNLYAPQST WGTDQFMYLP AFQYDLGRDA WIPVIAERYE FVDDKTLRIY IRPEARWSDG VSITAEDFVY ALELTKELGI GPGGGWDTYI EYVKAVDTKV VEFKAKEENL NYFQFLSYSL GAQPMPKHVY ERIRAQMNIK DWINDKPEEQ VVSGPYKLYY YDPNIVVYQR VDDWWGKDIF GLPRPKYLAH VIYKDNPSAS LAFERGDIDW NGLFIPSVWE LWEKKGLPVG TWYKKEPYFI PDGVGFVYVN NTKPGLSDPA VRKAIAYAIP YNEMLKKAYF GYGSQAHPSM VIDLFEPYKQ YIDYDLAKKT FGTEDGRIPF DLDMANKILD EAGYKKGPDG VRVGPDGTKL GPYTISVPYG WTDWMMMCEM IAKNLRSIGI DVRTEFPDYS VWADRMTKGT FDLIISWSVG PSFDHPFNIY RFVLDKRLSA PVGEVTWAGD WERYDNDEVV ELLDKAVSTL DPEVRKQAYF RIQQIIYRDM PSIPAFYTAH WYEYSTKYWI NWPSEDNPAW FRPSPWHADT WPTLFIISKK SDPQPVPSWL GTVDEGGIEI PTAKIFEDLQ KAAM
|
| |
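As a rough consistency check between the gene and protein sequences, the following sketch translates the first 30 and last 15 nucleotides of the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The `translate` helper and the variable names are illustrative, not part of the source record.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the Gene sequence field; both begin on a codon boundary.
first_30 = "ATGAGAAAGT CACTTGTACT GTTACTGGCT"
last_15 = "AAAGCGGCCA TGTGA"

print(translate(first_30))  # MRKSLVLLLA
print(translate(last_15))   # KAAM
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full 1815 bp gene sequence to `translate` would be the way to check all 604 residues.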