Gene Information |
Locus tag | TRQ2_0512 |
Symbol | |
ID | 6091927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 513529 |
End bp | 514800 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642487699 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738551 |
Protein GI | 170288313 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
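The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 1272 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for TRQ2_0512.
length = gene_length_bp(513529, 514800)
print(length, protein_length_aa(length))
```

With the values in this record, the gene length comes out to 1272 bp, which is 424 codons, or 423 amino acids once the stop codon is excluded.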
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00024476 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAAAC TCGCAGTTGT TCTTTTGATC TCTTTGATAC TTCTACCGGT GCTGGTCAGT GCTGTGAAAC TCACCATATG GTGTGGAGGA GGTACTGAAA GAAAGGGTCT GGAAGCGGTG GTTGCTGAAT ACAAGAAATT GAATCCAGAT GTCGAGATCG AACTTGTGGA TGTTCCTTAC AGTTCTTATG AGCAGAAGAT AAGATTGGGA ATACTGAGTG GTGATCTCCC AGATCTTGTA ACAATTACGT ACCCATATGC ACCTGGATAC ATGCAGTACA TGCTCGATCT GAGACCTTAC ATTCAGAAGT ATCTTGGAAT CACACCAGAT GATTTCCTAA AATCTCTCTA CGATGTTGTA AGAATTCGTA TAACCACAAA CGAAGGAGAA ATCAAATATG TTCCTTTGCA CTTCACGGCA CAGTGTCTCT GGGTAAACAA GGATTATTTC GAAAAAGCAG GCGTCCCCTA TCCACCTTTT GGAGGAAGAG AAGAACCCTG GACATGGGAA GAATTCATTT CGGCTTTGAA AAAAGTGAAA GAAGCAAATG GCATTCCTTA CGCTCTCTCA ATGCAGAGAA CTGCTGAAAG ATTGTTCGCA TACATGGCAA TCAGAGGAGT CAAAATAATT GATGAAAATC TCGATTTTAC ACTGGATAAG GATCCAAGAG CAAAACAGCT GCTTCAGGAT TTTGCAAACA TGTTTAAAGA AGGTCTCATG GTTCCCGCCG AATGGATCTC TGCTCAGGAT CCAAACATGG CATTTGGAGG AGGATTGACA GCAGTTCTGT GGGCTGGAAG CTGGAGCACA GCCGATCTTC TTTCGATTGA AGGTAAAAAC TTTGTGCCTG CTTATCTTCC AAAGGATATG TACTGGCTGA GTCTCGAAGG TGGAAGGTTC TTTGGTACCT TCAAAACGGG CGATAAAGCC AGAGAAGAAG CAGCCGCTAA ATTCGCACTG TGGGCTGGTT GGAAAGGTCT CGGCTATGAC ATCTATTTGA AGACTACATT TCATATGTCT GCCTACAAGA ATCACCATGT GGACTATGGA AATCCAATCA TGGATCAGGT TCAGAAAGTC TGTGGCGATA TGATAGCTAG TACACCGGAA TGGGTTGTCA CCATAAGGAA TTCTGTTGCC TGGTCCAGAT TGCAGTCTCC AATTGTCAGT CAGATGTCCG CTCTGGTGGC GGGTCAAACA ACAGTGGACA ATGTTATAAA GGCGCTTCGA AATGAATACG ACAAAATAGT AGCCGAAGTT GGAAAGAAAT AA
|
Protein sequence | MRKLAVVLLI SLILLPVLVS AVKLTIWCGG GTERKGLEAV VAEYKKLNPD VEIELVDVPY SSYEQKIRLG ILSGDLPDLV TITYPYAPGY MQYMLDLRPY IQKYLGITPD DFLKSLYDVV RIRITTNEGE IKYVPLHFTA QCLWVNKDYF EKAGVPYPPF GGREEPWTWE EFISALKKVK EANGIPYALS MQRTAERLFA YMAIRGVKII DENLDFTLDK DPRAKQLLQD FANMFKEGLM VPAEWISAQD PNMAFGGGLT AVLWAGSWST ADLLSIEGKN FVPAYLPKDM YWLSLEGGRF FGTFKTGDKA REEAAAKFAL WAGWKGLGYD IYLKTTFHMS AYKNHHVDYG NPIMDQVQKV CGDMIASTPE WVVTIRNSVA WSRLQSPIVS QMSALVAGQT TVDNVIKALR NEYDKIVAEV GKK
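The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a few lines of code. This is a minimal sketch rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given on the coding strand, with the space-separated 10-base blocks stripped first. All names are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by NCBI tables 1 and 11,
# written in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGCGGAAAC TCGCAGTTGT TCTTTTGATC"))
```

Applied to the first 30 bases of the gene sequence, `translate` yields MRKLAVVLLI, matching the start of the protein sequence listed here. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.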