Gene Information |
Locus tag | TRQ2_1595 |
Symbol | |
ID | 6093044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1609646 |
End bp | 1611319 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642488796 |
Product | extracellular solute-binding protein |
Protein accession | YP_001739614 |
Protein GI | 170289376 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
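The coordinate and length fields above are consistent with each other, which a short calculation makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1609646
end_bp = 1611319

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1674
print(protein_length)  # 557
```

Both results match the Gene Length (1674 bp) and Protein Length (557 aa) rows.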
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0133917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGT TTTTAGTCGT TCTCGTTCTG GTCCTGGCAC TGGTTTCGGT TTTCGGACAG ACTTTTGAGA GAAACAAAAC GCTCTACTGG GGTGGAGCGC TGTGGTCTCC TCCATCCAAC TGGAACCCGT TCACACCATG GAACGCGGTT GCGGGAACCA TCGGTCTTGT CTATGAACCT CTGTTCCTCT ACGATCCTCT GAACGACAAG TTCGAGCCGT GGCTTGCAGA AAAAGGAGAA TGGGTCAGCA ACAACGAATA CGTACTCACG CTCAGAAAGG GTCTCAGATG GCAGGACGGA GTTCCTCTCA CGGCAGACGA CGTGGTTTTC ACCTTTGAAA TCGCCAAGAA GTACACTGGT ATCAGCTACA GTCCTGTGTG GAACTGGCTC GACAGGATCG AAAGGGTCGA TGAACGAACG CTGAAGTTCG TCTTCTCCGA CCCGAGGTAC CAGGAATGGA AACAGATGCT CATCAACACA CCGATCGTAC CAAAACACAT CTGGGAAAAC AAAACAGAGG AAGAAGTTCT TCAGGCGGCC AATGAAAATC CAGTTGGATC CGGTCCGTAC TACGTTGAAA GCTGGGCAGA CGACAGATGT GTATTCAAGA AGAACGGGAA CTGGTGGGGC ATCAGAGAAC TCGGTTACGA TCCCAAACCT GAAAGGATCG TGGAACTGAG AGTGCTCAGC AACAATGTCG CAGTAGGAAT GCTCATGAAA GGAGAACTCG ACTGGAGCAA CTTCTTCCTG CCGGGTGTTC CGGTTTTGAA GAAAGCATAC GGAATCGTCA CCTGGTATGA AAACGCTCCT TACATGCTCC CGGCCAACAC CGCAGGAATC TACATCAACG TGAGCAAGTA TCCTCTCAGC ATACCTGAGT TCAGAAGAGC AATGGCTTAC GCTATCAATC CCGAGAAGAT CGTTACCAGA GCTTACGAGA ACATGGTGAC GGCTGCCAAT CCCGCTGGAA TCCTGCCGCT TCCCGGTTAC ATGAAGTACT ATCCGAAAGA AGTCGTCGAT AAGTACGGAT TCAAGTACGA TCCGGAGATG GCAAAGAAGA TCCTCGACGA GCTTGGATTC AAAGATGTGA ACAAGGATGG ATTCAGAGAA GATCCGAACG GAAAGCCGTT CAAGCTCACG ATTGAGTGTC CGTACGGATG GACCGACTGG ATGGTTTCTA TCCAGTCCAT TGCAGAAGAT CTCGTGAAAG TCGGAATCAA CGTCGAACCT AAATACCCCG ACTACTCCAA ATACGCAGAC GACCTCTACG GTGGAAAGTT CGATCTCATA CTCAACAACT TTACAACCGG TGTTTCCGCT ACCATCTGGT CCTATTTCAA CGGTGTGTTC TATCCGGATG CAGTAGAATC CGAGTACTCC TACTCCGGAA ACTTTGGAAA GTACGCCAAT CCTGAAGTTG AGACTCTTCT CGACGAACTC AACAGAAGCA ATGATGATGC TAAAATTAAA GAAGTAGTAG CCAAGCTGTC AGAGATACTG CTCAAGGATC TGCCGTTCAT TCCTCTGTGG TACAACGGTG CATGGTTCCA GGCTTCTGAA GCTGTGTGGA CCAACTGGCC AACGGAGAAG AATCCGTACG CTGTCCCGAT AGGCTGGAAC GGCTGGTGGC AGCTCACAGG AATCAAGACG CTCTTTGGTA TTGAAGCAAA GTAA
|
Protein sequence | MKRFLVVLVL VLALVSVFGQ TFERNKTLYW GGALWSPPSN WNPFTPWNAV AGTIGLVYEP LFLYDPLNDK FEPWLAEKGE WVSNNEYVLT LRKGLRWQDG VPLTADDVVF TFEIAKKYTG ISYSPVWNWL DRIERVDERT LKFVFSDPRY QEWKQMLINT PIVPKHIWEN KTEEEVLQAA NENPVGSGPY YVESWADDRC VFKKNGNWWG IRELGYDPKP ERIVELRVLS NNVAVGMLMK GELDWSNFFL PGVPVLKKAY GIVTWYENAP YMLPANTAGI YINVSKYPLS IPEFRRAMAY AINPEKIVTR AYENMVTAAN PAGILPLPGY MKYYPKEVVD KYGFKYDPEM AKKILDELGF KDVNKDGFRE DPNGKPFKLT IECPYGWTDW MVSIQSIAED LVKVGINVEP KYPDYSKYAD DLYGGKFDLI LNNFTTGVSA TIWSYFNGVF YPDAVESEYS YSGNFGKYAN PEVETLLDEL NRSNDDAKIK EVVAKLSEIL LKDLPFIPLW YNGAWFQASE AVWTNWPTEK NPYAVPIGWN GWWQLTGIKT LFGIEAK
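The sequences above are printed in space-separated blocks of ten characters, so the blocks need to be joined before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the start of the gene. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record (an excerpt, not the full gene).
head = "ATGAAAAGGT TTTTAGTCGT TCTCGTTCTG"
print(translate(head))  # MKRFLVVLVL
```

The translated excerpt matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content row, which is reported as a rounded percentage.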
|
| |