Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0876 |
Symbol | |
ID | 6092306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 904001 |
End bp | 905878 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488074 |
Product | extracellular solute-binding protein |
Protein accession | YP_001738911 |
Protein GI | 170288673 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0197038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTCTTGTAGT TTTGTTCGTC GTTTCCATGT TCAGTCTGTT CTTTGCACAG CTTCCACCCA ATATTCCGAG AAATGAGACT TTCATAGCAC AGGTTTTGAC CGGAAGAGCA GCCAACCCCA CCAACTTCAA CGTGTGGACA GGATGGGTTT GGCAAGACAG GGGTGTTCAG AACCTGCTTC TGGAACCGCT CTGGTACGTG GATTTCGCAA CTGGAGAGAT CATAAACGCA CTTGCCGAAT CTCTCCCGAC CTACAATTCT GACTTCACAG AACTCACCAT CAAATTCAGA AAAGGTGTAT ACTGGAGCGA CGGAGAGCCG TTCACAGCAG ATGACATTGT GTTCACAATC GAAACAATCA TGAGCACTCC AGCGTTCGGA TACCATCAGG AACTGGTCAA CGAAGTTGAA AGCGTCGAGA AATTGGACGA CTACACCGTG AAAATAAAAC TCAAGAGACC AAATGCCAGA TTCCACGGCT ACTTCATCGA CAGATGGGGC GGAATAAGAC CTATGCCAAA ACACGTTTTT GAGAAAGTAG AAGATCCCGT TAACTTTGAA TTCAATCCAC CCGTTGGAAC TGGTCCATAC GTGCTCCATT CTGTCGATCC AGGAGGATAC TGGACACTCT GGCAGAGAAG AGAAGACTGG GACAGAACAC CGACTGGTAT GCTCTTTGGA ATGCCACAGC CCAAGTACGT ACTCTTCATA GATTACGGTT CCCCGGAAAA ACAGGTACTT GCTATGGCAC AGCATCAGCT TGACGAAGCA ATCCTCACGA TAGAAGCGCT CAAAGCGGTT CTCAGCAGAG TCAAAACAGC CAGAGCCTGG AGGAAAAACT TCCCGTGGAC TGTAAACAAC GATCCGTGTG TGACGGGATT TGTCTTCAAC ACGGCGAAAG AACCATTCAA CAACATCGAA GTAAGATGGG CTCTCACACT CGCAATTGAT ATTGTTGAAT ACGCTGCCAA CGCTTTCGAT GGTGCCGTAA CACTTTCGCC TATCCACATA CCACTTTCAA CCGCATACTA CAACTGGTAT TTCACAAGAC TTGAAGACTG GCTCAAAAAT TACGAAATCG ATCTTGGAAA CGGAGAGAAA TTCAAGCCAT ACGATCCAGA AGCAGGCCTT AGGTTAGCAG AATATGCAAA GAAGAGAGGC TATTCTGTAC CTGATAATCC TGAAATAATT AAGAGAACTT TCGGACCTGG TTGGTGGAAG TATGCCCCCG ATGTTGCAAC AAAACTTCTG GAAAAGAACG GATTCTACAG AGATAAGAAT GGAAAATGGC ATCTGCCGAA TGGAGATCTG TGGCAGATAA CGATAATTGC CCCCACCAAT CCATCTGATC CTGCTTACAG AAACGCATTT GCTCTCTCCC AGGCGTGGAA GAAATTCGGA ATAGATGCGG TCGTTCAGAC TTCCGAAAAT GCAAACTCGT TTGGTTCGGA GGGTAACTTC GATGTTCACA CCGCATGGCC AGCCGCAGAA CCTTGGGGTG GTCATCCAGA CCTTTACAGA ACACTTTATC CATTCCATTC TGAGTACGTT GTTCCAATAG GTGAAAATGC ACCATGGGGT AATTACTGCA GATGGTCTGA TCCAAGGCTT GACAAAATAA TCGAAGAGCT TAAGAACACA CCATGGGGTA ACACTCAAAA ACTCATAGAA CTCGGCACCG AAGCTCTTAA GATAATCGTT GAAGGACTTC CAAGTGTTCC GACATTCAAC TATCCTGGTG TTATCGCATG GGATGAATAC TACTGGACGA ATTATCCTGG AGCAGAAAAT CCATACTCGC AGCCCTATCA GCACTGGCCG AACTTCAAGT ACATGCTTCC GTTCTTGAAA CCTACCGGTA GAAAATAA
|
Protein sequence | MKKFLVVLFV VSMFSLFFAQ LPPNIPRNET FIAQVLTGRA ANPTNFNVWT GWVWQDRGVQ NLLLEPLWYV DFATGEIINA LAESLPTYNS DFTELTIKFR KGVYWSDGEP FTADDIVFTI ETIMSTPAFG YHQELVNEVE SVEKLDDYTV KIKLKRPNAR FHGYFIDRWG GIRPMPKHVF EKVEDPVNFE FNPPVGTGPY VLHSVDPGGY WTLWQRREDW DRTPTGMLFG MPQPKYVLFI DYGSPEKQVL AMAQHQLDEA ILTIEALKAV LSRVKTARAW RKNFPWTVNN DPCVTGFVFN TAKEPFNNIE VRWALTLAID IVEYAANAFD GAVTLSPIHI PLSTAYYNWY FTRLEDWLKN YEIDLGNGEK FKPYDPEAGL RLAEYAKKRG YSVPDNPEII KRTFGPGWWK YAPDVATKLL EKNGFYRDKN GKWHLPNGDL WQITIIAPTN PSDPAYRNAF ALSQAWKKFG IDAVVQTSEN ANSFGSEGNF DVHTAWPAAE PWGGHPDLYR TLYPFHSEYV VPIGENAPWG NYCRWSDPRL DKIIEELKNT PWGNTQKLIE LGTEALKIIV EGLPSVPTFN YPGVIAWDEY YWTNYPGAEN PYSQPYQHWP NFKYMLPFLK PTGRK
|
| |