Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1786 |
Symbol | |
ID | 6093237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1799846 |
End bp | 1802989 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642488983 |
Product | hypothetical protein |
Protein accession | YP_001739800 |
Protein GI | 170289562 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00635446 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTAGAT TCGTTATCCT TCCATTGCTT CTTGTGGCCT CCCTTTCCGT TGCTCAAATT CCCACTTACC AGGAGGTCAG AAACGATTTT GGAATGGCAA AGGATTTTGC GAGTCTTCTC AAAATACCCG CTGAGGAGTA CAGCTTCTTT GTTTCAGGAG CCTCCAATTT TCTCGATGTT CTCGATGTCA GTGAAGGAAT CTCTTACTTG AAAGAAGGAA ATTACGACAA GGCTTTCGAA AATTTCTCAA AAGTAGGCGT CTCCCAGATT GTGAACCTGA TTCCTTACGT CAACACAATG CTCACCGTTC TAAGCTTCTC CCAGCCTTTC TGGGACAAAG TGGAAAAGTA CGTTTTCGAT CAGAGAATAC AAGAGTCTTC GCAGAAATTT GTGAATTTGA AGATGAAAGA ACTGGAAAAA ATCCTCGAGG AAGATCCGCT CGCTTTCAAG GGAGAGGATT TCAAAAGATG GTTGGAGACA GATGATTCCA TCGAAGTCAG AGATTTCCTT GAAGACATAT TCGTAAACGA GGGAATGAAG CACGGACTCT ACAACGTGGG AAACAAGGTT TTGAACGAAC TCGACTTCGC ATCCAAACCC TGGTATGAGA AACTGAAAAA GCTGTATTTG AACTTCTCCT ACAAAGCGTA CACAGTAGAC GAAGTCAAGC ACTTCTGGAT CACACAGTGG AGGAGGGAAC TTTTCAAACG TGGAGCGGAA TACGTTCTCA AGAGAAAAAA AGAAGCGCTG AACTACTTTC TCTCAGAAAG CACCGTGAAC ATCACTCTAA AAATCAACAG CCCTGAAATG TTCGTTCTCA CAGTTCCCGA ACTCAACCTG AGTGCACAGG TACGGAAAGA ACACAAGATC TCTTCTTCAC TGAAAGATAT GAAATCAAGA CAGATCACCA TCGTTTTGAA AGATCTCAAA GGCAACACTG TTTACACAAA GGTACTCGAT GAGAAAGATC TCTTCTCAAG ATACGATCCG TGGCTTTCAG ACAAGAAGAG CACCTATTAC GTTAAAACCG TCAACATAAC TCCCACTTAC AAGAGGGAAA TTGTGAAAAA TCTTCGCGTT CAACTTCCTG AGGAGGTCAA ATCTGCCACG GTCACGATCA AGACTCAAAA AACAGATCTT CACGTTGAGA ACTCTCCTTC TTTCTCTCTG TCAGACCTGG TTTTAACCCT GGACGAAGAA ATAAGGGTGG AAGCCGAGAT AGTCGATTCT GCCACAAAAA CACATCTGAT GGTTTACGAA ACGTTTGTTC CATCGTCCGG CGAAATCGTG TTGGATGAAC CGAAGATGTC CAAAATACAA TTCGAAAGTG AAGAAGAGTA CAGTGCGTAC CTCAAAGAAT TAAAGAAAGA ATTCCTCGAG AATGTGGATG ATCCAAACGC CGTGAAGGTA CTTCGGAAAA AACTGGAGGA AGCCTTCTAC GCTTTTCACG CTAACCCACT CGCTCGGTCT TCTTTTGAAG TCATCTCGTA TTCGCGAAAA TTCATAGGTG ATGAATACAG ACTCACGGAC GAGGAATTTC TCTTCGAAAG TCTCGACGAC GAATTCTCCT CCCTGAACCG GAAGTTTCAG GAAATCGATT CGGAATGGGG GAAAATCACT TACAACGCTC GAAACCTTGA ACACATCTCT TCCAGAACAC TCGATAAGGG AGCGGTCATA CCGGCCTACT ACAGGTACGT CGTACAGGCA AAAGAAGACC TCCAGAAAGA AATAGAAGAA ATGAAAAGCA ATCTTGAAGG AATTCTGAAC AGATTCTCGG AAATTTTAGA GCAAGCAGAA GAACTCAGAA AGAATGTTGA TCAAAAGATC GCACTCGATA TTCTTTTCGA GACTCATCTG ATCATGAAAA CTTCACAATT GATAGGGAAA ATTCAGGAGA AAATCGATGA AGCGATGAGC AGACTGAACG ATCTGGAGAG TATGGAAATC AAAACCGAAG AATATCTCGA TACCTTTTTA GCTCTGGTGA ACAATACCTA CAAAAAGGCG GAGATTTACA ATACACTTGT CGATCAATGG AATCTTCTGA TGAAACAGGG CGATGAGATA AAAAGGAGTA TTGAAGAATT CAAACCCGAG TACGTTCATC ATCTGACAGA GCTGATGAAA TACGACGGCA TTAACCTCTT TTATCTGAGA GAAAACGAGC ATCTTGGAGC GATCGGAATC GACGAGATAG CAGCCAGAAC TCTCCAGGAG TTTCTGAAAG AAGAAGAGCT CGCAGAGCGA TCTGAAGCGT TCAACCAAGC CGTGAGAAAG ACTCGAGAAC AAATACAACA GTATCACGTT TTTCACTATT ACATCTCTTT GACAGACAAC GACCTGAAAG TTGACAAGAA CTACAGATTG GTCGGTAATT ATTATCGCTA TTCGATCGAC TTCTTCGACA AAAATTTTCC GTACAAGAGC ATAGGGATTT CACGTGAACT GGAAAACGTT CCTGAACTTT TGGAAAAGTT CTTTCGGCTC AGGGAGAAAC TAGAAGGAGA AGATCCGGAA GGAGACAGAT TGAGAAAAAC ACTCGAAAGA CTCTACAGCC TGTCTTCGTT TGACACCGTA GAGGAATTCA ACGCAATTAT AGACGAATGG ACAGATCTAT ACGAAGCCTA CAAACAGGAA GATCCCTTCA AAACCAGCGT ACTGTCTTAC TCATACATAA ACAAAGACGG TCGAACAGTT CAAAAAACAT ATCCAACCTA TGACTTTCTG AGATGGCTCT CACAGAGAAA CAGAAAAAAT CAGGAGTACA TCCTCAACAA CTATCTCAGA AGAATCACTG ACGATCCAAC AAGGGAAGGT TACGAGATTC GAGATCCCAG TGCTATTCTC GAGAATCACC TGGTCGAAGC CACAATAAAA GTCCTCCCGA GCTACAAATC CGGAATAATG TCGGAAATAG ATTCCCTCAA AAAAATTAGC AGAATGAAAA GAAAGAGGAA AGAAGAAGAG ATACAGAAGA AAATCATGAC AGATCCGTTC GCTGCCGTGG GAGCAAAAGT GCCAGAAGAA ATCAAAGACT GGCTCTTGAG ACGTTTGGAC GTCACACACC CGGATCTCTA CGAAGAGTGG AACAGAACCA CCTCTGAACT TCAAGCGTAT CTCGAGGAGG TGACAAAGCA GTGA
|
Protein sequence | MFRFVILPLL LVASLSVAQI PTYQEVRNDF GMAKDFASLL KIPAEEYSFF VSGASNFLDV LDVSEGISYL KEGNYDKAFE NFSKVGVSQI VNLIPYVNTM LTVLSFSQPF WDKVEKYVFD QRIQESSQKF VNLKMKELEK ILEEDPLAFK GEDFKRWLET DDSIEVRDFL EDIFVNEGMK HGLYNVGNKV LNELDFASKP WYEKLKKLYL NFSYKAYTVD EVKHFWITQW RRELFKRGAE YVLKRKKEAL NYFLSESTVN ITLKINSPEM FVLTVPELNL SAQVRKEHKI SSSLKDMKSR QITIVLKDLK GNTVYTKVLD EKDLFSRYDP WLSDKKSTYY VKTVNITPTY KREIVKNLRV QLPEEVKSAT VTIKTQKTDL HVENSPSFSL SDLVLTLDEE IRVEAEIVDS ATKTHLMVYE TFVPSSGEIV LDEPKMSKIQ FESEEEYSAY LKELKKEFLE NVDDPNAVKV LRKKLEEAFY AFHANPLARS SFEVISYSRK FIGDEYRLTD EEFLFESLDD EFSSLNRKFQ EIDSEWGKIT YNARNLEHIS SRTLDKGAVI PAYYRYVVQA KEDLQKEIEE MKSNLEGILN RFSEILEQAE ELRKNVDQKI ALDILFETHL IMKTSQLIGK IQEKIDEAMS RLNDLESMEI KTEEYLDTFL ALVNNTYKKA EIYNTLVDQW NLLMKQGDEI KRSIEEFKPE YVHHLTELMK YDGINLFYLR ENEHLGAIGI DEIAARTLQE FLKEEELAER SEAFNQAVRK TREQIQQYHV FHYYISLTDN DLKVDKNYRL VGNYYRYSID FFDKNFPYKS IGISRELENV PELLEKFFRL REKLEGEDPE GDRLRKTLER LYSLSSFDTV EEFNAIIDEW TDLYEAYKQE DPFKTSVLSY SYINKDGRTV QKTYPTYDFL RWLSQRNRKN QEYILNNYLR RITDDPTREG YEIRDPSAIL ENHLVEATIK VLPSYKSGIM SEIDSLKKIS RMKRKRKEEE IQKKIMTDPF AAVGAKVPEE IKDWLLRRLD VTHPDLYEEW NRTTSELQAY LEEVTKQ
|
| |