Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0289 |
Symbol | |
ID | 6091693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 273080 |
End bp | 276058 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642487468 |
Product | polysaccharide export protein |
Protein accession | YP_001738330 |
Protein GI | 170288092 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.593882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAG CGCTGATCTT TCTGTTTCTT CTGATTGCGG TGTTTTCTCT GTCTTACACC ATAAGAAAGG GAGACGTTCT GAGGATCGAA GTGGTGGGAT ACCCAGATCT GACGAGAAAT TGTGCGGTGG ACATAGAAGG TGCGATCACG TTTCCGTACG TGGGACGTGT GAAGGTGGAA GGCCTCAGCG TGGACCAGTT AACAGAGCTT TTCAAGGAAA GGCTCTCCAG GAGTTTTTCT GAACCTGAGG TGATCGTGTC TCTTCAGCAG ATTGCTCCAA GGAACATCTA CGTCTCCGGT GTTGTGAACA GAGTGGTGGA CATGGGAATA GAAGATCTCA GCGTTTCTGA GCTTTTCTCT CTTCTTTCTG TGGATCTCTC CTCAGTCGAT CTTTCGAAGG TGAAGGTTTT GAGAGATGGG AAGGTTTTTG AACTGGATCT CTCTTCGCTT CTCTGGGGTG AAGCTCCCGA GAAAGACGTG ATGCTCCAGG AGAACGATCA GGTGATACTC CCGGAAAAGA GCTATACGGA GTTCGTGAAG GTCGTTGGGG CCGTGGCAAA GCCGGGCATC TATCCTTACA GAAGGGAGAT GACTCTGCTC GACGCCGTAG CGGCAGCGGG CGGTACCACG CAGGAGAGTT CGGGAAAGAT AATCGTCCTC TCGAAAGACA AAACGGTTGA GATATCAGAG AAAGATATCT ATCAAAAGAA CCTCCTTTTG AAACCAGGGG ACACGGTGCA CGTCCAAAAA CTCGACGAGA GGTTCGCCTA CGTTGTGGGA GCGGTGGCAA GACCGGGAAT GTACACCTTC TCCAGAGAAG AGTCTCTCAC GCTCAAAAAT CTCATCGCAA AAGCCGGCGG AGTCTCCGTT GATAACAGAT ACATAGAAAA GGTTCTGATC ACAAGGAATG GAAAAACCAC GGAGTACGCA CCGGAAGTGC TGAACGAGAA CGTTCAGCTG AATGTCGGAG ATGTCGTTGA GATAAAAAAA TACGAAGAGA CAAGAGTCTA TGTATCAGGA TATGTGGCAA GACCAGGTGT TTACGAGATC TCCCCGAAGG AATCGGTGAC CCTGGAAAAG CTCCTCTCGA TGGCAGGAGG GTTCAGAGGA AGTGTTGAAG AAATCGATTC GATCGTCATC GCAAGAGGCG GAAGCGTTGT GGAGCTTTCA CCAGACAAGT TAGATTTTCT TGTGAAACCC GGCGACATAG TGAACGTGAA GGAGTTCGTT CCAAAGAAGG CTTACATCCT CGGGTACGTG AGAAGCCCCG GGCTTTACGC CTTTGGAAAG AGCGAAGCGT TCACGCTGAG AAACCTCATA GCGAAGGCCG GTGGGTTCAT CGATGAAAGT CAGGTTGTCT CCGTGAAACT GTCTGGAAAG GAATACTTCC CAGAGGAGAT AGTGGAAAAA GAGATTCCCC TCGAAGACGG AGTGTTCGTC TACGTGGAGA AGTACACGGA CAGGTTCGTT TACATAGTGG GAGACAATGC TACAAGGAAC GGAAGGATGA GTTTTGCTAA GGACGAACCG TTCACACTCT CGACGGCCCT GAAGAAGTAC GGGATCGAGG ATTTCTCTCT TGTGAAGAGC CTATCACTTT TGAGAGACGG TGAAGAAAAG AGCTTCGATC CAGAGAAGGT ATTGACGGAG GATGTTTCTC TCAAAACAGG CGACACGATC CTCGTGAAGA CGATACAGAC GAAGAGGGTC TATTTCACCG GGGACGTGTA CGGATACGTC GATTTTGCGA AAGACGAAGA CATCACACTT GAGAAAGCCC TCGCGAGGTT CGGAAAAATA CAGAAGAAGT ACATAGCGGG ATTGAAGGTG CGCTCGAGCG GAAAGGTGCA GGAACTGGCG GAAGTTACCG ATCTTTCGCT CGAAGACGGT GCGGTTGTCG AAGTGGATGT GAAAGAAGCC GTCAGAGTCT ACGTGGACGG CTTTGTGAAG GTGCCCCAGA TGGTTGTCTT CGAGCCCGAT GAGTCGGCTC TCTTGGACAG AGCCATTGTG AAGGCTGGAG GGTACAAAGA AGACGCTCTC TTCGAAGCGG GCAACGTGGT CGTTCTGAGA GACGGCGGTG AGATCACCGT TTCTCAAAAC CAGCTCAGCT TGTTCGAGCT GAAGGACGGA GACCTCGTGT ACGTGAAGTA CACCGAAAGA CCTCACGTTT ACGTCTTCGG AGAAGGCATC ACGAACACGC TTGTGACCTT CAAAGACGAG GAAAAACCAA CGCTCAGGAA CGTTCTGGGT AAGGTGGGCG GTGTAAAGAG CACGGGATCG AGCAGGATCG TCGTTGTGAG TCCATCGGGT GAGAAGAAAG AAGTGTATTA TGAAGACGTG ATCAACACAG GTGGGCCTGT TCTTGAGAGC GGAAGCGTGG TCTTCGTGCC TCTTGAGACG GAGAACTTCG CCTACGTCGT GGGAGAGGTG GTAAGACCCG GTGCGTACGA ACTCAAAGGA GACGTGACAC TTCTGAAGCT CATAGCACAG GCCGGCGGTC TGAGCAACTG GGCCCTGAAG ACGAAGGTGA TCCTGAGAAG GGGAGAGAAT GAAACGACTT ACGACTTCAC GAACATAGAC GAGGTGCAGA AGGTGAAGAT AGAGCCAGGA GACGTGGTGT ACGTACCACC CGTTGAAACG AACTACGTGT ACGTGCTCGG TAACGTGAGG ACACCGGGTA TCGTGAAGGT GGACAGGTAC TCGACGGTGT TCGATGTGGT GATGAGGGCC GGTGGGTTCA CAGACAGAGC CGCCACAAGC AGGATATTCT TATTCAAGGG AGGACCTCAG GGAGAGGTCA CCGTTTGCGA TCTCTCAGGA GTTCTTTCTG GAAAGGGTGG AGGTGTGAAT CCGAACGTCG CACCGGGTGA TGTGGTCTTC GTTCCCGACA ACCCGCTCAT ACAGGTGACA GAGGCGCTCT CCATAGTGAA CACCGTGATA AACACAATCA GCAACGTCAG AGACTTCATG GGGTGGTAA
|
Protein sequence | MKRALIFLFL LIAVFSLSYT IRKGDVLRIE VVGYPDLTRN CAVDIEGAIT FPYVGRVKVE GLSVDQLTEL FKERLSRSFS EPEVIVSLQQ IAPRNIYVSG VVNRVVDMGI EDLSVSELFS LLSVDLSSVD LSKVKVLRDG KVFELDLSSL LWGEAPEKDV MLQENDQVIL PEKSYTEFVK VVGAVAKPGI YPYRREMTLL DAVAAAGGTT QESSGKIIVL SKDKTVEISE KDIYQKNLLL KPGDTVHVQK LDERFAYVVG AVARPGMYTF SREESLTLKN LIAKAGGVSV DNRYIEKVLI TRNGKTTEYA PEVLNENVQL NVGDVVEIKK YEETRVYVSG YVARPGVYEI SPKESVTLEK LLSMAGGFRG SVEEIDSIVI ARGGSVVELS PDKLDFLVKP GDIVNVKEFV PKKAYILGYV RSPGLYAFGK SEAFTLRNLI AKAGGFIDES QVVSVKLSGK EYFPEEIVEK EIPLEDGVFV YVEKYTDRFV YIVGDNATRN GRMSFAKDEP FTLSTALKKY GIEDFSLVKS LSLLRDGEEK SFDPEKVLTE DVSLKTGDTI LVKTIQTKRV YFTGDVYGYV DFAKDEDITL EKALARFGKI QKKYIAGLKV RSSGKVQELA EVTDLSLEDG AVVEVDVKEA VRVYVDGFVK VPQMVVFEPD ESALLDRAIV KAGGYKEDAL FEAGNVVVLR DGGEITVSQN QLSLFELKDG DLVYVKYTER PHVYVFGEGI TNTLVTFKDE EKPTLRNVLG KVGGVKSTGS SRIVVVSPSG EKKEVYYEDV INTGGPVLES GSVVFVPLET ENFAYVVGEV VRPGAYELKG DVTLLKLIAQ AGGLSNWALK TKVILRRGEN ETTYDFTNID EVQKVKIEPG DVVYVPPVET NYVYVLGNVR TPGIVKVDRY STVFDVVMRA GGFTDRAATS RIFLFKGGPQ GEVTVCDLSG VLSGKGGGVN PNVAPGDVVF VPDNPLIQVT EALSIVNTVI NTISNVRDFM GW
|
| |