Gene Information |
Locus tag | TRQ2_0965 |
Symbol | |
ID | 6092395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 998851 |
End bp | 1001883 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642488161 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001738998 |
Protein GI | 170288760 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
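The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption about this record, although the reported numbers are consistent with it. A minimal Python sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS: one per codon, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(998851, 1001883)
print(length)                     # 3033
print(protein_length_aa(length))  # 1010
```

Both results match the Gene Length (3033 bp) and Protein Length (1010 aa) fields of the record.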
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.272944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAAAGGCC GATTGTGGAG AATGCTTTCA GAGATTGTTC CGTATACTGT TCTGAGAATA GAAAGAATAG AAAGCTGGAT TTTCTCCGAT GGTGCTGTTG AGAGAATCGT GGATCCTTCC TTCGAATGGG ACTTCAGCTC CGCTCCCGTC TGGTTCAGGA AAGAGCTAGA GCCTTTCTCT GCCGCTGGAG AGCAGAGGGC CTACCTGAAA CTCTGGTTCG GTGGGGAGAC CCTTGTTTTT GTGGATGGGA AGCCTTACGG TGAGATCAAC GAGTATCATA GGATGTTGAA CATCACCCCC CTTGCTGATG GAAAACCACA CACGATAGAA GCTCAGGTGA TGCCAAGAGG TCTTTTTGGA AGACCGGAAA AACCGGTGTT TACAGAAGCT TTCTTCATCG TCGTTGATGA AGCACTGATG AAGGTGGTGA AAACTCTCGA ACTCACCATA AAAACGGCAG AAGTAATAGA AGATGAGTCA CTTTCTAAGA AACTGCTGGA CATCTCCGAG AAGTTTCTCT CGAAAGTGTG GATCCCAAGA GACACAGATA CCTATCTGAT GACAGCACCG GAAGATCCGG GAATAAAAGA TGAGATCAAA AACACCTGGA ACACACCGGA GTTCAAAGAG TTCACAGGTG TGAAGCTTTC TGAAGAGTTG AGAAATCAGA TTCTGGAAGA GTTCGAAAAA TTCAAAGAAA AGCTGGATAG AATAAGAAAA GACCATCCGG GTTTTGGAAC GATTCACCTT GTGGGGCACG CGCACATAGA CTACGCCTGG CTCTGGCCAG TTGAGGAGAC GAAGAGAAAG ATCCTACGCA CTTTCGCAAA CTCTGTGTTG CTCTCTAAGC TTTATCCGGT GTTCGTTTAC ACTCAGTCTT CCGCTCAGAT GTACGAGGAT CTCAAGCAAA ATTCACCAGA ACTTTTCGAG GAAGTCAGAA AACTTGTAAA GGAAGGAAGA TGGGAACCCG TTGGTGGCAT GTGGGTGGAG TCGGACTGCA ACGTTCCATC GATAGAGTCG CTTGTGAGAC AGTTCTACTA TGGGCAAAAA TTCTTCGAAA GGGAATTCAA AAGAAAGAGC AAGGTGTGCT GGCTTCCAGA CGTGTTTGGG TTTTCCTGGG TGCTTCCCCA AATTCTGAAA GAAGCCGGGA TAAAATACTT CGTCACCACG AAACTCAACT GGAACGACAC GAACGAGTTT CCGTACGATC TGTGCCGCTG GAGGGGAATA GATGGATCCG AAGTGATCTA TTTCAGTTTC AAAAATCCCA ACGAGGGGTA CAACGGAAAG ATAGATCCCG ATACGGTCTA CAAAACCTGG AAGAACTTCA GGCAGAAAGA TCTCACAAAC AGAGTTCTTC TTTCGTTCGG ACACGGTGAT GGTGGTGGCG GTCCAACCGA AGAGATGCTG GAAAATTACG AAGTTCTGAA GGATTTTCCT GGACTGCCGC GCCTTGAGAT GGGAACTGTG GAAGAATTCT TCAAAAAAAT GGATATCGAC GGAGAACTTC CTGTGTGGGA CGGAGAGCTT TACCTTGAAC TTCACAGGGG AACCTACACT TCACAGTCCA GGACAAAGAA ACTTCACAAA GAAGCGGAAG ACAGTCTTTA CCTTGCGGAA TTGATTTCTG CTTTCACGGA TAAAGATTTT TCTGACGAAA TAGACGAACT CTGGAAGATT CTGTTGAGAA ACGAATTTCA CGATATTCTA CCTGGTTCTT CTATAAAGGA AGTCTATAAA GATACAGAAA AAGAGCTCAG ACATGTGATA GAAAAATCAA AAGACATCGT TATCGAATCT 
CTCAAAGTTT TTTCTTCTGA GAACGAAGAG GTTTTAACCC TTTTGAACGT TTCCTCTTTT CCAAAGAAGT GTCTTTTCTT CCTCAACGAA GATCTCGCGA TTTCCTTTGA AGGAGAAGCG CTCTTGAAAC AGAGAACTCA CGATGGAAGG TATGTGTACT TCATAGACAG GGAGATTCCT CCGTTCACGA AAGTAGAACT GAAATTTCGC AAAGCCACGT TTGAGGAAAC TCCAAGTGAG TTGAGAGAAA CAAACATCAT GGAGAACGAA TTTCTCAGGG TGCACGTCAA CGATGACGGA ACAATTCAAA TCTACGACAA AGAACTGGAC AGGTACGTTT TCGAAGAGAA GGGAAACATC TTGAAACTTC ATAAAAACAT CCCTGCCTAC TGGGACAACT GGGATATCGC AGAAAACGTG GAAAAGACAG GATATACCCT GAGGGCGAAA AACATAGAAA AAATAGAGTC TGGCCCTGTT CGAGAAGTGA TCCGTGTTGA ATATGAATCA GAAGGAAGCA GGATCACGCA GCATTACATC CTTTACAGAA AGAGTAGAAG GCTCGATATA GAAACGAAGG TAGACTGGCA CACAAGACGA GCTCTTCTCA GAGCTTACTT CCCAACAACT GTTCTGTCGA GAAAGGCAAG GTTCGATATC TCCGGTGGTT TCATCGAAAG GCCCACACAC AGAAACACCA GTTTCGAACA GGCACGTTTC GAGGTGCCGT TTCACAGGTG GATGGATCTC TCCCAGACAG ACTTCGGTGT GTCTATTATG AACGACGGAA AATACGGTGG CAGTGTTCAT CAGGGTACTA TGGCGCTTTC ACTGATAAAA GCGGGTATTT TCCCCGATTT TCTCTGTGAC GAGGGTAAAC ACAGTTTCAC CTATTCTGTC TACGTGCACC CTGGGGACAG CTTGAGAGAT GTTGTAAAAG AATCAGAAGA TCTCAACAGA TCTTTCATCG TTCATCGCGG GGTGTTGAAC CTCCCCTCTC CTTTACTGGA GATCTCTCCC CAGAATTTCC GTCTCACTTC ACTGAGAAGG GTGAATGGCA AAATTGTTCT GAGGCTTGTT GAGATCTTCG GAACATCAGG AAAACTTTCC ATTAAGACCC CGTGGAACGG TGAAATCTAC CAGACGAACG TTCTGGAAGA GAAAAAACAG AAAGTCACCT TCCCAGTGGT TTACCATCCG TTCAAGATCT ACACTTTTGT TGTAGAAGGT TGA
|
Protein sequence | MKGRLWRMLS EIVPYTVLRI ERIESWIFSD GAVERIVDPS FEWDFSSAPV WFRKELEPFS AAGEQRAYLK LWFGGETLVF VDGKPYGEIN EYHRMLNITP LADGKPHTIE AQVMPRGLFG RPEKPVFTEA FFIVVDEALM KVVKTLELTI KTAEVIEDES LSKKLLDISE KFLSKVWIPR DTDTYLMTAP EDPGIKDEIK NTWNTPEFKE FTGVKLSEEL RNQILEEFEK FKEKLDRIRK DHPGFGTIHL VGHAHIDYAW LWPVEETKRK ILRTFANSVL LSKLYPVFVY TQSSAQMYED LKQNSPELFE EVRKLVKEGR WEPVGGMWVE SDCNVPSIES LVRQFYYGQK FFEREFKRKS KVCWLPDVFG FSWVLPQILK EAGIKYFVTT KLNWNDTNEF PYDLCRWRGI DGSEVIYFSF KNPNEGYNGK IDPDTVYKTW KNFRQKDLTN RVLLSFGHGD GGGGPTEEML ENYEVLKDFP GLPRLEMGTV EEFFKKMDID GELPVWDGEL YLELHRGTYT SQSRTKKLHK EAEDSLYLAE LISAFTDKDF SDEIDELWKI LLRNEFHDIL PGSSIKEVYK DTEKELRHVI EKSKDIVIES LKVFSSENEE VLTLLNVSSF PKKCLFFLNE DLAISFEGEA LLKQRTHDGR YVYFIDREIP PFTKVELKFR KATFEETPSE LRETNIMENE FLRVHVNDDG TIQIYDKELD RYVFEEKGNI LKLHKNIPAY WDNWDIAENV EKTGYTLRAK NIEKIESGPV REVIRVEYES EGSRITQHYI LYRKSRRLDI ETKVDWHTRR ALLRAYFPTT VLSRKARFDI SGGFIERPTH RNTSFEQARF EVPFHRWMDL SQTDFGVSIM NDGKYGGSVH QGTMALSLIK AGIFPDFLCD EGKHSFTYSV YVHPGDSLRD VVKESEDLNR SFIVHRGVLN LPSPLLEISP QNFRLTSLRR VNGKIVLRLV EIFGTSGKLS IKTPWNGEIY QTNVLEEKKQ KVTFPVVYHP FKIYTFVVEG
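The sequences above are displayed in space-separated blocks of ten characters. The following sketch, with hypothetical helper names, shows one way such a field might be normalized before analysis, along with two quick checks: the GC fraction and whether the first codon is a start codon. The start codon set lists only ATG, GTG, and TTG, which are among the initiation codons of translation table 11. It is a deliberate simplification, not the full table.

```python
# Subset of table 11 initiation codons, used here for illustration only.
COMMON_TABLE11_STARTS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_common_start(seq: str) -> bool:
    """True if the first codon is one of the start codons listed above."""
    return seq[:3] in COMMON_TABLE11_STARTS


# First two blocks of the gene sequence in this record.
sample = clean_sequence("GTGAAAGGCC GATTGTGGAG")
print(len(sample))               # 20
print(gc_fraction(sample))       # 0.55
print(has_common_start(sample))  # True
```

The gene starts with GTG rather than ATG. An initiating GTG is translated as methionine, which is why the protein sequence begins with M.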
|