Gene Information |
Locus tag | TRQ2_1330 |
Symbol | |
ID | 6092772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1360051 |
End bp | 1361637 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488532 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001739357 |
Protein GI | 170289119 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
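The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon that the protein length does not. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1360051
end_bp = 1361637

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1587
print(protein_length)  # 528
```

Both results agree with the Gene Length (1587 bp) and Protein Length (528 aa) fields.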
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00276943 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGGAA AAATACTGAT ATTTCTGCAC GCGCACCTTC CATACGTTCA CCATCCTGAA TACGATCATT TTTTAGAAGA AAGGTGGCTT TTTGAGGCCA TAACAGAAAC TTACATACCG CTTTTAATGA TGTTCGATGA AATAGAAGAT TTCAGGTTGA CCATGTCGAT CACTCCTCCG CTGATGGAGA TGCTCTCCTC CAGAGACCTT CAGGAGAAGT ACGAAAGACA CATGGAAAAA CTGATCGAAC TCGCAGACAA GGAAGTGGAG AGAACTAAAA AGGAGCACCC GCTGAAGCAT AAGATGGCTA AATTCTACCG TGAACATTTT GAAAAAATTC TGAATGTATT TCGCTCTTAC GATGGAAACA TCTTGGAGGG CTTCAAAAAA TACCAGGAGA CCGGAAAGCT GGAGATAGTA ACCTGCAACG CCACACACGC GTTTTTGCCG CTCTATCAGA TGTACCCAGA GGTGGTGAAC GCTCAGATCA CAGTTGGTGT GAAGAACTAC GAAAAGCACA TGAAGAAACA CCCAAGGGGT ATTTGGCTTG CGGAATGCGG ATACTATCAG GGGCTGGATC TGTACCTTGC CCAGAACAAC GTTGAGTATT TCTTTGTAGA TTCTCATGCC TTTTGGTTCG CCGATGAACA ACCCAGATAC GGTGTCTACA GACCCATCAT GACGCCAAGT GGTGTTTTCG CCTTCGCACG AGATCCGGAG TCGAGCGAAC AGGTCTGGAG TGCAGCCGTT GGGTATCCTG GTGATCCAAG GTACAGAGAA TTCTACAGAG ATATAGGTTT CGACAGAGAG ATGGAGTACA TAAAAGATTA CATAGACCCT TCTGGAGTCA GAATAAACAC CGGAATAAAA TACCACAGGA TAACTTCGAA GAGTCTGGAC GCTTCGCAGA AAGAATATTA CGATATAGAT CTGGCCATGG AAGCGGTGGA AGAACACGCG AGGGACTTCC TTCACAAAAA GGAAAATCAG GCAAGAAGAT TGATGGACAT AATGGGTGTC GAACCAGTCA TCGTTGCTCC ATTCGACGCT GAGCTCTTCG GTCACTGGTG GTTCGAGGGT GTGTTCTTCT TGAAGAGGTT CTTTGAACTG GTGAATGAAT CAAAAGACCT GAAGCTCGTC ACCGCATCCG AAGTTATAGA CACTCTCGAA GAGGTTCAGA TCGCCACACC CGCCGACTCG AGCTGGGGTG CCGGAGGATA CTACGAAACG TGGCTCAACG GAACGAACGA CTGGATCTAC AGGCATCTCC ATGAGATGAT CGAGAGAATG ATAGATCTTT CGAAAAAGTA TTACAACAGT TCCGATCCAC TCGTGGAAAG GGTTTTGAAT CAGATGCTGA GAGAGCTATT TCTCGCACAA TCGAGCGACT GGGCTTTCAT TATGACCACA AGAACGAGTG TTCAATACGC AGAAAACAGA ACGAAGCTTC ACATAAAAAG ATTTCTGAAC CTCTACGATC AACTCGTTTC TGGAAGAATA GACGAAGAGA TGCTAAGATA CTACGAGTGG ACGGATGCAA TCTTTCCGGA GATAAACTTC AGGGTGATGG CGAGGGACGT GATTTGA
|
Protein sequence | MRGKILIFLH AHLPYVHHPE YDHFLEERWL FEAITETYIP LLMMFDEIED FRLTMSITPP LMEMLSSRDL QEKYERHMEK LIELADKEVE RTKKEHPLKH KMAKFYREHF EKILNVFRSY DGNILEGFKK YQETGKLEIV TCNATHAFLP LYQMYPEVVN AQITVGVKNY EKHMKKHPRG IWLAECGYYQ GLDLYLAQNN VEYFFVDSHA FWFADEQPRY GVYRPIMTPS GVFAFARDPE SSEQVWSAAV GYPGDPRYRE FYRDIGFDRE MEYIKDYIDP SGVRINTGIK YHRITSKSLD ASQKEYYDID LAMEAVEEHA RDFLHKKENQ ARRLMDIMGV EPVIVAPFDA ELFGHWWFEG VFFLKRFFEL VNESKDLKLV TASEVIDTLE EVQIATPADS SWGAGGYYET WLNGTNDWIY RHLHEMIERM IDLSKKYYNS SDPLVERVLN QMLRELFLAQ SSDWAFIMTT RTSVQYAENR TKLHIKRFLN LYDQLVSGRI DEEMLRYYEW TDAIFPEINF RVMARDVI
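The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC percentage, and translate a coding sequence. It makes several assumptions: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, the gene sequence is taken to be given 5' to 3' on the coding strand (it begins with ATG and ends with TGA, so no reverse complement is applied even though the Strand field is "-"), and the codon assignments used are the standard ones, which translation table 11 shares for internal codons. Alternative start codons permitted by table 11 are not handled, since this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order (standard codon
# assignments; assumed here to match translation table 11 for internal codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAGAGGAA AAATACTGAT ATTTCTGCAC"
print(translate(prefix))  # MRGKILIFLH
```

The translated prefix matches the first block of the protein sequence. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field.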