Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1549 |
Symbol | |
ID | 6092997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1557661 |
End bp | 1560723 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642488749 |
Product | endo-1,4-beta-xylanase |
Protein accession | YP_001739568 |
Protein GI | 170289330 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0176143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTCTC TTCTGAGATT GTGTCTTTTC CTTGTTCTTG TGAGCGCTCT TGTCACTGGA GGTGAAATCA TGAACTTTGA AAGGGAGGAA GAAGGGGTTT CACCCTTTGG AAGGGCAGTT GTTACACTCT CACAGGACGT TTCCTTCAGA GGAGTTTACT CCCTGAAAGT GGATGGCAGA ACTTCACTCT GGGATGGAGT GGAGTTTGAC CTCACTGGAA AGGTTTCACC GGGGAAAGAG TACAGGGTTT TCTTTTATGT GTACCAGACG TCGAACACAC CTCAACTCTT CAGCGTTCTT TCCAGAGTTG TGGATGAAAG TGGAGAAAGG TACGAGATAC TCCTCGATAA AGTCGTCACA CCGGATGTCT GGAAGAAGAT GGAACTGATC TTCACGTCAC CTCAGAGAGC AGAAAAATTC TCGTTGATCG TTGCCTCACC TGAAAAAACG AACTTTCCGT TCTACATCGA TGAACTTCAA CTCTCCAGCC CGGATGAAGT TCAGGCACCT CCACCGGTTC TTCACTGCTC CTTCGAGAGT GAAACAGCCG AAGGCTGGAT ACCACGAGGA AATGCAAAAC TTCAGGTCAC TTCCAGGGTT TCTCACACCG GTAGAAACGC CCTTCTCATC TCTGAAAGGT CTGCCAGCTG GGAGGGAGCA CAGTTCGATC TGAAAAGCAT CGTAAAACCA GGAAAGACCT ACACCTTCGA GATGTGGGTG TATCAGGATT CTGGAAGCCC TGTCGGCATC CTCATGAGGA TGACACGAAA GTTCAAAGAC GAAATCACAA CGAAGCACCC CATCTGGCTG TATGGAAGGA CGGTTCCGTC AGGAAAGTGG GTGAGGCTGT TTGGTATCTT TGGACTCCCA GAAGGAGTGG ACGTGGATCA ACTTGTTCTT TACGTCTACA CGGATGGCTC GAACACGGAC TTCTACATCG ACGATGTGAA GATCTACGAC AGACCGCTTG TGTCGTTCGA AGAGGATGTA CCTTCCCTGA AAGAGGTCTT CAAGGATCAA TTCAAAGTCG GTGCCGGGAT CTCCGAAAAA TCCATCCTCA CTCCCTTTGA CCTTGAGTTT CTCAAAAAGC ACTTCAACAG TGTCACGGAG AGGAACAACA TGAAACCGGT GAATCTGTTC GCCGGTGTTG AAAACGGCAA ACTGAAGTTT GACTTTTCAC TCGCTGATCT GTTCGTTGAC ACAGCACTTA AGAACGGGAT CTCCGTGAGA GGTCACACAC TGGTATGGCA CAATCAGACG CCCGAGTGGT TCTTCAAAGA CGAAAATGGA AACCTTCTGA GCAAAGAAGA GATGACAGAA AGGATCAGAG AATACATACA CACCGTTGTT GGACACTTCA AGGGGAAGGT CTACGCATGG GACGTTGTAA ACGAAGCGGT CGATCCAAAC CAGCCGGATG GACTGAGAAG ATCGACGTGG TATCAGATCA TGGGGCCTGA CTACATAGAA CTTGCGTTCA GGTTTGCAAG GGAAGCAGAT CCCGATGCGA AACTCTTCTA CAACGACTAC AACACCTTCG AACCCAAAAA GAGAGACATC ATCTACAATC TTGTGAAGAG TTTCAAGGAG AAAGGTCTCA TCGATGGGAT AGGCATGCAG TGTCACATCA GTCTTGCAAC GGACATCAGG CAGATCGAAG AGGCCATCAA AAAGTTCAGC ACCATCCCCG GTATAGAAAT TCACATAACC GAACTGGACA TCAGTGTTTA CAAAAGCTCC GGAGGTTACT ACGAGAGGCT TCCAAGAAAC GTGGAGGTGG AACTTGCACA CAAATATGCC CAGCTGTTCA GCATCTTCAG AAAGTACAGC AACGTGATCA CCAGCGTGAC GATGTTCGGC CTGAACGACG GAGACGCCTG GAGTAGAAGA AACAACTGGC CATTTTTGTT CGATGAGTAC TACCAGACAA AACTTGCCTT CTGGGGAGTT GTGGATCCTG AACTTCTTCC GCCACTGCCA AAGACCTCTA CCATCTCAGA GGGTGAGGCC GTGGTGGTTG GAAAGATGGA CGACTCCTAT CTGATGTCGA AGCCGATAGA GATCTACGAT GAAGAAGGGA ACGTGAAGGC AACGATCAGA GCGATATGGA AAGACAGCAC GATCTACGTG TACGGAGAGG TTCAGGATGC GACGAAGAAG CCAGCAGAGG ATGGAGTAGC GATCTTCATC AACCCGAACA ACGAAAGGAC ACCATACCTG CAGCCGGATG ACACCTACGT TGTGCTGTGG ACGAACTGGA AGAGTGAGGT CAACAGGGAA GACGTAGAGG TGAAGAAATT CGTTGGGCCT GGATTCAGAA GGTACAGTTT TGAGATGTCG ATCACGATAC CTGGTGTGGA GTACAGGAAA GACAGTTACA TAGGGTTCGA TGTAGCGGTG ATAGACGACG GGAAATGGTA CAGCTGGAGC GACACGACGA ACAGCCAGAA GACGAACACG ATGAACTACG GAACGCTGAA ACTTGAGGGA GTGATGGTGG CAACGGCGAA ATATGGAACA CCAGTCATCG ATGGAGAGAT CGACGATATT TGGAACACGA CGGAGGAGAT AGAGACGAAA TCGGTTGCGA TGGGATCACT GGAGAAGAAC GCAACGGCTA AGGTGAGGGT GCTGTGGGAC GAGGAGAATC TGTACGTTCT TGCGATAGTG AAGGATCCGG TTTTGAACAA AGACAACAGC AATCCGTGGG AGCAAGATTC GGTGGAGATC TTCATAGACG AGAACAACCA CAAGACAAGC TACTACGAAG ACGATGATGC GCAGTTCAGG GTGAACTACA TGAACGAGCA GTCGTTTGGG ACGGGAGCGA GTGCGGCGAG GTTCAAGACG GCGGTGAAGT TGATCGAGGG AGGCTACATA GTAGAGGCGG CCATCAAGTG GAAGACGATC AAGCCGAGTC CGAACACGGT GATAGGTTTC AACGTTCAGG TGAACGATGC GAACGAGAAG GGTCAGAGGG TTGGTATCAT CTCCTGGAGT GATCCAACGA ACAACAGCTG GAGAGATCCT TCAAAGTTCG GAAACCTGAA ACTCCTGAAA TGA
|
Protein sequence | MRSLLRLCLF LVLVSALVTG GEIMNFEREE EGVSPFGRAV VTLSQDVSFR GVYSLKVDGR TSLWDGVEFD LTGKVSPGKE YRVFFYVYQT SNTPQLFSVL SRVVDESGER YEILLDKVVT PDVWKKMELI FTSPQRAEKF SLIVASPEKT NFPFYIDELQ LSSPDEVQAP PPVLHCSFES ETAEGWIPRG NAKLQVTSRV SHTGRNALLI SERSASWEGA QFDLKSIVKP GKTYTFEMWV YQDSGSPVGI LMRMTRKFKD EITTKHPIWL YGRTVPSGKW VRLFGIFGLP EGVDVDQLVL YVYTDGSNTD FYIDDVKIYD RPLVSFEEDV PSLKEVFKDQ FKVGAGISEK SILTPFDLEF LKKHFNSVTE RNNMKPVNLF AGVENGKLKF DFSLADLFVD TALKNGISVR GHTLVWHNQT PEWFFKDENG NLLSKEEMTE RIREYIHTVV GHFKGKVYAW DVVNEAVDPN QPDGLRRSTW YQIMGPDYIE LAFRFAREAD PDAKLFYNDY NTFEPKKRDI IYNLVKSFKE KGLIDGIGMQ CHISLATDIR QIEEAIKKFS TIPGIEIHIT ELDISVYKSS GGYYERLPRN VEVELAHKYA QLFSIFRKYS NVITSVTMFG LNDGDAWSRR NNWPFLFDEY YQTKLAFWGV VDPELLPPLP KTSTISEGEA VVVGKMDDSY LMSKPIEIYD EEGNVKATIR AIWKDSTIYV YGEVQDATKK PAEDGVAIFI NPNNERTPYL QPDDTYVVLW TNWKSEVNRE DVEVKKFVGP GFRRYSFEMS ITIPGVEYRK DSYIGFDVAV IDDGKWYSWS DTTNSQKTNT MNYGTLKLEG VMVATAKYGT PVIDGEIDDI WNTTEEIETK SVAMGSLEKN ATAKVRVLWD EENLYVLAIV KDPVLNKDNS NPWEQDSVEI FIDENNHKTS YYEDDDAQFR VNYMNEQSFG TGASAARFKT AVKLIEGGYI VEAAIKWKTI KPSPNTVIGF NVQVNDANEK GQRVGIISWS DPTNNSWRDP SKFGNLKLLK
|
| |