Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0871 |
Symbol | |
ID | 6092301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 897714 |
End bp | 900050 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642488069 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001738906 |
Protein GI | 170288668 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCGT ACAGGGATCC TTCGCAACCC ATCGAAGTGA GAGTGAGAGA TCTTCTTTCC AGAATGACGC TGGAAGAGAA AGTAGCCCAG CTTGGGTCTG TCTGGGGTTA CGAACTGATA GACGAGAGGG GAAAGTTCAG TAGAGAAAAA GCAAAAGAAC TCCTCAAAAA TGGTATTGGT CAGGTCACGA GGCCCGGTGG ATCAACGAAC CTTGAACCTC AAGAAGCCGC GGAACTTGTG AACGAAATAC AGAGATTTCT CGTGGAAGAA ACACGTCTTG GAATTCCTGC GATGATACAC GAGGAATGTC TCACCGGTTA CATGGGACTT GGAGGAACCA ACTTCCCTCA GGCAATAGCA ATGGCGAGTA CATGGGATCC AGATCTCATA GAAAAAATGA CCACCGCCAT CAGAGAGGAT ATGAGAAAGA TAGGAGCACA TCAGGGTCTC GCACCTGTTC TGGATGTCGC AAGAGACCCG AGGTGGGGGA GAACAGAAGA GACGTTCGGA GAATCTCCCT ATCTGGTGGC GAGGATGGGA GTCTCTTACG TGAAAGGCCT CCAGGGGGAG GATATCAAAA AAGGTGTCGT TGCCACAGTG AAACACTTCG CCGGATACAG CGCTTCTGAA GGCGGAAAGA ACTGGGCACC AACGAACATT CCAGAGAGGG AATTCAAAGA GGTCTTTCTC TTTCCGTTCG AAGCGGCCGT TAAAGAAGCG AATGTGCTTT CTGTGATGAA CTCCTACAGC GAAATAGACG GTGTCCCATG TGCAGCGAAC AGGAAACTCC TCACAGACAT TCTCAGAAAA GACTGGGGAT TCGAAGGAAT CGTCGTTTCT GACTATTTCG CTGTGAAAGT TCTGGAAGAT TATCACAGAA TAGCAAGGGA TAAGTCAGAA GCCGCAAGAC TCGCACTTGA AGCGGGGATA GATGTTGAAC TTCCGAAGAC AGAATGTTAT CAATATTTGA AAGACCTTGT TGAAAAAGGC ATCATCTCCG AAGCTTTGAT CGACGAGGCA GTCGCCAGGG TGCTGAGGCT GAAGTTCATG CTTGGGCTCT TCGAAAATCC CTACGTTGAG GTGGAAAAAG CAAAGATAGA AAGCCACAGA GACATCGCGC TCGAGATAGC AAGGAAATCT ATCATACTCC TCAAAAACGA TGGGATACTG CCGCTTTCGA AAGAAAAGAA AGTGGCGTTG ATCGGACCGA ACGCGGGTGA GGTGAGAAAT CTCCTCGGAG ATTACATGTA TCTTGCACAC ATAAGGGCTC TCCTCGACAA CATAGACGAT GTCTTTGGAA ATCCTCAGAT CCCGAGAGAA AACTACGAAA GACTGAAGAA GAGCATAGAA GAACATATGA AGAGTATTCC GAGTGTTCTC GACGCCTTCA AAGAAGAAGG GATCGAATTC GAATACGCAA AGGGCTGTGA AGTGACAGGG GAAGACAGAA GCGGCTTCGA AGAGGCGATA GAAATTGCAA AGAAATCCGA CGTTGCCATC GTTGTCGTAG GAGACAAATC TGGACTCACC CTTGATTGCA CAACCGGTGA GTCCAGAGAC ATGGCAAACC TCAAGCTTCC AGGAGTCCAG GAAGAACTCG TCCTCGAAGT CGCAAAGACA GGAAAACCCG TCGTTCTTGT CCTCATCACG GGAAGACCCT ACTCACTCAA AAACGTCGTC GACAAGGTGA ACGCCATCCT TCAGGTGTGG CTTCCGGGGG AGGCGGGAGG AAGATCGATC GTTGACATCA TCTACGGAAA GGTGAATCCC TCTGGAAAAC TCCCGATCAG CTTTCCAAGA AGCGCTGGTC AGATTCCTGT CTTCCACTAC GTCAAACCCT CCGGAGGAAG GTCTCACTGG CACGGAGACT ACGTAGACGA GAGCACAAAG CCTCTTTTCC CGTTTGGGCA CGGTTTGTCT TACACGAAGT TCGAGTACAG CAACCTCCGG ATCGAGCCGA AGGAAGTGCC ACCGGCCGGT GAGGTGGTGA TAAAGGTGGA CGTGGAAAAC ACTGGAGACA GAGACGGAGA CGAGGTGGTT CAGCTTTACA TCGGTCGTGA GTTTGCGAGC GTCACAAGGC CTGTGAAAGA GCTGAAGGGC TTCAAGAGGG TTTCTTTGAA GGCGAAAGAG AAGAAGACTG TTGTGTTCAG GCTTCACATG GACGTGCTCG CCTACTACGA CAGAGACATG AAACTTGTAG TTGAACCCGG TGAGTTCAGG GTGATGGTGG GAAGCTCTTC TGAAGACATA AGACTCACAG GTTCTTTCTC CGTCGTCGGT GAAAAAAGAG AAGTGGTGGG AATGAGGAAA TTCTTCACGG AAGCCTGCGA GGAGTGA
|
Protein sequence | MEPYRDPSQP IEVRVRDLLS RMTLEEKVAQ LGSVWGYELI DERGKFSREK AKELLKNGIG QVTRPGGSTN LEPQEAAELV NEIQRFLVEE TRLGIPAMIH EECLTGYMGL GGTNFPQAIA MASTWDPDLI EKMTTAIRED MRKIGAHQGL APVLDVARDP RWGRTEETFG ESPYLVARMG VSYVKGLQGE DIKKGVVATV KHFAGYSASE GGKNWAPTNI PEREFKEVFL FPFEAAVKEA NVLSVMNSYS EIDGVPCAAN RKLLTDILRK DWGFEGIVVS DYFAVKVLED YHRIARDKSE AARLALEAGI DVELPKTECY QYLKDLVEKG IISEALIDEA VARVLRLKFM LGLFENPYVE VEKAKIESHR DIALEIARKS IILLKNDGIL PLSKEKKVAL IGPNAGEVRN LLGDYMYLAH IRALLDNIDD VFGNPQIPRE NYERLKKSIE EHMKSIPSVL DAFKEEGIEF EYAKGCEVTG EDRSGFEEAI EIAKKSDVAI VVVGDKSGLT LDCTTGESRD MANLKLPGVQ EELVLEVAKT GKPVVLVLIT GRPYSLKNVV DKVNAILQVW LPGEAGGRSI VDIIYGKVNP SGKLPISFPR SAGQIPVFHY VKPSGGRSHW HGDYVDESTK PLFPFGHGLS YTKFEYSNLR IEPKEVPPAG EVVIKVDVEN TGDRDGDEVV QLYIGREFAS VTRPVKELKG FKRVSLKAKE KKTVVFRLHM DVLAYYDRDM KLVVEPGEFR VMVGSSSEDI RLTGSFSVVG EKREVVGMRK FFTEACEE
|
| |