Gene Information |
Locus tag | TRQ2_1051 |
Symbol | |
ID | 6092483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1095388 |
End bp | 1097004 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642488246 |
Product | TPR repeat-containing protein |
Protein accession | YP_001739081 |
Protein GI | 170288843 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGGTG GAAATATAAA GGCAATCGTC TATCTTCCGC TGGCTCCAGA AAAAGCGAAA CAGAATAATC TTCCAGTGAA ACTCCCTGTC CTCGCTGAAG ATCTCCCGAA GGTACTGGAA GAAGATAGGA TACCCCTCGA TGTCATATTG AGGGGTCTGG AAGCCCAATA CGAGATCACA AAGGATGAAT ACTACAGATC CTATTATGTG TTCTTCCTCT ACGAGAAATT CAAACAGCTC CTGCGCGAAG GAAAGCTCGA CGAAGCCGAG AAAATTCTGG AAAAAGCCAA GGAAGTTCAG TACGACTACC GCTATCATTT CTACCGTGGA CTTCTCCTGA AACACAGAGG AGAGCTTGGA GAAGCAGAAA TCGAAATGAG GCTCGCCATT TCGATGAACG ACCGTTTCGC ACCAGCATAT TTTGAACTCG CAGGCATACT CAAAGAGAAA AATGAGATCG AAGACAGTCT TCTCTTCTAC GAAAAAGCCT ACGAAGTCAA CAAAGAGTTT CTCTTACCAC TCCTCAAGAA AGGAGATCTC CTTCTGGAAG AAGGAAGACT CGAAGAAGCG ATCGAAGAAT ACAGGAGAAT CCTTGAGAAA GACTCCAATT TTGTTGAAGT TTACGAAAGA CTGGGAGTTA TCTACAATCA GCTCCAACGA TTCAAAGAAG CAGAGAAGTT CTTCAAAAAG GCTCTGGAAA TAGAACGAAA AGATCATGTG GAGTACAATC TCTCTTACAC TCTCATAAAA CTCGGAAAGC TCTTCGAAGC TCTCGAAATC TTAAAGAGAC TTTATGAAAA AAACCCTGAT GATCCCATGG TAGCCAACGA ATACGGGCTC CTCCTCAAAA CACTCGGGCT GTACGAAGAA GCCCTCGAGG TCTTTGAGGA TGCTTACCGC AGGCATAAAG AAGAGGAGAT TCTGAAGTAC AACTATGGAA CGATCCTGCT TCACTTTGAG AAGGAGAAAG CCATATCCAT CCTCTCGGAA ATCTCAGGTG AGCTGAAAGA TAGAGCCGAA TTCATGATCT CACTCGCAGA AAAGGATGTG GTGATCCCCT CTTACGAGGA ATTCGAATGG CTGAAAGATT ACTTTTTCGA GGGCACCATC GATGTTGTTG CCCTTTCGGA AGAAATCGAT TCAGAAGACG AGGATGTGAA AAGGCGAATA GAGAAACTCA GAGAAGGGGA ATTTCCGTTT TACGATACAA CACTGGACTT TTCAGAAATG CTCGAAGTGA TCCTCGGTAT CATGTTCGAA TCCCCCGATA TCTTCAAAAT GGAAGAAAAC GCCGTGAAAT TCGTCTCCGC GTTCTACGGT AGCTCTGTGA TGATCGCCTC CACAATCGTT TTGACGAGAA CTTTCCAGTA TTTTCTCGCT GAAGAAGAAC CTACCATGGA GGAACTTTTG AGAGAACTGG TCGCGGAGAC CCAGGATGTG AACTGGAAAT TTTCTCTGAG GCTCGCCAGA TTTAGGCACG CAGACAGGTT TGATTTCAAC AGGCTCTCGG ATCTTGTAAT CGCGTTTCTG CAATCCATAG AGCAGGGAAC ACCCGTATCG GATGACGAAA GGTTAAAATA TCTACTGGAG AAACTGACTT CTCAGAAGGA GGGATAG
|
Protein sequence | MRGGNIKAIV YLPLAPEKAK QNNLPVKLPV LAEDLPKVLE EDRIPLDVIL RGLEAQYEIT KDEYYRSYYV FFLYEKFKQL LREGKLDEAE KILEKAKEVQ YDYRYHFYRG LLLKHRGELG EAEIEMRLAI SMNDRFAPAY FELAGILKEK NEIEDSLLFY EKAYEVNKEF LLPLLKKGDL LLEEGRLEEA IEEYRRILEK DSNFVEVYER LGVIYNQLQR FKEAEKFFKK ALEIERKDHV EYNLSYTLIK LGKLFEALEI LKRLYEKNPD DPMVANEYGL LLKTLGLYEE ALEVFEDAYR RHKEEEILKY NYGTILLHFE KEKAISILSE ISGELKDRAE FMISLAEKDV VIPSYEEFEW LKDYFFEGTI DVVALSEEID SEDEDVKRRI EKLREGEFPF YDTTLDFSEM LEVILGIMFE SPDIFKMEEN AVKFVSAFYG SSVMIASTIV LTRTFQYFLA EEEPTMEELL RELVAETQDV NWKFSLRLAR FRHADRFDFN RLSDLVIAFL QSIEQGTPVS DDERLKYLLE KLTSQKEG
|
| |
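The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only, not part of the record. It assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS is complete and ends in a single stop codon, which is consistent with the record (not spliced, not a pseudogene, sequence ends in TAG). The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 1095388
end_bp = 1097004
gene_length_bp = 1617
protein_length_aa = 538


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete, unspliced CDS.

    Assumes the CDS includes exactly one terminal stop codon,
    which is counted in the nucleotide length but not in the protein.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("gene length matches record:", computed_length == gene_length_bp)
print("protein length matches record:", computed_protein == protein_length_aa)
```

Under those assumptions, both checks agree with the table: 1097004 - 1095388 + 1 = 1617 bp, and 1617 / 3 = 539 codons, one of which is the stop codon, leaving 538 residues.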