Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0672 |
Symbol | |
ID | 6092089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 687931 |
End bp | 689421 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642487858 |
Product | L-arabinose isomerase |
Protein accession | YP_001738708 |
Protein GI | 170288470 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATC TCAAGCAGTA CGAGTTCTGG TTTCTCGTTG GAAGTCAGTA TCTTTACGGT CTGGAAACCC TGAAAAAGGT GGAACAACAG GCAAGTAAGA TCGTGGATTC ACTGAACGAT GACCCCATCT TCCCTTCAAA GATCGTTCTG AAGCCGGTTT TGAAGAGTTC TTCTGAAATC ACGGAAATCT TTGAGAAAGC AAACGCAGAC CCAAAATGTG CAGGGGTTAT CGTTTGGATG CACACTTTTT CCCCATCGAA GATGTGGATA CGGGGACTCT CTATAAACAA AAAACCCCTG CTTCACCTTC ACACGCAGTA CAACAGAGAG ATTCCATGGG ATACGATCGA TATGGATTAC ATGAACCTGA ACCAGTCTGC ACACGGAGAC AGAGAGCATG GGTTCATACA TGCAAGGATG AGACTTCCAA GGAAAGTTGT GGTGGGACAC TGGGAAGAGA AAGAAGTCAG AGAAAAGATC GCAAAGTGGA TGAGAGTGGC CTGTGCGATA CAGGATGGAA GAATGGGACA GATAGTCAGG TTCGGTGACA ACATGAGAGA AGTCGCCAGC ACCGAAGGCG ACAAGGTAGA AGCACAGATA AAACTCGGCT GGTCCATAAA CACTTGGGGA GTTGGAGAAC TTGCAGAGAG AGTGAAAGCT GTTCCAGAGC GCGAAGTAGA GGAACTTCTC ACAGAATACA GAGAAAAATA CATCATGCCA GAGGATGAAT ACAGTCTCAA GGCAATAAGA GAGCAGGCGA AGATAGAAAT TGCACTGAGA GAATTTTTGA AAGAAAAGAA CGCTATTGCC TTCACCACCA CGTTCGAGGA CCTGCACGAT CTTCCACAGC TTCCCGGGCT CGCGGTCCAG AGACTCATGG AGGAAGGATA CGGTTTTGGT GCAGAAGGTG ACTGGAAAGC AGCGGGATTG GTTAGGGCCA TCAAGGTAAT GGGAACGGGT CTTCCCGGCG GAACTTCCTT TATGGAAGAC TACACTTACC ACCTCACTCC CGGGAACGAA CTCGTTTTAG GGGCTCACAT GCTCGAGGTA TGTCCAACGA TAGCAAAGGA AAAACCACGA ATAGAGGTTC ACCCTCTCAG CATCGGCGGG AAAGCAGATC CTGCCCGTCT TGTCTTCGAC GGTCAGGAAG GACCAGCCGT TAACGCATCG ATCGTTGACA TGGGTAACAG GTTCAGACTG GTTGTGAACA AGGTTTTATC CGTTCCCATA GAGAGGAAGA TGCCAAAACT CCCAACAGCC AGGGTCCTCT GGAAACCGAT GCCTGATTTC AAGAGGGCAA CTACTGCATG GATACTCGCT GGAGGATCAC ACCACACTGC CTTCTCAACA GCCATTGACA TAGAATACCT CATCGACTGG GCGGAGGCAT TGGAAATCGA ATACGTCGTC ATCGATGAGA ATTTGGATCT CGAGGACTTC AAAAAAGAAC TGAGATGGAA CGAACTCTAC TGGGGGCTTT TGAAAAGATG A
|
Protein sequence | MIDLKQYEFW FLVGSQYLYG LETLKKVEQQ ASKIVDSLND DPIFPSKIVL KPVLKSSSEI TEIFEKANAD PKCAGVIVWM HTFSPSKMWI RGLSINKKPL LHLHTQYNRE IPWDTIDMDY MNLNQSAHGD REHGFIHARM RLPRKVVVGH WEEKEVREKI AKWMRVACAI QDGRMGQIVR FGDNMREVAS TEGDKVEAQI KLGWSINTWG VGELAERVKA VPEREVEELL TEYREKYIMP EDEYSLKAIR EQAKIEIALR EFLKEKNAIA FTTTFEDLHD LPQLPGLAVQ RLMEEGYGFG AEGDWKAAGL VRAIKVMGTG LPGGTSFMED YTYHLTPGNE LVLGAHMLEV CPTIAKEKPR IEVHPLSIGG KADPARLVFD GQEGPAVNAS IVDMGNRFRL VVNKVLSVPI ERKMPKLPTA RVLWKPMPDF KRATTAWILA GGSHHTAFST AIDIEYLIDW AEALEIEYVV IDENLDLEDF KKELRWNELY WGLLKR
|
| |