Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1283 |
Symbol | |
ID | 6092724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1309605 |
End bp | 1312286 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642488484 |
Product | DNA polymerase I |
Protein accession | YP_001739310 |
Protein GI | 170289072 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGAC TATTTCTCTT TGATGGAACT GCTCTGGCCT ACAGAGCGTA CTATGCGCTC GATAGATCGC TTTCTACTTC CACCGGCATT CCCACAAACG CCACATACGG TGTGGCGAGG ATGCTGGTGA GATTCATCAA AGACCATATC ATTGTCGGAA AAGACTACGT TGCTGTGGCT TTCGACAAAA AAGCTGCCAC CTTCAGACAC AAGCTCCTCG AGACTTACAA GGCTCAAAGA CCTAAGACGC CGGATCTCCT GATTCAGCAG CTTCCGTACA TAAAGAAGCT GGTCGAAGCA CTTGGAATGA AGGTGCTGGA GGTAGAAGGA TACGAAGCGG ACGATATAAT TGCCACTCTG GCTGTGAAGG GGCTTCCGCT CTTTGATGAA ATATTCATAG TGACCGGAGA TAAAGACATG CTTCAGCTTG TGAACGAAAA GATCAAGGTG TGGCGAATCG TAAAAGGGAT ATCCGATCTG GAACTTTACG ATGCGCAGAA GGTGAAGGAA AAATACGGTG TTGAACCCCA GCAGATCCCG GATCTTCTGG CTCTAACCGG AGATGAAATA GACAACATCC CCGGTGTAAC TGGGATAGGT GAAAAGACTG CTGTTCAGCT TCTAGAGAAG TACAAAGACC TCGAAGACAT ACTGAATCAT GTTCGCGAAC TTCCTCAAAA GGTGAGAAAA GCCCTGCTTC GAGACAGAGA AAACGCCATT CTCAGCAAAA AGCTGGCGAT TCTGGAAACA AACGTTCCCA TTGAAATAAA CTGGGAAGAA CTCCGCTACC AGGGTTACGA CAGAGAAAAA CTCTTACCAC TTTTGAAAGA ACTGGAATTC GCATCCATCA TGAAGGAACT TCAACTGTAC GAGGAATCCG AACCCGTTGG ATACAGAATA GTGAAAGACC TGGTGGAATT TGAAAAACTC ATAGAGAAAC TGAGAGAATC CCCTTCGTTC GCCATAGATC TTGAGACGTC TTCCCTCGAT CCTTTCGACT GCGACATTGT CGGTATCTCT GTGTCTTTCA AACCAAAGGA AGCGTACTAC ATACCACTCC ATCATAGAAA CGCCCAGAAC CTGGATGAAA AAGAAGTTCT GAAAAAGCTA AAAGAAATCC TGGAGGACCC CGGAGCAAAG ATCGTTGGTC AGAATTTGAA ATTCGATTAC AAGGTGTTGA TGGTAAAGGG TGTTGAACCT GTCCCTCCTC ACTTCGACAC GATGATAGCG GCTTACCTTC TTGAGCCGAA CGAAAAGAAG TTCAATCTGG ACGATCTCGC ATTGAAATTT CTTGGATACA AAATGACCTC TTACCAGGAA CTCATGTCCT TCTCTTCTCC GCTGTTTGGT TTCAGTTTTG CCGATGTTCC TGTAGAAAAA GCAGCGAACT ATTCCTGTGA AGATGCAGAC ATCACCTACA GACTCTACAA GATCCTGAGC TTAAAACTCC ACGAGGCAGA TCTGGAGAAC GTGTTCTACA AGATAGAAAT GCCTCTTGTG AGCGTGCTTG CACGGATGGA ACTGAACGGT GTGTACGTGG ACACAGAGTT CCTGAAGAAA CTCTCAGAAG AGTACGGAAA AAAACTCGAA GAACTGGCAG AGGAAATATA CAGGATAGCT GGAGAGCCGT TCAACATAAA CTCACCGAAG CAGGTTTCAA GGATCCTTTT TGAAAAACTC GGCATAAAAC CACGTGGTAA AACGACGAAA ACGGGAGACT ACTCAACACG CATAGAAGTC CTCGAGGAAC TTGCCGGTGA ACACGAAATC ATTCCTCTGA TTCTCGAATA CAGAAAGATA CAGAAATTGA AATCAACCTA CATAGACGCC CTCCCCAAGA TGGTCAACCC AAAGACCGGA AGAATTCATG CTTCTTTCAA TCAAACGGGG ACTGCCACTG GAAGACTCAG CAGCAGCGAT CCCAACCTTC AGAACCTCCC GACGAAAAGC GAAGAGGGAA AAGAAATCAG GAAAGCGATA GTTCCTCAGG ATCCAAACTG GTGGATCGTC AGCGCCGACT ACTCCCAGAT AGAACTGAGG ATCCTCGCCC ATCTCAGCGG TGATGAGAAT CTTTTGAAAG CATTCGAAGA GGGCATCGAC GTCCACACTC TAACAGCTTC CAGAATATTC AACGTGAAAC CCGAAGAAGT AACCGAAGAA ATGCGCCGCG CCGGTAAAAT GGTGAATTTT TCCATCATAT ACGGAGTAAC ACCTTACGGT CTGTCTGTGA GGCTTGGAGT ACCTGTGAAA GAAGCAGAAA AGATGATCGT CAACTACTTC GTCCTCTACC CAAAGGTGCG CGATTACATT CAGAGGGTCG TATCGGAAGC GAAAGAAAAA GGCTATGTTA GAACGCTGTT TGGAAGAAAA AGAGACATAC CACAGCTCAT GGCCAGGGAC AGAAACACAC AAGCTGAAGG AGAACGAATT GCCATAAACA CTCCCATACA GGGTACAGCA GCGGATATAA TAAAGCTGGC TATGATAGAA ATAGACAGGG AGCTGAAAGA AAGAAAAATG AGATCGAAGA TGATCATACA GGTCCACGAC GAACTGGTTT TTGAAGTGCC CGATGAGGAA AAGGACGCGC TCGTCGAGCT GGTGAAAGAC AGAATGACGA ATGTGGTAAA GCTTTCAGTG CCGCTCGAAG TGGATGTAAC CATCGGCAAA ACATGGTCGT GA
|
Protein sequence | MARLFLFDGT ALAYRAYYAL DRSLSTSTGI PTNATYGVAR MLVRFIKDHI IVGKDYVAVA FDKKAATFRH KLLETYKAQR PKTPDLLIQQ LPYIKKLVEA LGMKVLEVEG YEADDIIATL AVKGLPLFDE IFIVTGDKDM LQLVNEKIKV WRIVKGISDL ELYDAQKVKE KYGVEPQQIP DLLALTGDEI DNIPGVTGIG EKTAVQLLEK YKDLEDILNH VRELPQKVRK ALLRDRENAI LSKKLAILET NVPIEINWEE LRYQGYDREK LLPLLKELEF ASIMKELQLY EESEPVGYRI VKDLVEFEKL IEKLRESPSF AIDLETSSLD PFDCDIVGIS VSFKPKEAYY IPLHHRNAQN LDEKEVLKKL KEILEDPGAK IVGQNLKFDY KVLMVKGVEP VPPHFDTMIA AYLLEPNEKK FNLDDLALKF LGYKMTSYQE LMSFSSPLFG FSFADVPVEK AANYSCEDAD ITYRLYKILS LKLHEADLEN VFYKIEMPLV SVLARMELNG VYVDTEFLKK LSEEYGKKLE ELAEEIYRIA GEPFNINSPK QVSRILFEKL GIKPRGKTTK TGDYSTRIEV LEELAGEHEI IPLILEYRKI QKLKSTYIDA LPKMVNPKTG RIHASFNQTG TATGRLSSSD PNLQNLPTKS EEGKEIRKAI VPQDPNWWIV SADYSQIELR ILAHLSGDEN LLKAFEEGID VHTLTASRIF NVKPEEVTEE MRRAGKMVNF SIIYGVTPYG LSVRLGVPVK EAEKMIVNYF VLYPKVRDYI QRVVSEAKEK GYVRTLFGRK RDIPQLMARD RNTQAEGERI AINTPIQGTA ADIIKLAMIE IDRELKERKM RSKMIIQVHD ELVFEVPDEE KDALVELVKD RMTNVVKLSV PLEVDVTIGK TWS
|
| |