Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1172 |
Symbol | |
ID | 5170295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1196064 |
End bp | 1198745 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640563691 |
Product | DNA polymerase I |
Protein accession | YP_001244762 |
Protein GI | 148270302 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGGC TATTTCTCTT TGATGGAACC GCTCTGGCTT ACAGAGCGTA TTATGCGCTC GATAGATCGC TTTCCACCTC CGCCGGCATT CCCACAAACG CCACGTACGG TGTGGCGAGG ATGCTGGTGA GATTCATCAA AGACCATATC ATCGTCGGGA AAGACTACGC TGCTGTGGCT TTCGACAAAA GAGCTGCTAC CTTCAGGCAC AAGCTCCTCG AGACTTACAA GGCTCAAAGA CCAAAGACTC CGGATCTCCT GATCCAGCAA CTTCCGTACA TAAAAAGACT TGTTGAAGCG CTTGGAATGA AGGTGCTGGA GATAGAGGGA TACGAAGCAG ACGATATAAT TGCCACTCTG GCTGTGAAGG GGCTTTCGCT TTTCGACGAG ATATTCATAG TGACTGGTGA CAAAGATATG CTTCAGCTTG TGAACGAAAA GATCAAGGTG TGGCGAATCG TAAAAGGGAT CTCCGACCTG GAACTTTACG ATGCACAGAA GGTGAAGGAA AAATACGGTG TTGAACCCCA TCAGATTCCC GATCTTCTGG CTCTAACCGG AGACGAAATA GACAACATCC CCGGTGTGAC CGGGATAGGT GAAAAAACGG CCGTTCAGCT CCTCGAGAAA TACAGAGACC TCGAAGACAT CCTGAATCAT ATTCACGAAC TTCCTCAAAA GACGAGAAAA ACCATGCTTC GGGACAGAGA AAGTGCCATT CTCAGCAAAA AGCTGGCGAT TCTGGAAACA AACGTTCCCA TCGAAATAAA CTGGGAAGAA CTTCGCTACC AGGGCCACGA CAGAGAGAAA CTCCTGTCAC TCTTGAAAGA ACTGGAATTC GCATCCATCA TGAAGGAACT CCAACTGTAC GAAGAGTCCG AACCTGTTGG ATACAGAATA GTGAAAGATC CGGTGGAATT TGAAAAACTC GTAGAGAAGC TGAAAGAAAC CCCTTCGTTT GCCATAGATC TTGAAACGTC TTCCCTCGAT CCCTTCGAAT GCGACATTGC TGGTATTTCC TTGTCATTCA AACCGAAGGA AGCGTACTAC ATACCACTCC ATCACAGAAA CGCCCAGAAC CTGGATGAAA AAGAGGTCTT GAAAAAGCTA AAAGAAATCC TGGAGGATCC TGGAGCAAAG ATCGTTGGTC AGAATCTGAA ATTCGACTAC AAGGTTTTGA TGGTAAAGGG TATAGAACCT GTCCCTCCTC ACTTCGACAC GATGATAGCG GCTTACCTTA TTGAACCGAA CGAAAAAAAA TTCAATCTGG ACGATCTCGC ACTTAAATTT CTCGGTTACA AAATGACTTC CTACCAGGAA CTCATGTCTT TCTCCTCACC GCTGTTTGGT TTCAGTTTTG TCGATGTTCC TCTGGAAAAA GCAGCGAACT ATTCCTGTGA AGATGCAGAC ATCACCTACA GGCTCTACAA GACCCTGAGC TTAAAACTCC ACGAGGCAGA TCTGGAGAAC GTGTTCTACA AGATAGAAAT GCCTCTTGTA AGCGTGCTTG CACGGATGGA ACTGAACGGT GTGTACGTGG ACACAGAGTT CCTGAAGAAA CTCTCAGAAG AGTACGGAAA AAAACTCGAA GAACTGGCAG AGGAAATATA CAGGATAGCT GGAGAACCGT TCAACATAAA CTCACCGAAA CAGGTTTCAA GGATCCTCTT TGAAAAACTC GGCATAAAAC CACGTGGTAA AACGACGAAA ACGGGAGACT ACTCAACACG CATAGAAGTC CTCGAGGAAC TTGCCGGTGA ACACGAAATC ATTCCTCTGA TTCTCGAATA CAGAAAGATA CAGAAATTGA AATCAACCTA CATAGACGCC CTCCCCAAGA TGGTCAACCC AAAGACCGGA AGAATTCATG CTTCTTTCAA TCAAACGGGG ACTGCCACTG GAAGACTCAG CAGCAGCGAT CCCAACCTTC AGAACCTCCC GACGAAAAGC GAAGAGGGAA AAGAAATCAG GAAAGCGATA GTTCCTCAGG ATCCAAACTG GTGGATCGTC AGCGCCGACT ACTCCCAGAT AGAACTGAGG ATTCTTGCCC ATCTCAGTGG TGATGAGAAT CTTTTGAGGG CATTCGAAGA GGGCATCGAC GTCCACACTC TAACAGCTTC CAGAATATTC AACGTGAAAC CCGAAGAGGT AACTGAAGAG ATGCGCCGCG CCGGTAAAAT GGTGAATTTT TCCATCATAT ACGGAGTAAC ACCTTATGGT CTGTCTGTGA GGCTTGGGGT ACCTGTGAAA GAAGCAGAAA AGATGATCGT CAACTACTTC GTCCTCTACC CAAAGGTGCG CGATTACATT CAGAGGGTCG TATCGGAAGC GAAAGAAAAA GGCTATGTTA GAACGCTGTT TGGAAGAAAA AGAGACATAC CACAGCTCAT GGCCCGGGAC AGGAACACAC AGGCTGAAGG AGAACGAATT GCCATAAACA CTCCCATACA GGGTACAGCA GCGGATATAA TAAAGCTGGC TATGATAGAA ATAGACAGGG AACTGAAAGA AAGAAAAATG AGATCGAAGA TGATCATACA GGTCCACGAC GAACTGGTTT TTGAAGTGCC CAATGAGGAA AAGGACGCGC TCGTCGAGCT GGTGAAAGAC AGAATGACGA ATGTGGTAAA GCTTTCAGTG CCGCTCGAAG TGGATGTAAC CATCGGCAAA ACATGGTCGT GA
|
Protein sequence | MARLFLFDGT ALAYRAYYAL DRSLSTSAGI PTNATYGVAR MLVRFIKDHI IVGKDYAAVA FDKRAATFRH KLLETYKAQR PKTPDLLIQQ LPYIKRLVEA LGMKVLEIEG YEADDIIATL AVKGLSLFDE IFIVTGDKDM LQLVNEKIKV WRIVKGISDL ELYDAQKVKE KYGVEPHQIP DLLALTGDEI DNIPGVTGIG EKTAVQLLEK YRDLEDILNH IHELPQKTRK TMLRDRESAI LSKKLAILET NVPIEINWEE LRYQGHDREK LLSLLKELEF ASIMKELQLY EESEPVGYRI VKDPVEFEKL VEKLKETPSF AIDLETSSLD PFECDIAGIS LSFKPKEAYY IPLHHRNAQN LDEKEVLKKL KEILEDPGAK IVGQNLKFDY KVLMVKGIEP VPPHFDTMIA AYLIEPNEKK FNLDDLALKF LGYKMTSYQE LMSFSSPLFG FSFVDVPLEK AANYSCEDAD ITYRLYKTLS LKLHEADLEN VFYKIEMPLV SVLARMELNG VYVDTEFLKK LSEEYGKKLE ELAEEIYRIA GEPFNINSPK QVSRILFEKL GIKPRGKTTK TGDYSTRIEV LEELAGEHEI IPLILEYRKI QKLKSTYIDA LPKMVNPKTG RIHASFNQTG TATGRLSSSD PNLQNLPTKS EEGKEIRKAI VPQDPNWWIV SADYSQIELR ILAHLSGDEN LLRAFEEGID VHTLTASRIF NVKPEEVTEE MRRAGKMVNF SIIYGVTPYG LSVRLGVPVK EAEKMIVNYF VLYPKVRDYI QRVVSEAKEK GYVRTLFGRK RDIPQLMARD RNTQAEGERI AINTPIQGTA ADIIKLAMIE IDRELKERKM RSKMIIQVHD ELVFEVPNEE KDALVELVKD RMTNVVKLSV PLEVDVTIGK TWS
|
| |