Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_1917 |
Symbol | |
ID | 5297361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | - |
Start bp | 1899984 |
End bp | 1902656 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640770183 |
Product | DNA polymerase I |
Protein accession | YP_001307134 |
Protein GI | 150021780 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.100077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGAAC TATTTCTTTT TGATGGTACG GGATTGGTAT ATAGAGCATT TTATGCAATA GATCAATTCT TAAAAACTTC TACTGGAATG CATACAAATG CATTGTATGG AATTGCAAAG ATGTTAATAA AGTTTTTAAA AGAGCATGTT AATATGGAAA AAGATGCTTG TGCTTTTATT TTGGATTCCA AAGGTGGAAG TAAAAAAAGA AAGGAGATTT TAAAAGATTA TAAATCAAAT AGACCAGAAA CACCAGATCT TATATTAGAG CAACTGCCAT ACATTGAAGA ATTTGTTGAT GCATTTGGTG TGAAGGTTTT AAAGTTATTG GGATATGAGG CAGATGATAT AATTGCAACT ATTGCAAAAA GGTTCTGCAA TGCCTTTGAG AAGGTGAATA TAATCACTGG TGATAAAGAC TTATTGCAAC TTGTTGATGA AAAGGTTTAT GTTTGGCGAA TAGAAAGGGG TATTACTGAG TTAGTATTGT ACGATAGAAA AAAGGTTTTT GAAAAATATG GAGTTTTTCC CGAGCAATTT GGTGATTATC TTTCACTTGT GGGTGATCAA ATCGATAACA TTCCTGGAGT TAAAGGTATT GGGAAAAAGA CAGCTGTTTC GCTACTTAAA AAGTACGGAA CAATTGATGA GGTTTTGAAA AATAAAAAAT TACTTACAGA AAAGTTGCAA AAATTATTAG AAAATGCAAC CGAGAGCTTA GAAAAAAGTA GACAATTAGT TCAACTTATA TACGATGTTC CTTTAGACGT AAACATCGAA GATTTAATAT ACAAAGGTTA TGATTCAAAA AAGCTTTTAG TTGTGTTGAA AAAATATGAA TTTTCTTCTA TTATAAAAGA ATTAGGTTTA AAAGAAGAGT TTGAAAAAAA GTATACAATA GTCAATTCTG AGAAGGAATT ATCTAAGTTG AGAAAAAGAA TTGATGAAGT TAAAACTTTT TCCATAGATA CGGAAACAAC CTCACTTGAT CCATTTAGTG CAAAACTTGT AGGTGTTTCT ATTTCTACTA ATGAGGGTGA AGCGTATTAT ATTCCCATTT CACATGTAAG TGAGAATAAT TTAACTAAAG AAATAGTTTT AAAATTTTTA AAGGAAATTT TGGAATGTGA AAGATATAAT ATAGTTGGAC AGAATTTAAA ATTTGATTAT AAGGTATTTA TGGTAAATGG AATTGAACCA CAAATTCCAC ACTTTGATAC AATGGTTGCA GCATATCTTA TCAATCCGGA AGAGAGAAGG TACAACCTTG AAGAGTTAGC TTTAAAGTAT CTGGGGTATA AAATGATAAG TTTTGAGGAG TTAGTAGACA ACAATATGCC ATTGTTTGGT AACGATTTTT CTTTTATCTC AATAGAAAAA GCTGCGGAGT ATTCTTGTGA GGATGTAGAT ATTACATTCA GACTTTATAG CTATTTGAGT AAGTACATAG GTGAAATGAA AGAACTCTTT TACAATATAG AGATGCCATT AATAAATGTT TTGGCTCAGA TGGAGTTGAA CGGGGTTTAT TTTGATGTTG ATTATTTAAA AGAGTTATCT AAAAGATATG AAGAAGAAAT GAAAAAGTTG GAAGAAAAAA TATTTGAAAT TTCTGGTGAA CAGTTTAATA TAAATTCTTC AAAACAAGTT GCGGAAATAC TTTTTGAGAA ATTGAAACTT CCTATAGTTA AAAAAACAGC AACTGGTAGA AATTCAACAA ATGCGGAAGT TTTAGAGGAA CTTGCTAAAG ATTATGAAAT CGCGCGATTA ATCCTGGAAT ACAGAAAATT TCAAAAGTTA AAAAGTACGT ACGTAGATTC TATACCTAGT AGTGTTAATA TCACAACAAA TAGAGTACAT TCTAGTTTTC ATCAAACAGG TACTTCGACT GGGAGGTTGA GTAGTTCAGC TCCAAATCTT CAAAATTTGC CTACTAGGAG TGAAGAAGGG AAAGAAATTA GACATGCTGT TAAACCTCAA TTTGAAAATT GGTATATTGT TGGAGCAGAT TATTCTCAAA TAGAATTGAG AGTGTTAGCA CATATGAGCG AGGATGAAAA ATTGTTAGAT GCATTTGAAA ATGATTATGA TATACATACA ATTACTGCTT CGAAGATATT TAATGTATCT GAATTAATGG TTACTGAAGA TATGAGAAGA ATAGGTAAAA TGATTAATTT TGCAATAATT TATGGTATTT CGCCATATGG TCTTTCAAGA CGAATAGGAT TAAATGTGAA TGAAACAAAA AAGATAATAG ATAATTATTT TAAATATTAT CAAGGTGTAT TTGAATTTAT TAAGAAAACA ATAGATTTTG CTAAAAAAAA TGGTTTTGTA AAAACCTTGT TTGGAAGGAA GAGGTTTATT CCACAGCTTA AGTTAAAAAA TAAAAATTTA ATTCAAGAGG GAGAGAGAAT TGCTATAAAC ACTCCTGTAC AAGGAACTGC CGCTGACATT ATTAAAATTG CAATGGTAAA GGTTCATAAT GAGTTAAAAA GGAATAGTTT GAAAACAAAA CTGATTTTAC AAGTTCATGA TGAACTGGTA TTTGAGGTAC CTTTTGACGA ATTACAGATT GTAAAGGAGA TAATAAAAGA TAAAATGGAG AATGCTGTTA AGCTAAAAGT TCCATTAAAA GTGGATTTAT ATGAAGGAAG AGAGTGGGAG TGA
|
Protein sequence | MRELFLFDGT GLVYRAFYAI DQFLKTSTGM HTNALYGIAK MLIKFLKEHV NMEKDACAFI LDSKGGSKKR KEILKDYKSN RPETPDLILE QLPYIEEFVD AFGVKVLKLL GYEADDIIAT IAKRFCNAFE KVNIITGDKD LLQLVDEKVY VWRIERGITE LVLYDRKKVF EKYGVFPEQF GDYLSLVGDQ IDNIPGVKGI GKKTAVSLLK KYGTIDEVLK NKKLLTEKLQ KLLENATESL EKSRQLVQLI YDVPLDVNIE DLIYKGYDSK KLLVVLKKYE FSSIIKELGL KEEFEKKYTI VNSEKELSKL RKRIDEVKTF SIDTETTSLD PFSAKLVGVS ISTNEGEAYY IPISHVSENN LTKEIVLKFL KEILECERYN IVGQNLKFDY KVFMVNGIEP QIPHFDTMVA AYLINPEERR YNLEELALKY LGYKMISFEE LVDNNMPLFG NDFSFISIEK AAEYSCEDVD ITFRLYSYLS KYIGEMKELF YNIEMPLINV LAQMELNGVY FDVDYLKELS KRYEEEMKKL EEKIFEISGE QFNINSSKQV AEILFEKLKL PIVKKTATGR NSTNAEVLEE LAKDYEIARL ILEYRKFQKL KSTYVDSIPS SVNITTNRVH SSFHQTGTST GRLSSSAPNL QNLPTRSEEG KEIRHAVKPQ FENWYIVGAD YSQIELRVLA HMSEDEKLLD AFENDYDIHT ITASKIFNVS ELMVTEDMRR IGKMINFAII YGISPYGLSR RIGLNVNETK KIIDNYFKYY QGVFEFIKKT IDFAKKNGFV KTLFGRKRFI PQLKLKNKNL IQEGERIAIN TPVQGTAADI IKIAMVKVHN ELKRNSLKTK LILQVHDELV FEVPFDELQI VKEIIKDKME NAVKLKVPLK VDLYEGREWE
|
| |