Gene Information |
Locus tag | TM1040_2764 |
Symbol | |
ID | 4077636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2910652 |
End bp | 2913459 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638008089 |
Product | DNA polymerase I |
Protein accession | YP_614758 |
Protein GI | 99082604 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
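The coordinate and length fields above are mutually consistent, which a minimal sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(2910652, 2913459)
print(length_bp, protein_length(length_bp))  # prints "2808 935"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.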
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTA GCAATTTTGG CAAAGGCTGT CACCTCCACC TGATCGATGG CTCCGCCTTC ATCTTTCGCG CCTACCACGC GCTGCCGCCG CTGACGCGCA AATCCGACGG GCTGCCCATC GGCGCGGTCT CGGGGTTCTG CAACATGCTG TTCAAACAGG TCGAGGACAA CAAGGGCCCC GATGCGCCGA CCCATGTGGC GGTGATCTTT GACCACTCCG GCAAATCGTT TCGCAACGAT ATGTACGATC AGTACAAGGC CAATCGCCCC CCCGCGCCGG AGGATCTGGT GCCGCAGTTC CCGCTGACGC GCGAGGCCAC CCGCGCCTTC AACATCGCCT GCAAGGAGAT CGAAGGCTTT GAGGCCGATG ACATCATCGC TACATTGGCC TGTCAGGCGC GCGAGGCCGG GGGCCGCGTG ACGATCATCT CGTCAGATAA GGACCTGATG CAGCTGGTCG GGGACGGGGT CGAGATGCTC GACGCGATGA AGAACAAGCG CATCGACAGC GATGGCGTGC GCGAGAAATT CGGCGTCGGC CCCGACCGGG TTGTGGATGT GCAGGCGCTT GCGGGCGACT CTGTTGACAA CGTGCCCGGC GCGCCCGGCA TCGGCATCAA GACCGCCGCG CTCCTGATCA ACGAGTTCGG CTCGCTGGAG GATCTGCTGG ATCGCGCCGA AGAGATCAAG CAGCCCAAGC GTCGTCAAAC CCTGATCGAG AAGCGCGACC AGATCGAGAT GTCGAAACGG CTGGTGCAGC TGGATTGCGA CATGGAGCTG GACTTCACGC TCGACGATCT GGAGGTGCGC GATCCGGATG CAGATACGCT CTTGGGCTTC CTCGCCGAGA TGGAATTCCG CACCCTGTCC AAGCGAATGG CCGACCAGCT GGGCCGCGAG GCCCCGACCA TCCCCGAAGC GCCCTCCGCC GCAGCCGCCG CGCTGGAGCT GCCCGAAGCG CCGGGCTTTG ACAGTGCCGA ATATACCACC GTGCGCGACG CCGAGACCCT TCAGCAGTGG ATCGATCTCA TTCGTGAGCA TGGCTATGTC GCGGTCGATA CCGAGACCAC CGGCCTTAAC GAGATGATCG CGGATCTCGT TGGCATCAGC CTATGTGTGG TTCCGGGGCA GGCCTGCTAT GTGCCGCTCA CACATAAGAC GGGGAACTCC GACGATCTCT TTGGCTCTGA CGATCTGGCC GAGGGGCAGA TGCCGCTCAA GGACGCGCTG GAGATGCTGA AACCGGTGCT GGAGGATGAC GCCATCCTCA AGATCGGCCA GAACATGAAA TACGATGCCA AGATCTTTGC CCGTAACGGC ATCGACGTGA CCCCGATCGA CGACACCATG TTGTTGTCCT ATGCGCTTCA TGGCGGCATG CACGGACATG GGATGGATAC GCTGTCGGAG CGCTATCTCG ACCATCAGCC GATCCCGATC AAATCGCTCT TGGGGAGCGG AAAATCCGCG ATCACATTTG ACCGGGTGTC GATAGAGGAC GCCACGCCCT ATGCGGCCGA GGACGCCGAC ATCACCCTGC GCCTCTGGCA GCAGTTCAAA CCGCAGTTGC ACCAGAAACA GGTGACCACC GTCTACGAGA CCTTGGAACG CCCGCTGGTG CCGGTGCTGG CGCAGATGGA ACAACATGGC ATCAAGGTGG ACCGCGACAC GCTCAGCCGT ATGTCGAACG CGTTCTCACA AAAAATGGCC GCGCTCGAGG CAGAGATCCA CGAGCTCGCG GGCGAGACCT TCAATGTCGG CTCCCCCAAG CAGCTGGGCG AGATCCTTTT TGACAAGATG 
TCTCTGCCGG GGGGCAAGAA GGGCAAGACC GGTGCCTATG CCACCGGGGC GGACATTCTT GAGGATCTGG CCACGGAGCA TACCTTGCCC GCGCGCGTGT TGGACTGGCG GCAACTCTCC AAGCTGAAAT CCACGTACAC AGACGCGCTT CAGGAGCACA TCCATCCCGA AACCGGGCGG GTACATACGT CTTATTTGCA GACCGGCGCC AACACTGGGC GTCTGGCCTC GAGCGATCCC AACCTGCAAA ACATCCCCGT GCGCAGCGAA GAGGGACGCC GCATCCGCGA GGCCTTTGTC GCAGACGAGG GCAATGTGCT TCTGTCGCTC GACTACAGCC AGATCGAGCT GCGCATCCTG GCCCATGTTG CGGGAATCGA TGCGCTAAAA CAGGCGTTTG CCGACGGCCA CGACATCCAC GCGATGACCG CCTCCGAGGT CTTTGATGTG CCACTTGAGG AGATGACTCC GGACATTCGC CGCAAGGCCA AGGCGATCAA CTTTGGCGTG ATCTATGGGA TTTCGGGCTT TGGGCTTGCA CGCAACCTGC GTATTCCGCG CGGCGAGGCC CAGGGATTTA TCGACCGCTA TTTCGAGCGT TTCCCCGGCA TTCGTCAGTA TATGGATGAC ACGGTGAACT TCGCCAAGGA GCACGGTTAT GTGCAAACGC TCTTTGGCCG TAAGATCCAC ACGCCGGAGA TCGCAGCCAA GGGACCGCGT GCGAGCTTTG CAAAACGCGC GGCTATCAAC GCGCCCATTC AGGGCACGGC CGCAGATGTC ATCCGGCGCG CCATGGTGCG TATGCCAGAG GCCATCGCCC ACCTGCCCGC GCGCATGCTG CTGCAGGTCC ACGATGAATT GCTGTTCGAA GTGCCCGAGG ATCACGTCGA AGAGACGATT TCCGTCGCCC GCGAGATCAT GGAAGGCGCG GCTGATCCGG CAGTGCATAT GGATGTAAAA CTGGTGGTCG ACGCGGGACG CGGCCAAAAC TGGGCCGAGG CGCATTAA
|
Protein sequence | MSTSNFGKGC HLHLIDGSAF IFRAYHALPP LTRKSDGLPI GAVSGFCNML FKQVEDNKGP DAPTHVAVIF DHSGKSFRND MYDQYKANRP PAPEDLVPQF PLTREATRAF NIACKEIEGF EADDIIATLA CQAREAGGRV TIISSDKDLM QLVGDGVEML DAMKNKRIDS DGVREKFGVG PDRVVDVQAL AGDSVDNVPG APGIGIKTAA LLINEFGSLE DLLDRAEEIK QPKRRQTLIE KRDQIEMSKR LVQLDCDMEL DFTLDDLEVR DPDADTLLGF LAEMEFRTLS KRMADQLGRE APTIPEAPSA AAAALELPEA PGFDSAEYTT VRDAETLQQW IDLIREHGYV AVDTETTGLN EMIADLVGIS LCVVPGQACY VPLTHKTGNS DDLFGSDDLA EGQMPLKDAL EMLKPVLEDD AILKIGQNMK YDAKIFARNG IDVTPIDDTM LLSYALHGGM HGHGMDTLSE RYLDHQPIPI KSLLGSGKSA ITFDRVSIED ATPYAAEDAD ITLRLWQQFK PQLHQKQVTT VYETLERPLV PVLAQMEQHG IKVDRDTLSR MSNAFSQKMA ALEAEIHELA GETFNVGSPK QLGEILFDKM SLPGGKKGKT GAYATGADIL EDLATEHTLP ARVLDWRQLS KLKSTYTDAL QEHIHPETGR VHTSYLQTGA NTGRLASSDP NLQNIPVRSE EGRRIREAFV ADEGNVLLSL DYSQIELRIL AHVAGIDALK QAFADGHDIH AMTASEVFDV PLEEMTPDIR RKAKAINFGV IYGISGFGLA RNLRIPRGEA QGFIDRYFER FPGIRQYMDD TVNFAKEHGY VQTLFGRKIH TPEIAAKGPR ASFAKRAAIN APIQGTAADV IRRAMVRMPE AIAHLPARML LQVHDELLFE VPEDHVEETI SVAREIMEGA ADPAVHMDVK LVVDAGRGQN WAEAH
|
| |
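As a sanity check on the sequence fields, the sketch below translates the first ten codons of the gene sequence and defines a GC-content helper. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments (it differs from table 1 in its permitted start codons, which this sketch does not handle), and that the input contains only A, C, G and T. The names `translate` and `gc_content` are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates 10-base groups; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCACTA GCAATTTTGG CAAAGGCTGT"))  # prints "MSTSNFGKGC"
```

The translated fragment matches the first ten residues of the protein sequence field. Applying `gc_content` to the full gene sequence would give a value comparable to the GC content field.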