Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1391 |
Symbol | |
ID | 7091729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1502422 |
End bp | 1505463 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643464729 |
Product | DNA polymerase I |
Protein accession | YP_002361718 |
Protein GI | 217977571 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.413806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCC CAGCCTCCCA AACGCCTGTC CAGTCCGGCG ACCACGTCTT TCTCGTCGAC GGGTCGTCTT TCGTGTTCCG CGCCTATTTC CAGTCGATCC GGCAGGACGC CAAATATAAT TACCGCTCCG ACGGCCTGCC GACGGGGGCG GTGCGGCTGT TCTGCACCAA AATCTTCCAG TTCGTGCGCG AGGGCGCGGC CGGCGTGAAG CCGACCCATC TCGCCATTAT TTTCGATAAA TCCGAGAATT CGTTCCGGAA GGAAATCTAC CCGCCCTACA AGGGCAACCG TTCCGAGCCG CCTGAGGATC TGATCCCGCA GTTTCCGCTT ATGCGCGCCG CGGTGCGCGC GTTCGGCCTC CTTCCCGTCG AGCAGGACCG CTATGAGGCC GACGATCTCA TCGCCACCTA TGCGCGGCAA GCGCGCGAGC GCGGCGCCGA CGTCACGATT ATCTCGGCCG ACAAGGATCT GATGCAGCTG ATCGGGCCCG GCGTCTCGAT GTATGATCCA GCGTCCGGCG AGGCCGGCGC GAAAGGCTCG CGAGAGGAGC GCCGCATTGG CGTCGACGAG GTTTTGGCTT ATTTCGGCGT GCCGCCCGAA AAAGTCGTCG ACGTGCAAGC GCTGGCCGGC GATTCGACCG ATAATGTGCC CGGGGCGCGC GGCATCGGCC TCAAAACCGC CGCGCAGCTT ATCGGCGAAT ATGGCGATCT CGACACGCTG CTGGCGCGCG CCGGCGAGAT CAAGCAGCCG AAGCGGCGCG AGATTCTGAC CGACGCGGAT AGCGTCGCGC TGATCCGCAC ATCGCGAAAG CTCGTGGAGC TCGTCTGCGA CGTCGAGGTC GAAACGCCGC TGGACGATCT GCGCCTCGCC GCGCCGGAGG GCAAGACCCT CGTCGCCTTC TGCAAGGCGC TTGAATTTAC AACGCTCACC AGGCGTGTGG CCGAGGCTTG CGCCGTCGAG CCGGCGCTGA TCGAGCCGGA CCCGGATTTC GCCGGCGAAG CCGGATGGCG CGGCCGCAAC GGGGCGCCCC CGCCCGTTTC CACCGCGGCC GAAACGCCAT CCGCCGAAAA TCTGCCGCAA GGGCCGAAGC GCAATGCGCG CTATGGCGCC GAGGCGCCGC AACAGACCAT CGCGACGGGC CCCGGCGAAC TTGCCGCTGC GCGGGCGGCG CAAGCGCGTG CGGAAAAATT CGATCTCGGG GCGTATGAGA CGATTGTGAG CTTCGAGCGG CTCGAGGCCT ATATCGCGGA AGCGATCGAA TCCGGCATTA TCGCGATCGA CACGGAAACC TCCTCGCTCG ATCCGATGCA GGCCGAGCTC GTCGGCCTGT CGCTTTGCCT CGCCCCCGGC CGCGCCGCCT ATGTTCCGCT GCGCCACCGA GGCGAGGGCG CGGGCGATCT CTTTGGCGGC GCTGATCTCG TCCCCGGCCA GCTCGACTCC GACGAAACGC TGGCGCGGCT GAAGCCGATG CTGGAGGCGC CGGACGTTCT CAAAATCGCG CAGAACGCCA AATTCGACCA GCTTGTACTG GCGCAGCGCG GCATAAGGCT CGCTCCCGTC GACGACACGC TGCTGCTGTC TTATGTGCTC GACGCCGGCC GCACCGATCA CGGCATGGAC GTGCTCGCCG AAAAATACTT CGGCCATAGA CCGATTCAGT TCGGCGCCGT CGCAGGCTCG GGCCGGACGT TCATCGGCTT CGCCCGCGTC GCGCTCGACA AGGCGACGGA ATATTCCGCC GAAGACGCCG ACGTCACCTT GCGGCTCTGG CGGGTTTTGA AGCCGCGCCT TGCGGCCGAG CGCATGAGCG CCGTCTATGA GACGCTGGAG CGGCCGATGG TCGAAACTCT CGCGCGCATG GAGCGGCGCG GCGTCTCGAT CGACCGGGCG ATTTTATCGA GGCTCTCCGG CGAATTCGCA CAGGATATGG CGCGGCTGGA GGCGGTTATT TTCGAGCTCG CCGGCGAAAG CTTCAATCTC GGCTCGCCGA AGCAATTGGG CGACATATTA TTCGGCAAGA TGGGCCTTGC GGGAGCGCGC AAAACCGCGA CCGGCGCCTG GTCGACGGCG GCCGGCGTGC TGGACGACCT CGCCGAACAG GGCGTCCCGC TCGCCGCCCG CATTCTCGAC TGGCGCCAGC TGTCGAAACT GAAATCGACC TACACCGACG CCCTGCCCTC CTATGTCAAT CCTGAGACCG GCCGCGTGCA TACCTCCTAT GCGCTCGCGG CGACGACAAC AGGACGGTTG TCCTCGTCGG AGCCCAATCT GCAGAATATT CCCGTGCGCA ACGAGGCGGG GCGAAAGATC CGCAAGGCTT TTATTGCGCC GCCCGGCAGG AAGCTGATCT CGGCCGACTA CAGCCAGATT GAATTGCGCC TGCTTGCTCA TATTGCCGAC ATTTCGCAAC TGAGAGCCGC CTTTGCCGAG AACCTCGACA TTCACGCGAT GACAGCGTCC GAGATGTTCG GCGTGCCGGT CGAGGGCATG CCGCCCGAGG TGCGCCGCCG CGCCAAGGCG ATCAATTTCG GCATCATCTA TGGCATTTCA GCATTCGGCC TCGCCAACCA GCTGGCCATT CCGCGCGAGG AGGCCGGCGC CTATATCAAG CGCTATTTCG AGCGCTTTCC GGGCATCCGC GCCTACATGG ACGCAACCAA ACAATTTGCG CGCGAGAATG GCTATGTGAC AACCATTTTT GGCCGCAAAT GCCACTATCC GCGCATCACC GCCTCAAACC CATCGGAACG CGCCTTTAAC GAACGCGCCG CCATCAACGC GCCGATCCAG GGCTCGGCGG CGGACATCAT CCGGCGCGCG ATGGTCCGCA TGGACGAGGC TTTGGAAAAA GCGGGCTTGA GCGCGCAGAT GTTGTTGCAG GTGCATGACG AGCTTGTGTT TGAAGCGCCG GACGAGGAGA TCGACGCCAC GATCGATGTG GTGCGCAAGG TGATGGTCGA CGCGCCGCAT CCGTTCCTGC AACTCGCCGT GCCGCTGCAA GTCGACGCCA AGGCGGCGCA GAACTGGGAC GAGGCGCATT AG
|
Protein sequence | MPTPASQTPV QSGDHVFLVD GSSFVFRAYF QSIRQDAKYN YRSDGLPTGA VRLFCTKIFQ FVREGAAGVK PTHLAIIFDK SENSFRKEIY PPYKGNRSEP PEDLIPQFPL MRAAVRAFGL LPVEQDRYEA DDLIATYARQ ARERGADVTI ISADKDLMQL IGPGVSMYDP ASGEAGAKGS REERRIGVDE VLAYFGVPPE KVVDVQALAG DSTDNVPGAR GIGLKTAAQL IGEYGDLDTL LARAGEIKQP KRREILTDAD SVALIRTSRK LVELVCDVEV ETPLDDLRLA APEGKTLVAF CKALEFTTLT RRVAEACAVE PALIEPDPDF AGEAGWRGRN GAPPPVSTAA ETPSAENLPQ GPKRNARYGA EAPQQTIATG PGELAAARAA QARAEKFDLG AYETIVSFER LEAYIAEAIE SGIIAIDTET SSLDPMQAEL VGLSLCLAPG RAAYVPLRHR GEGAGDLFGG ADLVPGQLDS DETLARLKPM LEAPDVLKIA QNAKFDQLVL AQRGIRLAPV DDTLLLSYVL DAGRTDHGMD VLAEKYFGHR PIQFGAVAGS GRTFIGFARV ALDKATEYSA EDADVTLRLW RVLKPRLAAE RMSAVYETLE RPMVETLARM ERRGVSIDRA ILSRLSGEFA QDMARLEAVI FELAGESFNL GSPKQLGDIL FGKMGLAGAR KTATGAWSTA AGVLDDLAEQ GVPLAARILD WRQLSKLKST YTDALPSYVN PETGRVHTSY ALAATTTGRL SSSEPNLQNI PVRNEAGRKI RKAFIAPPGR KLISADYSQI ELRLLAHIAD ISQLRAAFAE NLDIHAMTAS EMFGVPVEGM PPEVRRRAKA INFGIIYGIS AFGLANQLAI PREEAGAYIK RYFERFPGIR AYMDATKQFA RENGYVTTIF GRKCHYPRIT ASNPSERAFN ERAAINAPIQ GSAADIIRRA MVRMDEALEK AGLSAQMLLQ VHDELVFEAP DEEIDATIDV VRKVMVDAPH PFLQLAVPLQ VDAKAAQNWD EAH
|
| |