Gene Msil_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1391 
Symbol 
ID7091729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1502422 
End bp1505463 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content64% 
IMG OID643464729 
ProductDNA polymerase I 
Protein accessionYP_002361718 
Protein GI217977571 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.413806 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCC CAGCCTCCCA AACGCCTGTC CAGTCCGGCG ACCACGTCTT TCTCGTCGAC 
GGGTCGTCTT TCGTGTTCCG CGCCTATTTC CAGTCGATCC GGCAGGACGC CAAATATAAT
TACCGCTCCG ACGGCCTGCC GACGGGGGCG GTGCGGCTGT TCTGCACCAA AATCTTCCAG
TTCGTGCGCG AGGGCGCGGC CGGCGTGAAG CCGACCCATC TCGCCATTAT TTTCGATAAA
TCCGAGAATT CGTTCCGGAA GGAAATCTAC CCGCCCTACA AGGGCAACCG TTCCGAGCCG
CCTGAGGATC TGATCCCGCA GTTTCCGCTT ATGCGCGCCG CGGTGCGCGC GTTCGGCCTC
CTTCCCGTCG AGCAGGACCG CTATGAGGCC GACGATCTCA TCGCCACCTA TGCGCGGCAA
GCGCGCGAGC GCGGCGCCGA CGTCACGATT ATCTCGGCCG ACAAGGATCT GATGCAGCTG
ATCGGGCCCG GCGTCTCGAT GTATGATCCA GCGTCCGGCG AGGCCGGCGC GAAAGGCTCG
CGAGAGGAGC GCCGCATTGG CGTCGACGAG GTTTTGGCTT ATTTCGGCGT GCCGCCCGAA
AAAGTCGTCG ACGTGCAAGC GCTGGCCGGC GATTCGACCG ATAATGTGCC CGGGGCGCGC
GGCATCGGCC TCAAAACCGC CGCGCAGCTT ATCGGCGAAT ATGGCGATCT CGACACGCTG
CTGGCGCGCG CCGGCGAGAT CAAGCAGCCG AAGCGGCGCG AGATTCTGAC CGACGCGGAT
AGCGTCGCGC TGATCCGCAC ATCGCGAAAG CTCGTGGAGC TCGTCTGCGA CGTCGAGGTC
GAAACGCCGC TGGACGATCT GCGCCTCGCC GCGCCGGAGG GCAAGACCCT CGTCGCCTTC
TGCAAGGCGC TTGAATTTAC AACGCTCACC AGGCGTGTGG CCGAGGCTTG CGCCGTCGAG
CCGGCGCTGA TCGAGCCGGA CCCGGATTTC GCCGGCGAAG CCGGATGGCG CGGCCGCAAC
GGGGCGCCCC CGCCCGTTTC CACCGCGGCC GAAACGCCAT CCGCCGAAAA TCTGCCGCAA
GGGCCGAAGC GCAATGCGCG CTATGGCGCC GAGGCGCCGC AACAGACCAT CGCGACGGGC
CCCGGCGAAC TTGCCGCTGC GCGGGCGGCG CAAGCGCGTG CGGAAAAATT CGATCTCGGG
GCGTATGAGA CGATTGTGAG CTTCGAGCGG CTCGAGGCCT ATATCGCGGA AGCGATCGAA
TCCGGCATTA TCGCGATCGA CACGGAAACC TCCTCGCTCG ATCCGATGCA GGCCGAGCTC
GTCGGCCTGT CGCTTTGCCT CGCCCCCGGC CGCGCCGCCT ATGTTCCGCT GCGCCACCGA
GGCGAGGGCG CGGGCGATCT CTTTGGCGGC GCTGATCTCG TCCCCGGCCA GCTCGACTCC
GACGAAACGC TGGCGCGGCT GAAGCCGATG CTGGAGGCGC CGGACGTTCT CAAAATCGCG
CAGAACGCCA AATTCGACCA GCTTGTACTG GCGCAGCGCG GCATAAGGCT CGCTCCCGTC
GACGACACGC TGCTGCTGTC TTATGTGCTC GACGCCGGCC GCACCGATCA CGGCATGGAC
GTGCTCGCCG AAAAATACTT CGGCCATAGA CCGATTCAGT TCGGCGCCGT CGCAGGCTCG
GGCCGGACGT TCATCGGCTT CGCCCGCGTC GCGCTCGACA AGGCGACGGA ATATTCCGCC
GAAGACGCCG ACGTCACCTT GCGGCTCTGG CGGGTTTTGA AGCCGCGCCT TGCGGCCGAG
CGCATGAGCG CCGTCTATGA GACGCTGGAG CGGCCGATGG TCGAAACTCT CGCGCGCATG
GAGCGGCGCG GCGTCTCGAT CGACCGGGCG ATTTTATCGA GGCTCTCCGG CGAATTCGCA
CAGGATATGG CGCGGCTGGA GGCGGTTATT TTCGAGCTCG CCGGCGAAAG CTTCAATCTC
GGCTCGCCGA AGCAATTGGG CGACATATTA TTCGGCAAGA TGGGCCTTGC GGGAGCGCGC
AAAACCGCGA CCGGCGCCTG GTCGACGGCG GCCGGCGTGC TGGACGACCT CGCCGAACAG
GGCGTCCCGC TCGCCGCCCG CATTCTCGAC TGGCGCCAGC TGTCGAAACT GAAATCGACC
TACACCGACG CCCTGCCCTC CTATGTCAAT CCTGAGACCG GCCGCGTGCA TACCTCCTAT
GCGCTCGCGG CGACGACAAC AGGACGGTTG TCCTCGTCGG AGCCCAATCT GCAGAATATT
CCCGTGCGCA ACGAGGCGGG GCGAAAGATC CGCAAGGCTT TTATTGCGCC GCCCGGCAGG
AAGCTGATCT CGGCCGACTA CAGCCAGATT GAATTGCGCC TGCTTGCTCA TATTGCCGAC
ATTTCGCAAC TGAGAGCCGC CTTTGCCGAG AACCTCGACA TTCACGCGAT GACAGCGTCC
GAGATGTTCG GCGTGCCGGT CGAGGGCATG CCGCCCGAGG TGCGCCGCCG CGCCAAGGCG
ATCAATTTCG GCATCATCTA TGGCATTTCA GCATTCGGCC TCGCCAACCA GCTGGCCATT
CCGCGCGAGG AGGCCGGCGC CTATATCAAG CGCTATTTCG AGCGCTTTCC GGGCATCCGC
GCCTACATGG ACGCAACCAA ACAATTTGCG CGCGAGAATG GCTATGTGAC AACCATTTTT
GGCCGCAAAT GCCACTATCC GCGCATCACC GCCTCAAACC CATCGGAACG CGCCTTTAAC
GAACGCGCCG CCATCAACGC GCCGATCCAG GGCTCGGCGG CGGACATCAT CCGGCGCGCG
ATGGTCCGCA TGGACGAGGC TTTGGAAAAA GCGGGCTTGA GCGCGCAGAT GTTGTTGCAG
GTGCATGACG AGCTTGTGTT TGAAGCGCCG GACGAGGAGA TCGACGCCAC GATCGATGTG
GTGCGCAAGG TGATGGTCGA CGCGCCGCAT CCGTTCCTGC AACTCGCCGT GCCGCTGCAA
GTCGACGCCA AGGCGGCGCA GAACTGGGAC GAGGCGCATT AG
 
Protein sequence
MPTPASQTPV QSGDHVFLVD GSSFVFRAYF QSIRQDAKYN YRSDGLPTGA VRLFCTKIFQ 
FVREGAAGVK PTHLAIIFDK SENSFRKEIY PPYKGNRSEP PEDLIPQFPL MRAAVRAFGL
LPVEQDRYEA DDLIATYARQ ARERGADVTI ISADKDLMQL IGPGVSMYDP ASGEAGAKGS
REERRIGVDE VLAYFGVPPE KVVDVQALAG DSTDNVPGAR GIGLKTAAQL IGEYGDLDTL
LARAGEIKQP KRREILTDAD SVALIRTSRK LVELVCDVEV ETPLDDLRLA APEGKTLVAF
CKALEFTTLT RRVAEACAVE PALIEPDPDF AGEAGWRGRN GAPPPVSTAA ETPSAENLPQ
GPKRNARYGA EAPQQTIATG PGELAAARAA QARAEKFDLG AYETIVSFER LEAYIAEAIE
SGIIAIDTET SSLDPMQAEL VGLSLCLAPG RAAYVPLRHR GEGAGDLFGG ADLVPGQLDS
DETLARLKPM LEAPDVLKIA QNAKFDQLVL AQRGIRLAPV DDTLLLSYVL DAGRTDHGMD
VLAEKYFGHR PIQFGAVAGS GRTFIGFARV ALDKATEYSA EDADVTLRLW RVLKPRLAAE
RMSAVYETLE RPMVETLARM ERRGVSIDRA ILSRLSGEFA QDMARLEAVI FELAGESFNL
GSPKQLGDIL FGKMGLAGAR KTATGAWSTA AGVLDDLAEQ GVPLAARILD WRQLSKLKST
YTDALPSYVN PETGRVHTSY ALAATTTGRL SSSEPNLQNI PVRNEAGRKI RKAFIAPPGR
KLISADYSQI ELRLLAHIAD ISQLRAAFAE NLDIHAMTAS EMFGVPVEGM PPEVRRRAKA
INFGIIYGIS AFGLANQLAI PREEAGAYIK RYFERFPGIR AYMDATKQFA RENGYVTTIF
GRKCHYPRIT ASNPSERAFN ERAAINAPIQ GSAADIIRRA MVRMDEALEK AGLSAQMLLQ
VHDELVFEAP DEEIDATIDV VRKVMVDAPH PFLQLAVPLQ VDAKAAQNWD EAH