Gene M446_0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0678 
Symbol 
ID6129833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp785511 
End bp788561 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content71% 
IMG OID641640997 
ProductDNA polymerase I 
Protein accessionYP_001767672 
Protein GI170739017 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.80797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.6811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCG ACCCCGCCCC CGAGACGAAG CCCGTCGGCC CCGGCGACCA GGTGCTCCTG 
GTCGACGGCT CCTCCTTCAT CTTCCGGGCC TATTTCCAGT CGATCAACCA GCCCGAGCGC
TACAATTTCC GGCCCTCCGA CGGGATGCCG ACCGGCGCCG TGCGGCTGTT CTGCGCCAAG
ATCGCCCAGT TCGTGCAGGA GGGTGCGGCC GGGATCCAGC CGAGCCATCT CGGCATCGTC
TTCGACAAGT CGGAGGGCTC GTTCCGCAAG GAGATCTTCC CGGACTACAA GGGCCACCGG
CCGGACGCGC CCGACGACCT CAAGCGGCAG ATGCCGCTGA TGCGGGAGGC GGTGCGGGCC
TTCGGCCTGG AGCCGATCGA GCTCGAGCGC TACGAGGCGG ACGACCTGAT CGCCACCTAC
GCGCGGCAGG CGGAGGCGAG GGGGGCGGGC GTCATCATCG TGTCCTCCGA CAAGGACCTG
ATGCAGCTCG TCGGCGAGCT CGTGCGCTTC TACGACTTCG AATCCGGGCA GAAGGGCAAG
CCCGGCTACC GGCCCGAGCG CAACCTCGAC GCCGCCGCCA TCGTGGAGCG CTGGGAGGGC
TTGAGCCCCG CCCAGATCGG CGACGCGCTG GCGCTGATCG GGGACACTTC CGACAACGTC
CCGGGCGTGC CCGGCATCGG CCTCAAGACC GCCGCCGCGC TGCTCAAGGA ATTCGGGAGC
CTGGAGGCGC TGCTGGAGCG GGCGGCCGAG ATCAAGCAGC CCAAGCGCCG CGAGACGCTG
CTCGCGCATC TCGACCAGGC GCGGCTGTCG CGCCGGCTCG TCACCCTGGA CGAGGCGGTG
CCGGTGCCGG TGCCGCTGGA GGCCCTGCGC CTGCGCAAGC CCGACCCGGA GCGGCTCGTC
GGCTTCCTGA AGGCGATGGA GTTCAACACG CTGACCCGGC GCATCGCCTC GCTGCTCCAC
GTCGATCCGG AGGCGGTGCG GCCCGATCCG GCCCTGCTCC CCGGGGGCGC CGCCGCCTAC
GCCAACGGGA AGGGCGGCAG CGACGCGACC CCGTTCTTCG GCGACGAGGC GACGGCGCCG
CCCGCGCCCG AGACCGATCC CTTCGCGGAT CTCGACCTGC CCGACGCCCC GCCCAGGCCC
CGCGGCCCCG CCGAGCCGAC CCCCGGCCAA GTCGTGGCGG CCCGCGCCGC CGAGGCGGTG
AAGCCGTTCG ACACCGCCTC CTACGAGACG ATCACCCGCC TCGACCGCCT CGACGCCTGG
ATCGCCGAGG CGGCGGAGGC CGGCGTGGTC GCGGTCGACA CCGAGACCAA CGCGCTCGAC
GCCCACCGGG CCGACCTCGT CGGCGTCTCG CTCGCCACGG CGCCCGGCCG CGCCGCCTAC
ATCCCGCTCG CCCATCGCGG CAGCGAGGAC CTGTTCGGCG AGGGGCTGCT CCCCGACCAG
CTCCCCTGGG ACGCGGTGCG GGCCCGCCTG AAGCCGCTGC TGGAGGATCC GGCGGTGCTC
AAGGTCGGGC AGAACGTGAA GTACGACTGG CTGGTGCTCG TCCGCCACGG CATCGAGATC
CAGCCCTACG ACGACACGAT GCTGATCTCC TACGTGCTCG ATGCCGGCAA GGGCTCGCAC
GGCATGGACG AGCTGGCGCG GCGGCATCTC GGCCACCAGC CGATCACCTT CGCGGACGTG
ACGGGCACGG GCCGGGCCAA GGTCACCTTC GACCGCGTGC CCCTCGACAA GGCCACGGCC
TACGCGGCCG AAGACGCGGA CGTGACGCTG CGCCTGTGGC GCCTGATGAA GCCGCGGCTC
GCCGCCGAGC ACCGCGCCAC CGTCTACGAG ACCCTGGAGC GCCCCCTGGT GCCGGTGCTC
GCCCGCATGG AGCGGGAGGG CATCCGGGTC GACCGCGACA TGCTGAGCCG CCTCTCGGGG
GATTTCTCGC AGTCGCTGGC CCGGCTCGAG GCCGAGATCC AGGAGATGGC CGGCGAGACC
TTCTCGGTCT CCTCCCCCAA GCAGATCGGC GACATCCTGT TCGGCAAGCT CGGGCTGCCC
GGCGCCAAGA AGACGCCGTC GGGCCAGTGG GCCACGCCCG CGACGCTGCT GGAGGAGCTC
GCCACCCAGG GACACCCGCT GCCGCGCAAG ATCCTGGAAT GGCGCCAGCT CTCGAAGCTC
AAATCGACCT ACACGGACAG CCTGCAGGAG CATGCCGAAA GGGAGAGCGA CCGCGTCCAC
ACCTCCTTCG CGCTCGCCGC CACCACGACC GGGCGCCTCT CCTCCTCGGA CCCGAACCTG
CAGAACATTC CGATCCGCAC CGAGGAGGGG CGGCGCATCC GCCAGGCCTT CGTGGCGGAT
GCCGGCCACC AGTTGATCTC GGCGGATTAC AGCCAGATCG AGCTGCGGCT GCTCGCCCAC
ATGGCCGACA TCCCGCAGCT GCGCCAGGCC TTCGCGGACG GGCTCGACAT CCACGCGGCG
ACCGCCTCGG CGATGTTCGG CGTCCCCTTG GGCGAGATGA CGCCGGATCT GCGGCGGCGG
GCCAAGACGA TCAATTTCGG CATCATCTAT GGGATCTCGG CCTTCGGCCT CGCCGACCGG
CTCGGCATCC CGCAGGGCGA GGCCTCGGCC TTCATCAAGC AGTATTTCGA GCGCTTCCCG
GGCATCCGCG CCTATATCGA GGACACCAAG AAGGCCTGCC GGGACAAGGG CTACGTGACC
ACCCTGTTCG GGCGCGTCTG CCACTACCCG CAGATCCGCT CCAACAACCC GCAGGAGCGG
GCCTCGGTGG AGCGGCAGGC GATCAACGCG CCGATCCAGG GCACGGCCGC CGACATCATC
CGCCGCGCGA TGGTGCGGAT GGAGGGGGCG CTGGCCGGGG CGGGGCTCAC CACCCGCATG
CTGCTGCAGG TGCACGACGA GCTGGTCTTC GAGGCGCCGC AGGACGAGGT CGCGCGGGCG
CTGCCGATCA TCGCCCGGGT GATGGAGGAG GCGCCCCAGC CGGCGGTGCG GCTCACGGTG
CCGCTCGCCG TCGAGGCCAA GGCGGCGGCG AACTGGCAGG AGGCGCATTG A
 
Protein sequence
MNADPAPETK PVGPGDQVLL VDGSSFIFRA YFQSINQPER YNFRPSDGMP TGAVRLFCAK 
IAQFVQEGAA GIQPSHLGIV FDKSEGSFRK EIFPDYKGHR PDAPDDLKRQ MPLMREAVRA
FGLEPIELER YEADDLIATY ARQAEARGAG VIIVSSDKDL MQLVGELVRF YDFESGQKGK
PGYRPERNLD AAAIVERWEG LSPAQIGDAL ALIGDTSDNV PGVPGIGLKT AAALLKEFGS
LEALLERAAE IKQPKRRETL LAHLDQARLS RRLVTLDEAV PVPVPLEALR LRKPDPERLV
GFLKAMEFNT LTRRIASLLH VDPEAVRPDP ALLPGGAAAY ANGKGGSDAT PFFGDEATAP
PAPETDPFAD LDLPDAPPRP RGPAEPTPGQ VVAARAAEAV KPFDTASYET ITRLDRLDAW
IAEAAEAGVV AVDTETNALD AHRADLVGVS LATAPGRAAY IPLAHRGSED LFGEGLLPDQ
LPWDAVRARL KPLLEDPAVL KVGQNVKYDW LVLVRHGIEI QPYDDTMLIS YVLDAGKGSH
GMDELARRHL GHQPITFADV TGTGRAKVTF DRVPLDKATA YAAEDADVTL RLWRLMKPRL
AAEHRATVYE TLERPLVPVL ARMEREGIRV DRDMLSRLSG DFSQSLARLE AEIQEMAGET
FSVSSPKQIG DILFGKLGLP GAKKTPSGQW ATPATLLEEL ATQGHPLPRK ILEWRQLSKL
KSTYTDSLQE HAERESDRVH TSFALAATTT GRLSSSDPNL QNIPIRTEEG RRIRQAFVAD
AGHQLISADY SQIELRLLAH MADIPQLRQA FADGLDIHAA TASAMFGVPL GEMTPDLRRR
AKTINFGIIY GISAFGLADR LGIPQGEASA FIKQYFERFP GIRAYIEDTK KACRDKGYVT
TLFGRVCHYP QIRSNNPQER ASVERQAINA PIQGTAADII RRAMVRMEGA LAGAGLTTRM
LLQVHDELVF EAPQDEVARA LPIIARVMEE APQPAVRLTV PLAVEAKAAA NWQEAH