Gene Information |
Locus tag | M446_0678 |
Symbol | |
ID | 6129833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 785511 |
End bp | 788561 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641640997 |
Product | DNA polymerase I |
Protein accession | YP_001767672 |
Protein GI | 170739017 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
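The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 785511
END_BP = 788561


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(f"gene length: {n} bp, protein length: {protein_length(n)} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.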
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.80797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.6811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCCG ACCCCGCCCC CGAGACGAAG CCCGTCGGCC CCGGCGACCA GGTGCTCCTG GTCGACGGCT CCTCCTTCAT CTTCCGGGCC TATTTCCAGT CGATCAACCA GCCCGAGCGC TACAATTTCC GGCCCTCCGA CGGGATGCCG ACCGGCGCCG TGCGGCTGTT CTGCGCCAAG ATCGCCCAGT TCGTGCAGGA GGGTGCGGCC GGGATCCAGC CGAGCCATCT CGGCATCGTC TTCGACAAGT CGGAGGGCTC GTTCCGCAAG GAGATCTTCC CGGACTACAA GGGCCACCGG CCGGACGCGC CCGACGACCT CAAGCGGCAG ATGCCGCTGA TGCGGGAGGC GGTGCGGGCC TTCGGCCTGG AGCCGATCGA GCTCGAGCGC TACGAGGCGG ACGACCTGAT CGCCACCTAC GCGCGGCAGG CGGAGGCGAG GGGGGCGGGC GTCATCATCG TGTCCTCCGA CAAGGACCTG ATGCAGCTCG TCGGCGAGCT CGTGCGCTTC TACGACTTCG AATCCGGGCA GAAGGGCAAG CCCGGCTACC GGCCCGAGCG CAACCTCGAC GCCGCCGCCA TCGTGGAGCG CTGGGAGGGC TTGAGCCCCG CCCAGATCGG CGACGCGCTG GCGCTGATCG GGGACACTTC CGACAACGTC CCGGGCGTGC CCGGCATCGG CCTCAAGACC GCCGCCGCGC TGCTCAAGGA ATTCGGGAGC CTGGAGGCGC TGCTGGAGCG GGCGGCCGAG ATCAAGCAGC CCAAGCGCCG CGAGACGCTG CTCGCGCATC TCGACCAGGC GCGGCTGTCG CGCCGGCTCG TCACCCTGGA CGAGGCGGTG CCGGTGCCGG TGCCGCTGGA GGCCCTGCGC CTGCGCAAGC CCGACCCGGA GCGGCTCGTC GGCTTCCTGA AGGCGATGGA GTTCAACACG CTGACCCGGC GCATCGCCTC GCTGCTCCAC GTCGATCCGG AGGCGGTGCG GCCCGATCCG GCCCTGCTCC CCGGGGGCGC CGCCGCCTAC GCCAACGGGA AGGGCGGCAG CGACGCGACC CCGTTCTTCG GCGACGAGGC GACGGCGCCG CCCGCGCCCG AGACCGATCC CTTCGCGGAT CTCGACCTGC CCGACGCCCC GCCCAGGCCC CGCGGCCCCG CCGAGCCGAC CCCCGGCCAA GTCGTGGCGG CCCGCGCCGC CGAGGCGGTG AAGCCGTTCG ACACCGCCTC CTACGAGACG ATCACCCGCC TCGACCGCCT CGACGCCTGG ATCGCCGAGG CGGCGGAGGC CGGCGTGGTC GCGGTCGACA CCGAGACCAA CGCGCTCGAC GCCCACCGGG CCGACCTCGT CGGCGTCTCG CTCGCCACGG CGCCCGGCCG CGCCGCCTAC ATCCCGCTCG CCCATCGCGG CAGCGAGGAC CTGTTCGGCG AGGGGCTGCT CCCCGACCAG CTCCCCTGGG ACGCGGTGCG GGCCCGCCTG AAGCCGCTGC TGGAGGATCC GGCGGTGCTC AAGGTCGGGC AGAACGTGAA GTACGACTGG CTGGTGCTCG TCCGCCACGG CATCGAGATC CAGCCCTACG ACGACACGAT GCTGATCTCC TACGTGCTCG ATGCCGGCAA GGGCTCGCAC GGCATGGACG AGCTGGCGCG GCGGCATCTC GGCCACCAGC CGATCACCTT CGCGGACGTG ACGGGCACGG GCCGGGCCAA GGTCACCTTC GACCGCGTGC CCCTCGACAA GGCCACGGCC TACGCGGCCG AAGACGCGGA CGTGACGCTG CGCCTGTGGC GCCTGATGAA GCCGCGGCTC 
GCCGCCGAGC ACCGCGCCAC CGTCTACGAG ACCCTGGAGC GCCCCCTGGT GCCGGTGCTC GCCCGCATGG AGCGGGAGGG CATCCGGGTC GACCGCGACA TGCTGAGCCG CCTCTCGGGG GATTTCTCGC AGTCGCTGGC CCGGCTCGAG GCCGAGATCC AGGAGATGGC CGGCGAGACC TTCTCGGTCT CCTCCCCCAA GCAGATCGGC GACATCCTGT TCGGCAAGCT CGGGCTGCCC GGCGCCAAGA AGACGCCGTC GGGCCAGTGG GCCACGCCCG CGACGCTGCT GGAGGAGCTC GCCACCCAGG GACACCCGCT GCCGCGCAAG ATCCTGGAAT GGCGCCAGCT CTCGAAGCTC AAATCGACCT ACACGGACAG CCTGCAGGAG CATGCCGAAA GGGAGAGCGA CCGCGTCCAC ACCTCCTTCG CGCTCGCCGC CACCACGACC GGGCGCCTCT CCTCCTCGGA CCCGAACCTG CAGAACATTC CGATCCGCAC CGAGGAGGGG CGGCGCATCC GCCAGGCCTT CGTGGCGGAT GCCGGCCACC AGTTGATCTC GGCGGATTAC AGCCAGATCG AGCTGCGGCT GCTCGCCCAC ATGGCCGACA TCCCGCAGCT GCGCCAGGCC TTCGCGGACG GGCTCGACAT CCACGCGGCG ACCGCCTCGG CGATGTTCGG CGTCCCCTTG GGCGAGATGA CGCCGGATCT GCGGCGGCGG GCCAAGACGA TCAATTTCGG CATCATCTAT GGGATCTCGG CCTTCGGCCT CGCCGACCGG CTCGGCATCC CGCAGGGCGA GGCCTCGGCC TTCATCAAGC AGTATTTCGA GCGCTTCCCG GGCATCCGCG CCTATATCGA GGACACCAAG AAGGCCTGCC GGGACAAGGG CTACGTGACC ACCCTGTTCG GGCGCGTCTG CCACTACCCG CAGATCCGCT CCAACAACCC GCAGGAGCGG GCCTCGGTGG AGCGGCAGGC GATCAACGCG CCGATCCAGG GCACGGCCGC CGACATCATC CGCCGCGCGA TGGTGCGGAT GGAGGGGGCG CTGGCCGGGG CGGGGCTCAC CACCCGCATG CTGCTGCAGG TGCACGACGA GCTGGTCTTC GAGGCGCCGC AGGACGAGGT CGCGCGGGCG CTGCCGATCA TCGCCCGGGT GATGGAGGAG GCGCCCCAGC CGGCGGTGCG GCTCACGGTG CCGCTCGCCG TCGAGGCCAA GGCGGCGGCG AACTGGCAGG AGGCGCATTG A
|
Protein sequence | MNADPAPETK PVGPGDQVLL VDGSSFIFRA YFQSINQPER YNFRPSDGMP TGAVRLFCAK IAQFVQEGAA GIQPSHLGIV FDKSEGSFRK EIFPDYKGHR PDAPDDLKRQ MPLMREAVRA FGLEPIELER YEADDLIATY ARQAEARGAG VIIVSSDKDL MQLVGELVRF YDFESGQKGK PGYRPERNLD AAAIVERWEG LSPAQIGDAL ALIGDTSDNV PGVPGIGLKT AAALLKEFGS LEALLERAAE IKQPKRRETL LAHLDQARLS RRLVTLDEAV PVPVPLEALR LRKPDPERLV GFLKAMEFNT LTRRIASLLH VDPEAVRPDP ALLPGGAAAY ANGKGGSDAT PFFGDEATAP PAPETDPFAD LDLPDAPPRP RGPAEPTPGQ VVAARAAEAV KPFDTASYET ITRLDRLDAW IAEAAEAGVV AVDTETNALD AHRADLVGVS LATAPGRAAY IPLAHRGSED LFGEGLLPDQ LPWDAVRARL KPLLEDPAVL KVGQNVKYDW LVLVRHGIEI QPYDDTMLIS YVLDAGKGSH GMDELARRHL GHQPITFADV TGTGRAKVTF DRVPLDKATA YAAEDADVTL RLWRLMKPRL AAEHRATVYE TLERPLVPVL ARMEREGIRV DRDMLSRLSG DFSQSLARLE AEIQEMAGET FSVSSPKQIG DILFGKLGLP GAKKTPSGQW ATPATLLEEL ATQGHPLPRK ILEWRQLSKL KSTYTDSLQE HAERESDRVH TSFALAATTT GRLSSSDPNL QNIPIRTEEG RRIRQAFVAD AGHQLISADY SQIELRLLAH MADIPQLRQA FADGLDIHAA TASAMFGVPL GEMTPDLRRR AKTINFGIIY GISAFGLADR LGIPQGEASA FIKQYFERFP GIRAYIEDTK KACRDKGYVT TLFGRVCHYP QIRSNNPQER ASVERQAINA PIQGTAADII RRAMVRMEGA LAGAGLTTRM LLQVHDELVF EAPQDEVARA LPIIARVMEE APQPAVRLTV PLAVEAKAAA NWQEAH
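The GC content field can be recomputed from the gene sequence as it is displayed here, in space-separated blocks of ten bases. The sketch below is one hypothetical way to do that: `gc_fraction` is an illustrative helper, and it assumes that only the characters A, C, G and T count as bases, so spaces and line breaks are ignored.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped, so the sequence
    can be pasted in its blocked display form.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo input.
    demo = "ATGAATGCCG ACCCCGCCCC"
    print(f"GC fraction of demo input: {gc_fraction(demo):.2f}")
```

Passing the complete gene sequence to the function and rounding to a whole percentage gives a value that can be compared with the GC content field.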