Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_2694 |
Symbol | |
ID | 7117447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 2832590 |
End bp | 2835733 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643525442 |
Product | DNA polymerase I |
Protein accession | YP_002421461 |
Protein GI | 218530645 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAG AGACGCAACC GCAACCGGCC AAGCCCGTCT GCGCCGATGA CCGGGTGATC CTCGTTGACG GCTCGTCCTT CATCTTCCGA GCGTACTTCC AGTCGATCAA TCAGGACCAG AAGTACAACA CGCGGCCCTC CGACGGCCTG CCGACCGGTG CGGTGCGCCT GTTCTGCACC AAGATCGCGC AGTTCCTTCA GGAGGGTGCG GCGGGTGTGA AGCCGACCCA TCTTGGTATC GTCTTCGACA AATCCGAGGG CTCGTTCCGC AAGGAGCTGT TCCCCGACTA CAAGGGCCAC CGCCCCGACG CCCCGGACGA TCTCAAGCGG CAGATGCCGT TGATGCGCGA CGCCGTGCGC GCCTTCGGGC TGCATGCGGT CGAGCTGGTG CGCTACGAAG CCGACGACCT GATCGCGACC TACGCGCGGC AGGCCGAGGC GCGCGGCGCC GAGGTCATCA TCGTGTCCTC CGACAAGGAC CTGATGCAGC TCGTCTCGGA CAAGATCCGG TTCTACGATT TCGAGTCGGG CGCCAAGGGC AAGCCCGGCT ACCGCCCCGA GCGCAATCTC GACCGCGAGG CTATCATCGC CAAGTGGGAG GGCCTCGCGC CCGAGCAGAT CGGCGACGCC CTGGCACTGA TCGGCGACAC CTCCGACAAC GTGCCGGGTG TGCCCGGCAT CGGCCTGAAG ACGGCGGCCG CGCTCATCAA GGAATACGGG AGCCTGGAGC AGCTCCTGGA GCGGGCCAGC GAGATCAAGC AGCCCAAGCG CCGGGAGATG CTGCTCGCCA ACATCGATCA GGCCAAGCTC TCGCGCCGCC TCGTCGCGCT TGAGGAGAGC GTGCCGGTGC CGGTGCCGCT CGACGAACTC GGCGTGCCGC AGCCCGATCC GCAAAAGCTG GTCGGCTTCC TGAAGGCCAT GGAGTTCAAC ACCCTGACAC GGCGCATCGC GCAGATGCTC CATGTCGATC CCGAAGCGGT GAAGCCCGAT CCATCACTGC TGCCGGGTGC GCAGCCCCAC GCCTACAGCA ATGCGGCCGG CGGCAGCGAC GCCGTGCCGT TCTTCGGCGA CGCCGTGCCG GTCGATCCCG AGACCGCCGC CGCACCCGAG CGGACGGATG GGGAGGCGCC GCCCGAGGCG GGCGAGGCCG ATCCCTTCGC CGATCTCGAC CTGCCGGATC AGGCTCCGAA GAAGAAGCGC GGCTCCAACG AGCCGACGCC CGCAACCCTC GTCGCCGCCC GCGCGGCGGA GTCGGTCAAG CCGTTCGATA CGGCAGCCTA CGAAACCATC GCCACCGTGG CGCAGCTCGA GGCCTGGATC GCCGAGAGCT ACGAGGCCGG GGTGATCGCG GTCGATACCG AGACCGACGC GCTCGACGCG GCCAAGGCGG GACTCGTCGG CGTCTCGCTC GCCACCGCGC CGGGGCGGGC CGCCTATATC CCGCTCGCCC ACGTGAAGCC CGAGGTGAAG GGCGGCGACC TGTTCGGCGA GAGCGGGGCA GGGGCCGATG CGTCCGACAG GCAGCCCGGC CAGATCGATT TCGACACCGC CCTCAAGCTG CTCAAGCCGC TGCTCGAGGA TGCCGGCACG CTCAAGGTCG GCCAGAACCT GAAATACGAC CTCTCGGTGC TGCACCGCTA CGGCATCGAC GTGAGGCCCT TCGACGACAC GATGCTGATC TCCTACGTGC TCGATGCCGG CAAGGGCGGG CACGGCATGG ACGAGCTCGC CCGCCGCCAT CTCGGCCATC AGCCGATCAC CTTCGCCGAC GTCGCCGGCA CCGGCCGCAA CAAGGTCACC TTCGACCGCG TGGCGATCGA CAAGGCGACC GCCTACGCGG CGGAGGATGC CGACGTCACC TTACGCCTGT GGCGGATGAT GAAGCCGCGC CTCGTCGCCG AGCACCGGGT GACGGTCTAC GAGACGCTGG AGCGTCCGCT GGTGCCGGTG CTGGCGCGGA TGGAGCGGGC CGGCATCGCC ATCGACCGCA ACATGCTGAG CCGCCTCTCC GGCGATTTCT CGCAAATCCT GGCGCGACTG GAGGAGGAGA TTCAAGAAGA CGCGGGCGAG CGGTTCCAAG TCTCCTCGCC GAAGCAGATC GGCGACGTGC TGTTCGGCAA GATGGGCCTG CCCGGCGCGA AGAAGACGCC GTCCGGCCAG TGGGCCACGC CCGCGACGCT TCTCGAAGAG CTGGCCCAGG CCGGCCACGA CCTGCCGAAG AAGATTCTCA ACTATCGCCA GCTCTCCAAG CTGAAATCGA CCTACACCGA CTCGCTGCAG CAGCACGCCG ACCGCGGCAC GAACCGGGTC CATACCTCGT TCGCCCTCGC GGCGACGACG ACCGGCCGGC TCTCCTCCTC AGATCCGAAC CTGCAGAACA TCCCGATCCG CACGGAGGAG GGGCGGCGCA TCCGCCGCGC GTTCGTCGCG CCCGAGGGTA AGAAGCTGAT CTCGGCCGAT TACAGCCAGA TCGAGCTGCG CCTGCTCGCC CACATCGCCG ACATCCCGCA ACTGCGCGAA GCGTTCGAGC AGGGGATCGA CATCCACGCG GCCACGGCGT CGGCCATGTT CGGCGTCGCC CTCGATCAGA TGACCGGCGA TCTGCGGCGC CGGGCCAAGA CGATCAATTT CGGCATCATC TACGGCATCT CGGCCTTCGG GCTGGCCGAC CGCCTCGGCA TCGGCCGCGA GGAGGCATCG GCCTTCATCA AGCAGTATTT CGAGCGGTTT CCCGGCATCC GCGACTACAT CGACACCACC AAGCGCTCGT GCCGCGAGAA GGGCTACGTG ACGACCCTGT TCGGCCGTGT CTGCCACTAC CCGCAGATCC GCTCGAACAA CCCGTCCGAA CGGGCGAGCG TCGAGCGGCA AGCCATCAAC GCCCCGATCC AGGGCACCGC CGCCGACATC ATCCGCCGCG CCATGACGCG GATGGAGGAT GCGCTGGAGG CCAAGAAGCT CACCGCGCGG ATGCTGCTGC AGGTGCACGA CGAACTCGTA TTCGAGGTGC CCGACGACGA AGTGGAGGCG ACGATTCCCG TGATCGCTGG GGTGATGGAA GAGGCGCCCG CGCCGGCCCT GACGCTGAGG GTGCCGCTGG TGGTCGAGGC GCGGGCGGCG GGCAACTGGG AAGAGGCGCA CTGA
|
Protein sequence | MTEETQPQPA KPVCADDRVI LVDGSSFIFR AYFQSINQDQ KYNTRPSDGL PTGAVRLFCT KIAQFLQEGA AGVKPTHLGI VFDKSEGSFR KELFPDYKGH RPDAPDDLKR QMPLMRDAVR AFGLHAVELV RYEADDLIAT YARQAEARGA EVIIVSSDKD LMQLVSDKIR FYDFESGAKG KPGYRPERNL DREAIIAKWE GLAPEQIGDA LALIGDTSDN VPGVPGIGLK TAAALIKEYG SLEQLLERAS EIKQPKRREM LLANIDQAKL SRRLVALEES VPVPVPLDEL GVPQPDPQKL VGFLKAMEFN TLTRRIAQML HVDPEAVKPD PSLLPGAQPH AYSNAAGGSD AVPFFGDAVP VDPETAAAPE RTDGEAPPEA GEADPFADLD LPDQAPKKKR GSNEPTPATL VAARAAESVK PFDTAAYETI ATVAQLEAWI AESYEAGVIA VDTETDALDA AKAGLVGVSL ATAPGRAAYI PLAHVKPEVK GGDLFGESGA GADASDRQPG QIDFDTALKL LKPLLEDAGT LKVGQNLKYD LSVLHRYGID VRPFDDTMLI SYVLDAGKGG HGMDELARRH LGHQPITFAD VAGTGRNKVT FDRVAIDKAT AYAAEDADVT LRLWRMMKPR LVAEHRVTVY ETLERPLVPV LARMERAGIA IDRNMLSRLS GDFSQILARL EEEIQEDAGE RFQVSSPKQI GDVLFGKMGL PGAKKTPSGQ WATPATLLEE LAQAGHDLPK KILNYRQLSK LKSTYTDSLQ QHADRGTNRV HTSFALAATT TGRLSSSDPN LQNIPIRTEE GRRIRRAFVA PEGKKLISAD YSQIELRLLA HIADIPQLRE AFEQGIDIHA ATASAMFGVA LDQMTGDLRR RAKTINFGII YGISAFGLAD RLGIGREEAS AFIKQYFERF PGIRDYIDTT KRSCREKGYV TTLFGRVCHY PQIRSNNPSE RASVERQAIN APIQGTAADI IRRAMTRMED ALEAKKLTAR MLLQVHDELV FEVPDDEVEA TIPVIAGVME EAPAPALTLR VPLVVEARAA GNWEEAH
|
| |