Gene Information |
Locus tag | Mpe_B0619 |
Symbol | |
ID | 4787462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 575723 |
End bp | 578824 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640093040 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_001023618 |
Protein GI | 124263148 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
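The coordinate and length fields in this record are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming 1-based inclusive coordinates (the listed values are consistent with that) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length_bp(575723, 578824)
print(length, protein_length_aa(length))  # → 3102 1033
```

Both results agree with the Gene Length and Protein Length fields.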
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.130808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCACA TGGGCATCCC CGCCTACGCC GAGCTGCGGT GCCTCTCGGC CTTCAGCTTC CTCCGCGGCG CCAGCATGCC GGAGGAGCTC GTCGAGCGCG CCAAGGAGCT CGGCTACAGC GCGCTGGCCA TCACGGACGA GTGCTCGGTC GCCGGCGTCG TGCGCGCCCA CGTGACTGCC AAGGAGCAAG GTCTCCCTCT CATCATCGGC TCGCAGTTCC AGCTCGAGGA CGCGCCCTTC ACCCTGGTGG TTCTGGCCCA GACGCTGCAG GGCTACGGAA ACCTGTGCGA GTTCATCACG AAGCTTCGGA TGGCCTCGCC GAAGGGCACC TACCGGCTGC AGCTGACCGA TGTCCTGCCG GGCCCGCTCG CCGAGTGCCT GGTGCTCGTT GCCCCGGACC GGACGGCCTT CCCCGAGGAG CTGCTCAAGG TTGCCCGCTG GATGCTCACG CATTTCAGCG GCCGCTGCTG GCTCGCCGTC GAGCTGCTGC GGCTGCTGGA TGACGAGATG TGGCTGCACA AGCTGCGCGA GGTCAGCGAG CTCACGGCCA TCCCGCTCGT CGCCACGGGA GACGTGCACG TGCATGTGCG CTCCCGCAAG GCGCTTCAGG ACGTGATGAC CGCCACGCGC GTCGGCAAGC CGCTCACCGG GTGCGGCGGC GCCCTGCAGC CCAACGCCGA GCGCCATCTG CGCACCCGAC TCAGGCTGGC ACAGACCTAT CCCGAAGAGC TGCTCACCGA GACTTCCCGC GTCGCCAGCC TGTGCAGCTT CTCGCTGGAC GAGCTCAAGT ACCAGTACCC CGCCGAGGTG GTGCCTGTCG GCGAGACCGC GTCCTCCTTC CTGCGCCGGC TCACCTACGA GGGTGCAGGC CGCCGTTGGC CGGAAGGCAT GCAGGCGAAG GTGCAGACCC AGATCGAGCA CGAGCTCAAG CTCATCAGCG AGCTGGGCTA CGAGCACTAC TTCCTGACCG TCGCGGACAT CGTCAACTTC GCCCGCTCGC GCCACATCCT CTGCCAGGGC CGCGGGAGCG CCGCCAACAG CGTGGTCTGC TACTGCCTGG GAGTGACCGA GGTCGACCCG GCACGCATGA GCGTGCTCTT CGAGCGCTTC ATCTCCAAGG AACGCAACGA GCCGCCGGAC ATCGACGTCG ACTTCGAGCA CGAGCGTCGC GAGGAGGTCA TCCATTACCT CTACGGCAAG TACGGCCGGC ACAGGGCGGG GCTCACCGCC ACCGTCATCA GCTACCGCCC GAAGTCCGCC ATCCGCGACG TCGGCAAGGC GCTCGGCTTC GACCTGGAGA CCGTCGACCG GGTCGCCAAG AACCACATGT GGTTCGAGGG CCGCCAGGTG CTGACCGAAC GCATGGCCGA GGTGGGACTC AACCCGGAGG ACCTGGCCGT CAAGCAGCTG CTCGCCCTCA CCGGCCAGCT CATCGGCATG CCGCGGCACC TCTCGCAGCA CGTAGGCGGG TTCGTTCTGA CCAAGGGTCC GCTCAGTCGC ATGGTTCCCA TCGAGAACGC GGCGATGGAA GACCGCACGG TCATCGAGTG GGACAAGGAC GACCTGGACG CGCTGGGCCT CCTGAAGGTG GACGTACTGG CGCTCGGGAT GCTCACCGCC ATCCGCAAGA GCCTGGAGTT CATCGGCCAG CGCAAGGGCT ACCGCTTCGA GATGCAGGAC ATCCCGGCCG AGGACCCCGA GACCTACCGG ATGATTTCCA AGGCGGACAC GGTGGGCGTG TTCCAGATCG AGAGCCGCGC CCAGATGAGC ATGCTGCCGC GCATGAAGCC CGAGTGCTTC 
TACGACCTGG TCATCGAGGT CGCCATCGTG CGCCCGGGCC CGATCCAGGG CGGCATGGTC CACCCGTACC TCAACCGCCG CCAGGGCAAG GAGCCGGTGG TCTACCCCAG CGAAGCACTC AAGGAGGCGC TGGGCCGCAC GCTCGGCGTG CCGGTCTTCC AAGAGCAGGT GATGCAGGTG GCCATCCTCG CCGCTGGCTT CACGCCAGGC GAGGCGGACC AGCTGCGCCG CTCGATGGCC GCGTGGAAGC GCAAGGGCGG CCTGGACCAG TATTACTCGC GCATCGTCGA CGGCATGACC GCGCGCGGCT ACGAGAAGGC GTTTGCGGAG CAAATCTTCC AGCAGATTCA CGGCTTCTCG GAGTACGGCT TCCCCGAGTC GCACGCCGCC TCGTTCGCCC TGCTGGTCTA CGCCTCCTGC TGGATCAAGT GCCACCACCC GGCCGAGTTC CTGGCTGCCA TGCTCAACAG CCAGCCGTTG GGCTTCTACA CCCCCTCGCA GCTCGTGCAG GACGCCAAGC GCCACGGCGT AGACGTGCGC CCGGTCGACG TCATGTACAG CGACTGGGAC TGCACGCTCG AAGGCCTGCC GCATCCGCCG GCAGTGCGCC TTGGGCTGCG CCAGGTGTCG GGCCTTAAGG CGGAGTCCGT GCAGCGCCTG GTCGCCGCGC GTCAGGAGGC CCCGTTCGAC AACGCCGAGG ATCTGGCTCG ACGGGCAGCA CTGGAGCAGC ACGAGATGAA GCTTCTGGCG GCGGCCGACG CACTCATGAG CCTCTCGGGC CACCGGCGTC AGCAGGTATG GGACGCCGCG GCCCTGCGGT CGACTCCGAA GTTGCTTCGA GACGCGCCGG TCGACGAGGA GTTCCTCGAG CTACCGGAGG CGCCCGAGGG AGAGGAAATC GTCTGGGACT ACGCCTCAAC GGGTCTCACA CTGCGCCGGC ATCCACTCGC CCTTCTTCGC CCCCAGCTCG AGGCTCGTCG ACTCCTTACG GCCGAGCAGC TGGACCGGCT GCCAAACGGG CGGCACGTGG GCACCTGCGG CATCGTGACC CTCCGTCAGC AGCCGGATAC GGCCAACGGC GTCATCTTCG TCTCGCTCGA AGACGAAACC GGCGTGGTCC AGGTGATTTG CTGGAAGAGC ATCCGCGAGC AGCAGAGAGC GGAGCTTCTG AAGTCGCGGC TGCTCGCCGT GTACGGGAAG TGGCAGCGGC AGGGCGATGT GCGCAATCTC GTCGCGGAGC GGCTCGAAGA CTTGACCCCC CTCCTTGGCC GTCTGACGAC CGAGTCAAGG AACTTCCACT GA
|
Protein sequence | MLHMGIPAYA ELRCLSAFSF LRGASMPEEL VERAKELGYS ALAITDECSV AGVVRAHVTA KEQGLPLIIG SQFQLEDAPF TLVVLAQTLQ GYGNLCEFIT KLRMASPKGT YRLQLTDVLP GPLAECLVLV APDRTAFPEE LLKVARWMLT HFSGRCWLAV ELLRLLDDEM WLHKLREVSE LTAIPLVATG DVHVHVRSRK ALQDVMTATR VGKPLTGCGG ALQPNAERHL RTRLRLAQTY PEELLTETSR VASLCSFSLD ELKYQYPAEV VPVGETASSF LRRLTYEGAG RRWPEGMQAK VQTQIEHELK LISELGYEHY FLTVADIVNF ARSRHILCQG RGSAANSVVC YCLGVTEVDP ARMSVLFERF ISKERNEPPD IDVDFEHERR EEVIHYLYGK YGRHRAGLTA TVISYRPKSA IRDVGKALGF DLETVDRVAK NHMWFEGRQV LTERMAEVGL NPEDLAVKQL LALTGQLIGM PRHLSQHVGG FVLTKGPLSR MVPIENAAME DRTVIEWDKD DLDALGLLKV DVLALGMLTA IRKSLEFIGQ RKGYRFEMQD IPAEDPETYR MISKADTVGV FQIESRAQMS MLPRMKPECF YDLVIEVAIV RPGPIQGGMV HPYLNRRQGK EPVVYPSEAL KEALGRTLGV PVFQEQVMQV AILAAGFTPG EADQLRRSMA AWKRKGGLDQ YYSRIVDGMT ARGYEKAFAE QIFQQIHGFS EYGFPESHAA SFALLVYASC WIKCHHPAEF LAAMLNSQPL GFYTPSQLVQ DAKRHGVDVR PVDVMYSDWD CTLEGLPHPP AVRLGLRQVS GLKAESVQRL VAARQEAPFD NAEDLARRAA LEQHEMKLLA AADALMSLSG HRRQQVWDAA ALRSTPKLLR DAPVDEEFLE LPEAPEGEEI VWDYASTGLT LRRHPLALLR PQLEARRLLT AEQLDRLPNG RHVGTCGIVT LRQQPDTANG VIFVSLEDET GVVQVICWKS IREQQRAELL KSRLLAVYGK WQRQGDVRNL VAERLEDLTP LLGRLTTESR NFH
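The sequences in this record are grouped in blocks of 10 residues and may wrap across lines, so they need cleaning before use. The following is a minimal sketch, using only the Python standard library, of how one might strip the grouping, compute a GC fraction, and translate with NCBI translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the translation ignores alternative start codons.

```python
from itertools import product

# NCBI translation table 11 amino-acid string, codons ordered over bases T, C, A, G.
# The amino-acid assignments match the standard code; table 11 differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group residues, and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record.
prefix = clean("ATGCTGCACA TGGGCATCCC CGCCTACGCC")
print(translate(prefix))  # → MLHMGIPAYA
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.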