Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1532 |
Symbol | |
ID | 4783550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1650089 |
End bp | 1653388 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640090099 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_001020729 |
Protein GI | 124266725 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0291245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.633763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAAC CAGCCCATCC CAAGACGCCG CAGCCGGCGT CGCCTCCGCC GCCGCCGCTG CACTGGACGC GACCCGGGCT GCCGGCCTAC GCGGAGCTGC ACTGCCGCTC GAACTTCAGC TTCCTGACCG GCGCCTCGCA CCCACAGGAG TTGGTGGAGC GCGCCGCCCA GCTCGGCTAT GCGGCGATCG CGATCACCGA CGAGTGCTCG GTGGCCGGCG TGGTGCGGGC GCATGCGGCG CTGAAGCGGC TGCGCGAGAA CGGCAACCCG CCGTTGCCGG CCCTGATCAT CGGTAGCGAG TTCACGCTCG CGGCCTCGCC CACCGCACCG GGCTGCCGCC TGGTGCTGCT GGCGAAGAAC CGCGAAGGCT ATGCCCAGAT CTGTGGTCTC ATCACGCTCG GTCGCGGCCG CAGCGCCAAG GGGCACTACC AACTGAGCGT CGACGACCTC GATGCGCTGC TGGCCGGCCA GGGCCATGCC GCCGGCGTGC CCGACTGCCT GGCGCTGCTG ATCCCACGCC GCGACGAGCC TCACGCCACC CTGCTCGCGC AGGCCCGTTG GCTGGCGGGC CGCTGCGGCG AGCGCGCCTC GATCGCGGTC GAGCTGCTGC GCTGGGCCGA CGACGAGTCC TTCGTGGAGC GCCTGCTCGA CGCATCGGCC GCCTCCGGCC TGCCACTGGC GGCGGCGGGC GACGTGCTGA TGCACGTGCG CTCGCGCAAG CCGCTGCAGG ACACGCTGAC CGCGATCCGC CTGAAGCGCC GCGTCGCCGA CTGCGGCTAC GCGCTGATGC GCAACGCCGA GCAGCACCTG CGCTCGCGGC TGGCGCTGGC GCAGCTCTGC CGTCCCGAGT GGCTGCAGCG CAGCGTGGAC CTCGCGCGGG CCTGCGACTT CTCGCTGGAG CAGCTGAAGT ACGAGTATCC GGACGAGATC GTGCCGGCCG GCGAGACGGC CAGCAGCCAC CTGCGGCGCC TGACCGAGGA AGGGGCCTGC GTGCGCTATC CCGGCGGCGT GCCTGGCATG GTGCAGGCGC AGATCGAACA CGAGTTCGCG CTGATCTTCC GCAAGCGCTA CGAGGCCTTC TTCCTCACCG TTCACGACGT GGTGCGCTTC GCCCGCTCCG AGGGCATCCT GTGCCAGGGT CGCGGCTCGG CCGCCAATTC GGCGGTCTGC TACTGCCTGG GCATCACCGA GGTCGACCCG ACGCAGACCC CACTGCTGTT CGAGCGCTTC ATCAGCGAGG AGCGCGGTGA GGCGCCCGAC ATCGACGTCG ACTTCGAACA CGAGCGGCGC GAGGAAGTCA TCCAGTACAT CTACGGCAAG TACGGCACGC ACCGCACCGC ACTGGCCGCC GCACTGGCGA CCTACCGCGT GCGCGGTGCG GTGCGCGAGG TCGGCAAGGC GTTGGGGCTG GACGGGCAGC GCATCGACCG CCTCGCGAAG AGCCACCAGT GGTTCGACGG GCATGAGGCT CTGCCCGCCC GGCTGGTCGA GGCCGGCTTC GATCCGGACG CCCCGGTCAC CCGGCAGTGG CTGGCACTGA GCGCCGCGCT GATCGGCTTT CCGCGGCACC TGTCGCAGCA CGTCGGCGGA TTCGTCATCG CGCGGGACGC GCTGTCGCGC CTGGTGCCGA TCGAGAACGC CGCGATGAAG GACCGCCGCG TGATCCAGTG GGACAAGGAC GATCTTGAGT CGCTGGGGCT GCTGAAGGTC GACGTGCTGG CGCTGGGCAT GCTGAGTGCG CTGAAGCGCG CGCTCGCGCT GATCACCGAC TGGCGCGGCC CGCCGCCCGG CGCCAACGGC GGCGCACCCA TATCCCCGTC ATGGCGGATG CAGGACATCC CGCGCGATGA CATCGCCACC TACGAAATGA TCCAGGAAGC TGACACCGTG GGCGTGTTCC AGATCGAGAG CCGGGCCCAG CAGAGCATGC TGCCTCGCTT GAAGCCAAGT GAGTTCTACG ACCTCGTGGT GCAGGTCGCG ATCGTGCGGC CGGGCCCGAT CAGCGGCGGC ATGGTGCATC CCTACCTGAA GCAGCGTGAT CTGCAGCGCC GGGGCCTGCG CCCGCCACCG GCCTACCCAG CGCTCGAGCA GGCGCTCGAG CGCACGCTGG GCGTTCCGAT CTTCCAGGAG CAGGTGATGC AGATCTGCAT GATCGCGGCC GATTTCAGCG GCGGCGATGC CGACGAACTG CGCCGCGCGA TGGCGGCCTG GAAACGCAAG GGCGGCTTGC AGCAATTCCA CCAGCGCATC GTCGGTCGCA TGGTGGAAAA GAACTACGAC CGCGAGTTCG CCGAGCGCAT CTTCCAGCAG ATCAAGGGCT TCGGCGAGTA CGGCTTCCCG GAGAGCCACG CCTACAGCTT CGCGCTGCTG GCCTACCTGA GCGCCTGGCT CAAGTGCCAC GAGCCGGCGA TCTTCCTGGC CGCGCTGCTG AACTCGCAAC CGATGGGCTT CTACGCACCG TCGCAGCTGG TCCAGGACGC ACGGCGCCAC GCGGTCGAAG TGCGCCCGGT CGACGTGGGC CACAGCGGCT GGGACTGCAC GCTCGAGCCG GCGTCACCGG GCAGCACGCC CTGCGCGGCA CGCTCGCTGC AGCCCGCCGT GCGCCTGGGG CTGCGCCTCG TCTCCGGCTT CAGCGAGGCG GCCGCCGACC GCATCGTGGC TGCCCGCGCT GTGTCGACAT TCGCCAGCCT CGACGACCTG GCGCGGCGCG CCGGGCTCGA TCCCCAGGCC CTGCAGGTGC TGGCCCGCGC CGACGCACTG CAGAGCCTGG CCGGCCACCG CCGGCAACAG GCGTGGCATG CGAGTGGCCA GCCCCGCCGC ACCGCCCTGC TCGCCGGTGC GACGCATCGC GAAGACCCGC TCGTCCTGCC CGCGCCGCCC GAGGCCGAGG AGATCACGCT CGACTACGCC AGCACCCAGC TCACGCTGCG CCGCCATCCG CTGGCGCTGC TGCGTCCGCG ACTCGTACAG CGTGGCTGGC GCAACGCGCG GGAGCTCGGT GCCCTGAAGG ATGGCTGCCG CGTCTGGGCC TGCGGCATCG TCGTCGGCCG CCAGCAGCCC GAGACCGCCA AGGGCACGAT CTTCGTCACG CTGGAAGACG AGACCGGCTC GGTCAACGTG ATCGTCTGGA AGGGCGTGCG CGAGCGCTTC CGCCAGGCGC TGCTCGCCTC ACGCCTGCTG GCCGTGGCCG GGGAATGGCA GCGCTCGCCG GAAGGGGTGA CGCACCTGCT GGCGCGGCGG CTGGTCGACA TGACGCCCTG GCTGGGCCGG CTGGCGACGA GTTCGCGCGA TTTCCACTGA
|
Protein sequence | MPQPAHPKTP QPASPPPPPL HWTRPGLPAY AELHCRSNFS FLTGASHPQE LVERAAQLGY AAIAITDECS VAGVVRAHAA LKRLRENGNP PLPALIIGSE FTLAASPTAP GCRLVLLAKN REGYAQICGL ITLGRGRSAK GHYQLSVDDL DALLAGQGHA AGVPDCLALL IPRRDEPHAT LLAQARWLAG RCGERASIAV ELLRWADDES FVERLLDASA ASGLPLAAAG DVLMHVRSRK PLQDTLTAIR LKRRVADCGY ALMRNAEQHL RSRLALAQLC RPEWLQRSVD LARACDFSLE QLKYEYPDEI VPAGETASSH LRRLTEEGAC VRYPGGVPGM VQAQIEHEFA LIFRKRYEAF FLTVHDVVRF ARSEGILCQG RGSAANSAVC YCLGITEVDP TQTPLLFERF ISEERGEAPD IDVDFEHERR EEVIQYIYGK YGTHRTALAA ALATYRVRGA VREVGKALGL DGQRIDRLAK SHQWFDGHEA LPARLVEAGF DPDAPVTRQW LALSAALIGF PRHLSQHVGG FVIARDALSR LVPIENAAMK DRRVIQWDKD DLESLGLLKV DVLALGMLSA LKRALALITD WRGPPPGANG GAPISPSWRM QDIPRDDIAT YEMIQEADTV GVFQIESRAQ QSMLPRLKPS EFYDLVVQVA IVRPGPISGG MVHPYLKQRD LQRRGLRPPP AYPALEQALE RTLGVPIFQE QVMQICMIAA DFSGGDADEL RRAMAAWKRK GGLQQFHQRI VGRMVEKNYD REFAERIFQQ IKGFGEYGFP ESHAYSFALL AYLSAWLKCH EPAIFLAALL NSQPMGFYAP SQLVQDARRH AVEVRPVDVG HSGWDCTLEP ASPGSTPCAA RSLQPAVRLG LRLVSGFSEA AADRIVAARA VSTFASLDDL ARRAGLDPQA LQVLARADAL QSLAGHRRQQ AWHASGQPRR TALLAGATHR EDPLVLPAPP EAEEITLDYA STQLTLRRHP LALLRPRLVQ RGWRNARELG ALKDGCRVWA CGIVVGRQQP ETAKGTIFVT LEDETGSVNV IVWKGVRERF RQALLASRLL AVAGEWQRSP EGVTHLLARR LVDMTPWLGR LATSSRDFH
|
| |