Gene Information |
Locus tag | MCA2573 |
Symbol | polI |
ID | 3103895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 2754083 |
End bp | 2756803 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637171710 |
Product | DNA polymerase I |
Protein accession | YP_114980 |
Protein GI | 53803295 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
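The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2754083
end_bp = 2756803
gene_length_bp = 2721
protein_length_aa = 906

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon after the coding codons.
expected_bp = (protein_length_aa + 1) * 3

print(span, expected_bp)  # prints "2721 2721"
```

Under those assumptions both values agree with the listed Gene Length of 2721 bp.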
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTGT CCGACACCTC TCCGCTCCTC GTCCTCGTCG ACGGATCCTC TTTTCTGTAC CGCGCCTTCT TTGCCCTGCC GCCTCTGACC AACTCCCACG GGGAGCCGAC CGGGGCCGTC TATGGCGTCA TCAACATGTT GCGCAAATTG TTGCAGACCT ACGACGGCGC CCACATCGCC GTGGTGTTCG ACGCGCCGGG GCGGAATTTC CGCGACGAGC TCTTCGAGCA TTACAAAGCG CACCGCCCGC CCATGCCCGA CGATCTGAGG AGCCAGATCG AGCCGCTGCA CCAGGTGGTG CGGGCGATGG GGCTGCCGAT GCTGATCGAG CCGGGGGTGG AGGCGGACGA TGTCATCGGC ACCCTGGCGA GGCAGGCGGT GGAACGGGGC TTCCGGGTGG TGATTTCGAC CGGCGACAAG GACATGGCGC AACTGGTGTG TGACCGGGTG ACTCTGGAAA ACACCATGTT CGACAGCCGG CTCGACGTCA ACGGCGTGAT CGTCAAGTTC GGCGTCCCGC CCGAGCGCAT CGTCGACTAT CTCGCCCTGG TCGGAGACAC CTCGGACAAC ATCCCCGGCG TGCCAAAGGT TGGCCCCAAA ACCGCCGCGA AATGGCTGGC CGAATACGGT TCGCTCGATG CGCTGATCGC CCGCGCCGGA GAAATCGGCG GCAAGACCGG TGAGAACCTC CGCGCCAGTC TGGAGCTGAT CCCGCTCTCG CGCGAACTGG CCACCATCCG TTGCGACCTC CCGCTGCCGC TGACGCCGGA ATCCCTGGAG CGGCGGCCGC CGGACAAGGC CGCTTTGCGC GAGCTGTATA CTCGGCTGGA ATTCAAGACC TTGCTCCGGC AACTGGACGC CGAAGCGGAC GACCCGAAGG CCGCTCCTCC CGTCGCCGCG GCGCCGGAAG TCCAAACACG CTACCAGGCG GTCATGGACC CGGCCGCATT CGACCGCTGG CTGGAGAAAC TCGAAGCGGC CGAGCTCTTC GCCTTCGATA CCGAGACCAG CAGTCTGGAC TACATGCGCG CCGAGGTCAT CGGCCTGTCC TTCGCGGTCG AGCCGGGCGC GGCGGCCTAT GTGCCGCTGG CGCACGACTA CCCCGGTGCA CCGCCCCAGC TCGACCGGGC CATGGTGCTG GAGCGCCTGC GGCCGCTGTT GGAGGACCCG GGCAAGGCCA AACTGGGCCA ACACCTCAAG TACGATGCCA ATGTGTTGCT CAACCATGGC ATCATGCTGC GCGGCATCCG CCACGACACC ATGCTCGAGT CCTACGTGCT GAACAGCACG GCCACCCGGC ATGACATGGA CTCGCTGGCC GAGCGCTACC TGGACCGCAA GACCCTCCAT TACGAGGATG TCGCTGGCAA GGGTGCGAAG CAGATCCCTT TTGCCCGGGT TTCGGTCGAG GACGCTTGCC GCTATGCCGC CGAGGATGCC GACGTCACGC TGTGCCTGCA TCGGGCGCTG TGGCCCCGAC TCGAGAGCAT ACCTTCACTG CGGGCTGTGT ACGAGACCAT CGAGATCCCG CTGGTGCCGG TGCTGTCCCG CATCGAACGT GCCGGCGTGC TGGTGGACGT GTACAAGCTG GCGGAACAGA GCCGCGAGCT GGAACGGCGC ATGGCGGAGG TCGAGGCCGA GGCCCGGGCC GTGGCGGGTG AGACCTTCAA TCTCGGCTCA CCCAAGCAGA TCCAGACCAT CCTGTACGAC AAGCTTGGGC TGCCGGTCAT CAAAAAGACG CCGACCGGCC AGCCGTCCAC CGACGAGTCC GTGCTTCAGG ATCTGGCCGA GACTTTCGAG 
CTGCCGCGGC TGATCCTGGA ATACCGCTCG CTGTCCAAGC TCAAGTCCAC TTACACCGAC AAGCTGCCGC ACCAGGTGAA TCCCGTCAGC GGCCGGGTCC ACACATCCTA TCACCAGGCG GTGGCGGCGA CCGGGCGGCT GTCTTCGTCC GACCCCAATC TGCAGAACAT CCCGGTGCGG ACCGAGGAAG GCCGCCGCAT CCGCCAGGCC TTCGTCGCGC CGCCCGGCCA CAAGCTGCTG GCGGCGGACT ACTCGCAGAT CGAACTGCGG ATCATGGCGC ATTTGTCGGG CGACGCGAAC CTGCTCGCCG CCTTCGCCGA GGACGCCGAC GTCCACCGCG CCACCGCCGC CGAGGTGTTC GGTGTGGCGC TGGAAGAGGT GACGAGCAGC CAGCGCCGTT CCGCCAAGGC CATCAACTTC GGGTTGATCT ATGGCATGTC CGCCTTCGGA CTGGCCAAGC AGCTCGGTAT CCAGCAGAAG CTGGCGCAAG GTTACATCGA CCTCTACTTC GCCCGCTATC CCGGCGTGCG CGCCTACATG GACCGTACCC GCGAATCGGC GCGCGAGTTG GGGTATGTCG AAACCCTGTT CGGCCGCCGC CTGCACGTGC CGGACATCCA GTCCCGTAAC GGCCAGCGCC GCCAGTACGC CGAGCGCACG GCGATCAACG CGCCCATGCA GGGAACGGCC GCCGACATCA TCAAGCGCGC GATGATCGCC CTGGACGCCT GGATCGAATC CAGCGGGGCG CCGTTGCGGA TGATCATGCA GGTGCATGAC GAGCTGGTGT TCGAGGTGGC CGAGGATTTC GTGTCCGAAG CCACCGCCAC CGTGCGTGAG CACATGAGCC GTGCCGCCGA ACTCGCGGTG CCGCTGATGG TCGACATCGG CACCGGCGAC AACTGGGATG AGGCTCACTG A
|
Protein sequence | MPVSDTSPLL VLVDGSSFLY RAFFALPPLT NSHGEPTGAV YGVINMLRKL LQTYDGAHIA VVFDAPGRNF RDELFEHYKA HRPPMPDDLR SQIEPLHQVV RAMGLPMLIE PGVEADDVIG TLARQAVERG FRVVISTGDK DMAQLVCDRV TLENTMFDSR LDVNGVIVKF GVPPERIVDY LALVGDTSDN IPGVPKVGPK TAAKWLAEYG SLDALIARAG EIGGKTGENL RASLELIPLS RELATIRCDL PLPLTPESLE RRPPDKAALR ELYTRLEFKT LLRQLDAEAD DPKAAPPVAA APEVQTRYQA VMDPAAFDRW LEKLEAAELF AFDTETSSLD YMRAEVIGLS FAVEPGAAAY VPLAHDYPGA PPQLDRAMVL ERLRPLLEDP GKAKLGQHLK YDANVLLNHG IMLRGIRHDT MLESYVLNST ATRHDMDSLA ERYLDRKTLH YEDVAGKGAK QIPFARVSVE DACRYAAEDA DVTLCLHRAL WPRLESIPSL RAVYETIEIP LVPVLSRIER AGVLVDVYKL AEQSRELERR MAEVEAEARA VAGETFNLGS PKQIQTILYD KLGLPVIKKT PTGQPSTDES VLQDLAETFE LPRLILEYRS LSKLKSTYTD KLPHQVNPVS GRVHTSYHQA VAATGRLSSS DPNLQNIPVR TEEGRRIRQA FVAPPGHKLL AADYSQIELR IMAHLSGDAN LLAAFAEDAD VHRATAAEVF GVALEEVTSS QRRSAKAINF GLIYGMSAFG LAKQLGIQQK LAQGYIDLYF ARYPGVRAYM DRTRESAREL GYVETLFGRR LHVPDIQSRN GQRRQYAERT AINAPMQGTA ADIIKRAMIA LDAWIESSGA PLRMIMQVHD ELVFEVAEDF VSEATATVRE HMSRAAELAV PLMVDIGTGD NWDEAH
|
| |
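As a rough consistency check between the two sequences above, the following sketch translates the first 30 and last 21 bases of the gene sequence and compares them with the ends of the protein sequence. It is a minimal illustration, not the method used by the database: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard set of amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons).

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence in this record.
gene_start = "ATGCCTGTGT CCGACACCTC TCCGCTCCTC"
gene_end = "AACTGGGATG AGGCTCACTG A"

print(translate(gene_start))  # MPVSDTSPLL
print(translate(gene_end))    # NWDEAH
```

Both fragments match the corresponding ends of the protein sequence (MPVSDTSPLL ... NWDEAH), which suggests the gene sequence is given in coding orientation even though the gene lies on the minus strand.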