Gene Information |
Locus tag | Mmar10_0016 |
Symbol | |
ID | 4283970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 17462 |
End bp | 20248 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638139476 |
Product | DNA polymerase I |
Protein accession | YP_755250 |
Protein GI | 114568570 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
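The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose length includes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length includes one stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Mmar10_0016.
length_bp = gene_length(17462, 20248)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the listed Gene Length (2787 bp) and Protein Length (928 aa).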
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCACGA CCAAACCCGT CGACGAGACC AGCCACGTCT ATCTGATCGA CGGGTCGGGC TATATCTTCC GGGCCTATCA CGCACTGCCA CCTTTGACCC GGACCGACGG GACGCCAACC GGGGCGGTGC AGGGCTTTTG CAACATGTTG TGGAAGCTGC TGGAAGACCT GAAGGGTGAC GATCAGCCGT CGCACCTGGC GGTGATCTTC GACCATTCCG GCAAGACCTT CCGCAATGAT CTCTATGACC TCTACAAGGC CAACCGGCCC CCGGCGCCGG AAGATCTGAT CCCGCAATTC TCCATCATCC GCGATGCGAC CCGCGCCTTC GGCACGCCCT GTGTCGAGCT GGAGAATTAC GAGGCCGATG ACATCATCGC CACCTATGCC CGCCAGGCCG AGGCGCTGGG TGCGGATGTC ACCATCGTCT CGTCGGACAA GGATCTGATG CAATTGGTGA CCGACAAGGT CTCCATGTTC GACGCCATGA AGAACAAGCG CATCCAGGTC CCTGAAGTGA TGGAAAAGTT CGGCGTCGGG CCGGACAAGG TCATCGATAT CCAGTCCCTG GCCGGCGACA GCGTCGACAA TGTGCCCGGC GTGCCCGGTA TCGGCGTGAA GACGGCGGCG CTGTTGATCA ATGAATACGG CGATCTCGAC ACGCTGCTCG AGCGGGCCGG TGAGATCAAG CAGAAGGGCC GGCGCGAAAA ACTGCTCGCC CATGCCGAGG ATGCCCGCAT CTCGCGCGAC CTGGTGACCC TGAAACTCGA TGCGCCCATG CCGGAAAGGC TGGAGGAATT CGGTCTTGCC GAGCCCGATC CAGACGTCCT CGTCCCCTTC CTGCGGGAGA TGGAATTCCG CTCCTTCACC CGCAAGGTCG AAGAAGCTTT GGGCGGACCG CGGGCCGATG AGACCGGCGA TGCCACGGCG CCCATCAATC GCGACGACTA TGAGTGCGTA ACGACAATGG AGGCGCTCGA GCGCTGGATC GCCAAGAGCT TCGAAGCCGG CCAGATCGCC GTTGATACCG AGACCGATGC CCTGTCCTCG ACCGCGTCCG GCCTGGTCGG CATTTCGCTG GCCACAGCGC CGGGCAGGGC CTGCTATATC CCGCTCGCCC ATGTCGACCC GCAGGGCACG GGCGACATGT TCGACACCGG CGCCGCGCCG GAACAGATCC CGATGGATCA GGCGCTGAAG GTACTGAAAC CCCTGCTGGA AGACCCGGCC GTGCTGAAGA TCGGCCAGAA TTTCAAATAT GATCTCGGCG TGTTGTCGCG CTATGGCATT GATGTCGCGC CCTATGACGA CACCATGCTG ATCTCCTATG TCATGGAGGC CGGCCTGCAC GGGCATGGCA TGGACGCGCT GGCCGAACTT CATCTGGGCC ATACCTGCAT CCCCTTCAAG GAGATCTGCG GCACCGGCAA GAACCAGATC ACCTTCGACA AGGTGCCGCT GGACAAGGCG ACGCTCTATG CCGCCGAAGA TGCCGACATC ACGCTGCGGC TGTGGGAAAT CCTGAAACCG GCCCTGGTCG CCAAGAAAAT GGCGACGGTC TATGAGACGC TGGAACGGCC GATGGCCGAT GTGCTGTCGA AAATGGAGCG GGTCGGCATC AAGGTCGATC CGGACCAGCT AAATCGCCTG TCCTCCGATT TCGGCCAGAA GATGATGGCC GCCGAGGCCG AGGCCCATGA GGCCGCAGGC CGCGACTTCA ACGTTGCGTC ACCGAAACAA ATCGGGGAAA TCCTGTTTGG AGAGATGGGG TTACCCGGTG GCAAGAAGAC CAAGACCGGG 
GCTTGGTCGA CCGATGCCGC CGTGCTCGAC CAGCTCGCGG CCGAGGGCCA TGCCCTGCCG GTCGCGCTGC TGGAATACCG CCAGTTTGCC AAGCTGAAGT CGACTTATTC CGACAGCCTC TTCGCCCATA TCAATCGCGA CACGAAGCGC GTCCACACCT CCTTCTCCTT GGCCGCGACC ACGACCGGGC GCCTGTCCTC GACCGAGCCC AATCTGCAGA ACATCCCGAT CCGCACCGAG GCTGGCCGCC AGATCCGCGA AGTCTTTATC GCCGAACCGG GCCATGTCCT GGTCGCCGCC GATTATTCCC AGGTCGAGCT GCGCCTTCTC GCCCATATCG CCAACGTTGA AAGCCTCAAA CAGGCCTTCC GGGACGGCAC CGACATCCAT GCGATGACCG CCTCGGAAGT GTTTGGCGTG CCGATCGAGG GCATGGATCC GATGGTCCGG CGCAAGGCCA AGGCGATCAA TTTCGGCGTC ATCTACGGCA TTTCCGCCTT CGGCCTGGCC AACCAGATCG GGGTCAAGCG CGACGAGGCC AAGGCCTTCA TCGACGCCTA TTTCGAGAAA TTCCCCGGCA TCCGCGCCTA TATGGATGAG ATGAAGGCCA AGGCCGCCGA GACCGGCTAT GTCGAGACCA TTTTCGGCCG CCGCGCCCAT TTCCCGGGCA TTCGCGACAA AAACCCCAAT ATGCGCATGT TCGCCGAACG CCAGGCCATC AACGCCCCGA TCCAGGGCTC AGCCGCCGAC GTCATCCGCC GGGCCATGAT CCGCATGGAT GACGCGCTGA ACGCCGCCAA TCTCGATGCG AAAATGCTGC TCCAGGTGCA TGATGAACTG GTGTTTGAAG TGCCGGAAAA CCAGGCCGCC GATCTGATTG CGCTGACAGC AAAGGTGATG GGTGAGGCCT GCTCGCCCGC GCTGGAGCTG AGCGTGCCGC TGGTGGTGGA CGCGAAGGCG GGACGGACCT GGGGTGAAGC TCATTGA
Protein sequence | MATTKPVDET SHVYLIDGSG YIFRAYHALP PLTRTDGTPT GAVQGFCNML WKLLEDLKGD DQPSHLAVIF DHSGKTFRND LYDLYKANRP PAPEDLIPQF SIIRDATRAF GTPCVELENY EADDIIATYA RQAEALGADV TIVSSDKDLM QLVTDKVSMF DAMKNKRIQV PEVMEKFGVG PDKVIDIQSL AGDSVDNVPG VPGIGVKTAA LLINEYGDLD TLLERAGEIK QKGRREKLLA HAEDARISRD LVTLKLDAPM PERLEEFGLA EPDPDVLVPF LREMEFRSFT RKVEEALGGP RADETGDATA PINRDDYECV TTMEALERWI AKSFEAGQIA VDTETDALSS TASGLVGISL ATAPGRACYI PLAHVDPQGT GDMFDTGAAP EQIPMDQALK VLKPLLEDPA VLKIGQNFKY DLGVLSRYGI DVAPYDDTML ISYVMEAGLH GHGMDALAEL HLGHTCIPFK EICGTGKNQI TFDKVPLDKA TLYAAEDADI TLRLWEILKP ALVAKKMATV YETLERPMAD VLSKMERVGI KVDPDQLNRL SSDFGQKMMA AEAEAHEAAG RDFNVASPKQ IGEILFGEMG LPGGKKTKTG AWSTDAAVLD QLAAEGHALP VALLEYRQFA KLKSTYSDSL FAHINRDTKR VHTSFSLAAT TTGRLSSTEP NLQNIPIRTE AGRQIREVFI AEPGHVLVAA DYSQVELRLL AHIANVESLK QAFRDGTDIH AMTASEVFGV PIEGMDPMVR RKAKAINFGV IYGISAFGLA NQIGVKRDEA KAFIDAYFEK FPGIRAYMDE MKAKAAETGY VETIFGRRAH FPGIRDKNPN MRMFAERQAI NAPIQGSAAD VIRRAMIRMD DALNAANLDA KMLLQVHDEL VFEVPENQAA DLIALTAKVM GEACSPALEL SVPLVVDAKA GRTWGEAH
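The protein sequence corresponds to a codon-by-codon translation of the gene sequence, which is shown in coding orientation (it begins with ATG and ends with the stop codon TGA). The following is a minimal sketch of that translation, assuming the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in which codons may act as start codons, and this sketch does not handle that). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCCACGA CCAAACCCGT CGACGAGACC"))  # MATTKPVDET
print(translate("GGTGAAGCTCATTGA"))                   # GEAH
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Running the full gene sequence through the same function would be the way to check the remaining residues.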