Gene Mmar10_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0016 
Symbol 
ID4283970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp17462 
End bp20248 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content62% 
IMG OID638139476 
ProductDNA polymerase I 
Protein accessionYP_755250 
Protein GI114568570 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACGA CCAAACCCGT CGACGAGACC AGCCACGTCT ATCTGATCGA CGGGTCGGGC 
TATATCTTCC GGGCCTATCA CGCACTGCCA CCTTTGACCC GGACCGACGG GACGCCAACC
GGGGCGGTGC AGGGCTTTTG CAACATGTTG TGGAAGCTGC TGGAAGACCT GAAGGGTGAC
GATCAGCCGT CGCACCTGGC GGTGATCTTC GACCATTCCG GCAAGACCTT CCGCAATGAT
CTCTATGACC TCTACAAGGC CAACCGGCCC CCGGCGCCGG AAGATCTGAT CCCGCAATTC
TCCATCATCC GCGATGCGAC CCGCGCCTTC GGCACGCCCT GTGTCGAGCT GGAGAATTAC
GAGGCCGATG ACATCATCGC CACCTATGCC CGCCAGGCCG AGGCGCTGGG TGCGGATGTC
ACCATCGTCT CGTCGGACAA GGATCTGATG CAATTGGTGA CCGACAAGGT CTCCATGTTC
GACGCCATGA AGAACAAGCG CATCCAGGTC CCTGAAGTGA TGGAAAAGTT CGGCGTCGGG
CCGGACAAGG TCATCGATAT CCAGTCCCTG GCCGGCGACA GCGTCGACAA TGTGCCCGGC
GTGCCCGGTA TCGGCGTGAA GACGGCGGCG CTGTTGATCA ATGAATACGG CGATCTCGAC
ACGCTGCTCG AGCGGGCCGG TGAGATCAAG CAGAAGGGCC GGCGCGAAAA ACTGCTCGCC
CATGCCGAGG ATGCCCGCAT CTCGCGCGAC CTGGTGACCC TGAAACTCGA TGCGCCCATG
CCGGAAAGGC TGGAGGAATT CGGTCTTGCC GAGCCCGATC CAGACGTCCT CGTCCCCTTC
CTGCGGGAGA TGGAATTCCG CTCCTTCACC CGCAAGGTCG AAGAAGCTTT GGGCGGACCG
CGGGCCGATG AGACCGGCGA TGCCACGGCG CCCATCAATC GCGACGACTA TGAGTGCGTA
ACGACAATGG AGGCGCTCGA GCGCTGGATC GCCAAGAGCT TCGAAGCCGG CCAGATCGCC
GTTGATACCG AGACCGATGC CCTGTCCTCG ACCGCGTCCG GCCTGGTCGG CATTTCGCTG
GCCACAGCGC CGGGCAGGGC CTGCTATATC CCGCTCGCCC ATGTCGACCC GCAGGGCACG
GGCGACATGT TCGACACCGG CGCCGCGCCG GAACAGATCC CGATGGATCA GGCGCTGAAG
GTACTGAAAC CCCTGCTGGA AGACCCGGCC GTGCTGAAGA TCGGCCAGAA TTTCAAATAT
GATCTCGGCG TGTTGTCGCG CTATGGCATT GATGTCGCGC CCTATGACGA CACCATGCTG
ATCTCCTATG TCATGGAGGC CGGCCTGCAC GGGCATGGCA TGGACGCGCT GGCCGAACTT
CATCTGGGCC ATACCTGCAT CCCCTTCAAG GAGATCTGCG GCACCGGCAA GAACCAGATC
ACCTTCGACA AGGTGCCGCT GGACAAGGCG ACGCTCTATG CCGCCGAAGA TGCCGACATC
ACGCTGCGGC TGTGGGAAAT CCTGAAACCG GCCCTGGTCG CCAAGAAAAT GGCGACGGTC
TATGAGACGC TGGAACGGCC GATGGCCGAT GTGCTGTCGA AAATGGAGCG GGTCGGCATC
AAGGTCGATC CGGACCAGCT AAATCGCCTG TCCTCCGATT TCGGCCAGAA GATGATGGCC
GCCGAGGCCG AGGCCCATGA GGCCGCAGGC CGCGACTTCA ACGTTGCGTC ACCGAAACAA
ATCGGGGAAA TCCTGTTTGG AGAGATGGGG TTACCCGGTG GCAAGAAGAC CAAGACCGGG
GCTTGGTCGA CCGATGCCGC CGTGCTCGAC CAGCTCGCGG CCGAGGGCCA TGCCCTGCCG
GTCGCGCTGC TGGAATACCG CCAGTTTGCC AAGCTGAAGT CGACTTATTC CGACAGCCTC
TTCGCCCATA TCAATCGCGA CACGAAGCGC GTCCACACCT CCTTCTCCTT GGCCGCGACC
ACGACCGGGC GCCTGTCCTC GACCGAGCCC AATCTGCAGA ACATCCCGAT CCGCACCGAG
GCTGGCCGCC AGATCCGCGA AGTCTTTATC GCCGAACCGG GCCATGTCCT GGTCGCCGCC
GATTATTCCC AGGTCGAGCT GCGCCTTCTC GCCCATATCG CCAACGTTGA AAGCCTCAAA
CAGGCCTTCC GGGACGGCAC CGACATCCAT GCGATGACCG CCTCGGAAGT GTTTGGCGTG
CCGATCGAGG GCATGGATCC GATGGTCCGG CGCAAGGCCA AGGCGATCAA TTTCGGCGTC
ATCTACGGCA TTTCCGCCTT CGGCCTGGCC AACCAGATCG GGGTCAAGCG CGACGAGGCC
AAGGCCTTCA TCGACGCCTA TTTCGAGAAA TTCCCCGGCA TCCGCGCCTA TATGGATGAG
ATGAAGGCCA AGGCCGCCGA GACCGGCTAT GTCGAGACCA TTTTCGGCCG CCGCGCCCAT
TTCCCGGGCA TTCGCGACAA AAACCCCAAT ATGCGCATGT TCGCCGAACG CCAGGCCATC
AACGCCCCGA TCCAGGGCTC AGCCGCCGAC GTCATCCGCC GGGCCATGAT CCGCATGGAT
GACGCGCTGA ACGCCGCCAA TCTCGATGCG AAAATGCTGC TCCAGGTGCA TGATGAACTG
GTGTTTGAAG TGCCGGAAAA CCAGGCCGCC GATCTGATTG CGCTGACAGC AAAGGTGATG
GGTGAGGCCT GCTCGCCCGC GCTGGAGCTG AGCGTGCCGC TGGTGGTGGA CGCGAAGGCG
GGACGGACCT GGGGTGAAGC TCATTGA
 
Protein sequence
MATTKPVDET SHVYLIDGSG YIFRAYHALP PLTRTDGTPT GAVQGFCNML WKLLEDLKGD 
DQPSHLAVIF DHSGKTFRND LYDLYKANRP PAPEDLIPQF SIIRDATRAF GTPCVELENY
EADDIIATYA RQAEALGADV TIVSSDKDLM QLVTDKVSMF DAMKNKRIQV PEVMEKFGVG
PDKVIDIQSL AGDSVDNVPG VPGIGVKTAA LLINEYGDLD TLLERAGEIK QKGRREKLLA
HAEDARISRD LVTLKLDAPM PERLEEFGLA EPDPDVLVPF LREMEFRSFT RKVEEALGGP
RADETGDATA PINRDDYECV TTMEALERWI AKSFEAGQIA VDTETDALSS TASGLVGISL
ATAPGRACYI PLAHVDPQGT GDMFDTGAAP EQIPMDQALK VLKPLLEDPA VLKIGQNFKY
DLGVLSRYGI DVAPYDDTML ISYVMEAGLH GHGMDALAEL HLGHTCIPFK EICGTGKNQI
TFDKVPLDKA TLYAAEDADI TLRLWEILKP ALVAKKMATV YETLERPMAD VLSKMERVGI
KVDPDQLNRL SSDFGQKMMA AEAEAHEAAG RDFNVASPKQ IGEILFGEMG LPGGKKTKTG
AWSTDAAVLD QLAAEGHALP VALLEYRQFA KLKSTYSDSL FAHINRDTKR VHTSFSLAAT
TTGRLSSTEP NLQNIPIRTE AGRQIREVFI AEPGHVLVAA DYSQVELRLL AHIANVESLK
QAFRDGTDIH AMTASEVFGV PIEGMDPMVR RKAKAINFGV IYGISAFGLA NQIGVKRDEA
KAFIDAYFEK FPGIRAYMDE MKAKAAETGY VETIFGRRAH FPGIRDKNPN MRMFAERQAI
NAPIQGSAAD VIRRAMIRMD DALNAANLDA KMLLQVHDEL VFEVPENQAA DLIALTAKVM
GEACSPALEL SVPLVVDAKA GRTWGEAH