Gene Mmar10_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1831 
Symbol 
ID4286383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1989517 
End bp1992798 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content65% 
IMG OID638141324 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_757061 
Protein GI114570381 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.715135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CCTATGCCGA GCTTGTCGCC GCCAGCAATT TTTCCTTCCT GCGCGGCGCA 
TCCCATCCCG ATGAAATGGT GCTGGCTGCC AAGGCGCTGG GTCTTGCAGC CATCGGTCTT
TGCGACCGCA ACTCCCTTGC CGGCGTGGTC CGGGCGCACA AGGCAGCAAA GAAAGCCGGC
ATGACGCTGT GCATCGGCAC ACGCCTGGTC ACACGCTGCG GGTTCGAACT CGCGACCTAT
CCCCGCGACC GGGCCGCCTA TGCACGCCTG TCCCGCCTGC TGACCCTGGG CAAGCGCCGG
ACCGTGAAAG GCAAGTGCGA TCTGGGCCTC GAGGATGTTC TTGACCATGC CGAAGGCCAG
GTCTTCATTC TCCTGCCCCC GGCCGATCCG GGTCCGGACT GGCAGGACAC GGCCCGCCAT
TTCTGCGCCG CGGCCCCGGA CGGGGCCTGC CATCTGGCCC TGGCACAACG GTTCGACGGG
CAGGACGGGG CGCGTGTCTT CCGGCTGACC GGGTTTGCCG GACAGACCGG CCTGCCGCTC
ATCGCCACAA CGGACGCGCT CTACCACACG CCGGAGAGAC GGGCCCTGCA GGATGCCCTG
ACCTGCATTC GCGAAAAAAC CACGATCGAC ACCGCCGGTT TCAGGCTGGA GGCCCATGCC
GAACGCCATC TCAAACCGCC GGCCGAAATG GCGCGGCTGT TTGCCGCCCA TCCCGAAGCC
GTGGCCAATA CAATGCATCT GGCCCGACGC CTGCGCTTTT CCCTCGACGA GATCGCCTAC
CAATATCCCG ACGAAATCAT GGAGCCCGGC GAGACAGCCA TGCAGACCCT GACCCGGCTG
AGCCGGGAAA AGGCGGTCTG GCGCTATCCC GACGGCATTC CCGACAAGGT CGCCGCGGCG
ATCGAGTATG AACTCACCCT GATCAGGCAA CTCGACTACG CGCCCTATTT TCTCACCGTC
TACGACATCG TCCGCCATGC CCGCAGCCGC GGCATTCTCT GTCAGGGACG CGGGTCGGCG
GCCAATTCCA CCATCTGCTA CTGCCTCGGC ATCACCTCGG TGGATCCAGC CCGGATCGAC
CTTCTGTTCG AGCGCTTTGT CTCGGCCGAA CGCAATGAAC CGCCTGATAT CGACGTTGAT
TTCGAGCATG AACGTCGCGA GGAGGTGATC CAGTACATTT ACGGCAAATA CGGTCGCCAC
CGAGCCGGCA TGACGGCCAC CGTGACCACC TATCGCAGCA AGGGGGCAAT CCGCGACATC
GGCAAGGCAA TGGGGCTATC GCCGGACCTG ATCGATGCGC TGGCGCGTGC CGTCTGGCAC
GGGTCAAGCG CTGGCGTGCC GGAGGCGGAT ATCCGGGCCA TGGCCCTCGA TCCGGACGAG
CAGCGCCTGG CGCTGGCCAT GCGTCTGGTG CGCGAACTGA TCGGCTTTCC GCGCCACCTG
TCCCAGCATC CCGGCGGCTT CGTCATCACC CGCGACCGGC TCGACGACAT CGTTCCCGTG
ATGAATGCCG CCATGGCCGA CCGTACCATG ATCGAATGGG ACAAGGACGA TATCGAAACC
CTCGGCCTGA TGAAGGTCGA TGTGCTGGGC CTCGGCATGC TCAGCGTCAT CGCCAAGGCA
TTGGCGCTCC TGCGCGAGGC CTATGGCCGT CGCGAAAGCC TGGCCAGCCT GCAGGACGAG
GATCCGCGTG TCTATGACAT GATCTGCGAG GCCGACACGG TCGGCGTTTT CCAGATCGAA
AGCCGGGCCC AGATGTCGAT GCTGCCGCGG CTGAAACCGC GCCAATTCTA TGATCTGGTC
ATCGAGGTCG CCATCGTCCG GCCCGGCCCG ATCCAGGGTG ACATGGTCCA CCCCTATCTG
CGTCGCCGCG AGGGCCGCGA GAGAATCGAC TACCCGTCAC CCGAATTGAA GGCCGTTCTC
GGGCGCACGC TGGGCGTCCC CCTGTTCCAG GAACAGGCCA TGAAGATTGC CGTGGTCGCG
GCCGGCTTCA CGCCGGCCGA GGCGGACGGG TTGCGCCGGG CCATGGCAAC TTTCCGTCAT
GCCGGCATTA TCCACGAATT CCGTCACCGC TTCCTGGAGG GCATGATCAA AAACGGTTAT
GCCCCGGAAT TTGCCGAACG TTGTTTCAAG CAGATCGAAG GATTTGGCGA ATACGGATTC
CCGGAAAGCC ACGCTGCCAG CTTCGCCCTC CTCGTCTATG TCTCGGCCTG GCTGAAGTGC
CATTACCCGG ACGTCTTCTG TACCGCCATC CTGAACGCCC AGCCCATGGG GTTTTACGCC
CCGGCCCAGC TGGTCCGCGA TGCCCGCGAA CATGGCGTCG ACATACGCCC GCCCGATATC
AACCACTCTG CCTGGGACTG TACGCTGGAA GCGGGATCGG AGCCGGGGCC GGATGTCAGA
CCAAACCGGA AGGGCGCACA CCAGTATGCG GTGCGCCTGG GCCTGCGTCA GGTAAAGGGC
CTTGCCGACG CCGCCGCCGC GCGCATCGTC GTGGCGCGCC AGGCGGGCGG ACACTTCATC
TGCGTGGAAA GCCTGATGCG GCGGGCCGCC CTGTCGCGCC GCGCCCTCGA CCAGCTCGCC
AATGCTGACG CCATGCGCTC GCTCGGACAG GACCGACGGG CGGCGAGCTG GCAAGCGGCC
GGTCTGGCAA CGTCCGCCCT GCCTCTTTTC GCGGCCGGCG AAGGCGCCGA AACCGCCAAC
GGCGAAACCC CTCCGCCCTT GCCCGACATG CCGGCCAGCG AACAGGTCTA TCGCGACTAT
CGCAGCCTCG GCCTCAGCCT GAAAGGCCAT CCGCTGGGCT TCTTCCGGCA GGCGCTGTCC
GAACGCGGCC TGGTTGAGGC ACGCGCCCTG AAGACCCTGC CCAATGGCAG CCAGATCGAT
CTCGCCGGCC TGGTCCTGAT CCGTCAGCGG CCCGGCTCGG CGAGCGGAAT CGTCTTCGTC
ACCCTGGAAG ACGAAACCGG GGTGGCCAAC CTGATCGTCT GGGGCAAGGT CTTCGAACGC
TTTCGTCGCA CCGTCATCGG CGCCCGCCTG CTGCGCGTAA AGGGCCGGCT GCAACGCGAA
GGCCGGGTGA TCCACATCAT CGCCGAGACC CTGATCGATG AAAGCTGGCG GATCGACTCG
CTTCAGGATG ACGGTGATGG CTGGAACGAC CGCAGCCTGT TGCATGGCGA CGACTTCCGG
GCCGGGACCG AAGCCGATCC GCGGCCCCTC GGTCACAATC GACAAGCCGA CACGGCCGCC
CGCCGTGCCG TGGCCCTGCA GCGTTCACGG GATTTCCATT AG
 
Protein sequence
MSETYAELVA ASNFSFLRGA SHPDEMVLAA KALGLAAIGL CDRNSLAGVV RAHKAAKKAG 
MTLCIGTRLV TRCGFELATY PRDRAAYARL SRLLTLGKRR TVKGKCDLGL EDVLDHAEGQ
VFILLPPADP GPDWQDTARH FCAAAPDGAC HLALAQRFDG QDGARVFRLT GFAGQTGLPL
IATTDALYHT PERRALQDAL TCIREKTTID TAGFRLEAHA ERHLKPPAEM ARLFAAHPEA
VANTMHLARR LRFSLDEIAY QYPDEIMEPG ETAMQTLTRL SREKAVWRYP DGIPDKVAAA
IEYELTLIRQ LDYAPYFLTV YDIVRHARSR GILCQGRGSA ANSTICYCLG ITSVDPARID
LLFERFVSAE RNEPPDIDVD FEHERREEVI QYIYGKYGRH RAGMTATVTT YRSKGAIRDI
GKAMGLSPDL IDALARAVWH GSSAGVPEAD IRAMALDPDE QRLALAMRLV RELIGFPRHL
SQHPGGFVIT RDRLDDIVPV MNAAMADRTM IEWDKDDIET LGLMKVDVLG LGMLSVIAKA
LALLREAYGR RESLASLQDE DPRVYDMICE ADTVGVFQIE SRAQMSMLPR LKPRQFYDLV
IEVAIVRPGP IQGDMVHPYL RRREGRERID YPSPELKAVL GRTLGVPLFQ EQAMKIAVVA
AGFTPAEADG LRRAMATFRH AGIIHEFRHR FLEGMIKNGY APEFAERCFK QIEGFGEYGF
PESHAASFAL LVYVSAWLKC HYPDVFCTAI LNAQPMGFYA PAQLVRDARE HGVDIRPPDI
NHSAWDCTLE AGSEPGPDVR PNRKGAHQYA VRLGLRQVKG LADAAAARIV VARQAGGHFI
CVESLMRRAA LSRRALDQLA NADAMRSLGQ DRRAASWQAA GLATSALPLF AAGEGAETAN
GETPPPLPDM PASEQVYRDY RSLGLSLKGH PLGFFRQALS ERGLVEARAL KTLPNGSQID
LAGLVLIRQR PGSASGIVFV TLEDETGVAN LIVWGKVFER FRRTVIGARL LRVKGRLQRE
GRVIHIIAET LIDESWRIDS LQDDGDGWND RSLLHGDDFR AGTEADPRPL GHNRQADTAA
RRAVALQRSR DFH