Gene Mnod_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1501 
Symbol 
ID7307579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp1584276 
End bp1587350 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content70% 
IMG OID643599236 
ProductDNA polymerase I 
Protein accessionYP_002496796 
Protein GI220921495 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.318672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAGGAA CCGAGCCGCC CCGCATGACC GCCGATTCCG CCCCCGAGAC GAAGCCCGTC 
GGCCCCGGCG ATCAGGTGCT CCTGGTCGAC GGATCCTCCT TCATCTTCCG GGCCTATTTC
CAGTCCATCA ACCAGCCGGA GCGCTACAAC TTCCGGCCCT CGGACGGGCT GCCCACCGGC
GCCGTGCGGC TGTTCTGCGC CAAGATCGCC CAGTTCGTGC AGGAGGGGGC GGCCGGGGTG
AAGCCGAGCC ATCTCGGCAT CGTCTTCGAC AAGTCGGAGG GCTCGTTCCG CAAGGAGATT
TTTCCCGACT ACAAGGGCCA CCGGCCGGAC GCGCCCGAGG ACCTCAAGCG GCAGATGCCG
CTGATGCGCG AGGCGGTGCG GGCCTTCGGC CTCGAACCGA TCGAGCTCGA ACGCTACGAA
GCGGACGACC TCATCGCCAC CTATGCGCGG CAGGCGGAGG CGCGGGGCGC GGGCGTCATC
ATCGTGTCCT CCGACAAGGA CCTGATGCAG CTCGTCGGCG ACCTCGTGCG GTTCTACGAC
TTCGAGTCGG GCCAACAGGG CAAGCCCGGC TACCGGCCCG AGCGCAACCT CGACGCGGCG
GCGATCGTCG AGCGCTGGGA GGGCCTGAGC CCGGCGCAGA TCGGCGATGC GCTGGCGCTG
ATCGGCGACA CCTCCGACAA CGTGCCGGGC GTGCCGGGCA TCGGGCTCAA GACCGCGGCC
GTGCTGATCA AGGAGTTCGG CAGCCTGGAG GCCCTGCTGG AGCGGGCCGC AGAGATCAAG
CAGCCCAAGC GCCGCGAGAC GCTGCTCGCC AATATCGAGC AGGCCCGCCT GTCGCGACGG
CTCGTGACCC TCGACGAGGC GGTGCCGGTG CCTGTGCCGC TCGAGGCGTT GCGCCTGCGC
AAGCCCGACC CGGATCGGCT CGTCGGCTTC CTCAAGGCGA TGGAGTTCAA CACGCTGACG
CGGCGCATCG CCTCGCTGCT GCATGTCGAC CCGGAGGCGG TGAAGCCCGA CCCGGCCCTG
CTGCCGGGCG GCGCGGCGTC CTACGCCAAC GAGAAGGGCG GCAGCGACGT CACCCCGTTC
TTCGGCGACG AGGCCAGGGA TCAGCCCGCC GCCGAGGTCG ACCCCTTCGC CGATCTCGGC
CTGCCGGACG CGCCCCTCAA GCCCCGCGGG CCCGTCGAGG CGACGCCGGG CAGCCTGGTC
GCTGCCCGCG CCGCCGAGGC GGTGAAGCCC TTCGACACCG ACGCCTACGA GACCATCACC
TCCCTCGACC GGCTCGATGC CTGGGTCGCG GAGGCCGCCG AGGCCGGCGT GCTCGCGGTC
GACACCGAGA CCAACGCCCT CGACGCGCAC AGGGCGGATC TGGTCGGCGT CTCGCTCGCC
ACCGCGCCGG GGCGCGCCGC CTATATCCCG CTCTCCCATC GCGGCAGCGA GGACCTGTTC
GGCGAGGGGC TGCTGCCGAA CCAGCTCCCC TGGGAGGCCG TGCGGGCGCG CCTCAAGCCG
CTCCTCGAAG ACCCGGCCGT GCTCAAGGTC GGGCAGAACC TGAAATACGA CTGGCTGGTG
CTGGCCCGCC ACGGCATCGA GGTGCGGCCC TACGACGACA CCATGCTGAT CTCCTACGTG
CTCGACGCCG GCAAGGGCTC GCACGGCATG GACGAGCTCG CGCGCCGCCA TCTCGGGCAC
CAGCCGATCA CCTTCGCGGA TGTCACCGGC ACGGGCCGCA CCAAGGTCAC CTTCGACCGC
GTTCCCCTCG ACAAGGCCAC CGCCTATGCG GCGGAGGATG CGGACGTCAC CTTGCGCCTG
TGGCGCCTGA TGAAGCCGCG GCTCGCCGCC GAGCGGCGCG CCACCGTCTA CGAGACCCTG
GAGCGCCCCC TGGTACCGGT GCTCGCCCGC ATGGAAAGCT GCGGCATTCG GGTCGACCGG
GGCATGCTGA GCCGGCTCTC GGGCGATTTC TCGCAATCCC TGGCGCGGCT GGAGGCCGAG
ATCCAGGAGA TGGCGGGCGA GAGTTTCTCC GTCTCCTCCC CGAAGCAGAT CGGCGACATC
CTGTTCGGCA AGTTCGGGCT TCCCGGCGCC AAGAAGACGC CGTCGGGCCA ATGGGCGACG
CCCGCCACCC TGCTGGAGGA GCTTGCCGGC CAGGGCCATG CGCTGCCGAA GAAGATCCTG
GAATGGCGCC AGCTCTCGAA GCTGAAATCC ACCTACACGG ACACCCTCCA GGAGCATGCC
GACCGGGAGA CGAACCGCGT CCACACCTCC TTCGCGCTCG CCGCCACCAC GACCGGGCGG
CTCTCCTCCT CGGACCCGAA CCTGCAGAAC ATCCCGATCC GCACCGAGGA GGGGCGGCGC
ATCCGGCAGG CCTTCGTGGC GGATGAGGGC CACAAGCTGA TCTCGGCCGA TTACAGCCAG
ATCGAGCTCA GGCTGCTCGC CCACATCGCC GACATCCCGC AGCTGCGCGA AGCCTTCGCG
GCGGGCATCG ACATCCATGC GGCCACCGCC TCGGCGATGT TCGGCGTGCC CCTCGACCAG
ATGACGCCCG ACCTGCGGCG GCGGGCCAAG ACCATCAATT TCGGCATCAT CTACGGCATC
TCGGCCTTCG GCCTCGCCGA CCGGCTCGGC ATCCCGCAGG GCGAGGCCGC GGCCTTCATC
AAGCAGTATT TCGAGCGGTT CCCCGGCATC CGCGCCTATA TCGACGACAT CAAGAAGACC
TGCCGGGACA AGGGCTACGT GACGACGCTG TTCGGGCGCG TCTGCCACTA TCCGCAGATC
CGCTCGAACA ACCCGCAGGA ACGGGCCTCG GTGGAGCGGC AGGCGATCAA CGCGCCGATC
CAGGGCTCGG CCGCCGACAT CATCCGGCGC GCGATGGTGC GGATGGAGGG GGCGCTGGCG
GCGGCGGGCC TCACCACGCG CATGCTGCTG CAGGTGCATG ACGAGCTCGT CTTCGAGGCG
CCCGACGACG AGGTGGAGCG GGCGCTGCCG ATCATCGCCC GGGTGATGGA GGAGGCGCCG
CACCCGGCGG TGCAGCTCCG GGTGCCGCTT GCCGTCGAGG CCAAGGCCGC CACGAACTGG
CAGGAGGCGC ATTGA
 
Protein sequence
MSGTEPPRMT ADSAPETKPV GPGDQVLLVD GSSFIFRAYF QSINQPERYN FRPSDGLPTG 
AVRLFCAKIA QFVQEGAAGV KPSHLGIVFD KSEGSFRKEI FPDYKGHRPD APEDLKRQMP
LMREAVRAFG LEPIELERYE ADDLIATYAR QAEARGAGVI IVSSDKDLMQ LVGDLVRFYD
FESGQQGKPG YRPERNLDAA AIVERWEGLS PAQIGDALAL IGDTSDNVPG VPGIGLKTAA
VLIKEFGSLE ALLERAAEIK QPKRRETLLA NIEQARLSRR LVTLDEAVPV PVPLEALRLR
KPDPDRLVGF LKAMEFNTLT RRIASLLHVD PEAVKPDPAL LPGGAASYAN EKGGSDVTPF
FGDEARDQPA AEVDPFADLG LPDAPLKPRG PVEATPGSLV AARAAEAVKP FDTDAYETIT
SLDRLDAWVA EAAEAGVLAV DTETNALDAH RADLVGVSLA TAPGRAAYIP LSHRGSEDLF
GEGLLPNQLP WEAVRARLKP LLEDPAVLKV GQNLKYDWLV LARHGIEVRP YDDTMLISYV
LDAGKGSHGM DELARRHLGH QPITFADVTG TGRTKVTFDR VPLDKATAYA AEDADVTLRL
WRLMKPRLAA ERRATVYETL ERPLVPVLAR MESCGIRVDR GMLSRLSGDF SQSLARLEAE
IQEMAGESFS VSSPKQIGDI LFGKFGLPGA KKTPSGQWAT PATLLEELAG QGHALPKKIL
EWRQLSKLKS TYTDTLQEHA DRETNRVHTS FALAATTTGR LSSSDPNLQN IPIRTEEGRR
IRQAFVADEG HKLISADYSQ IELRLLAHIA DIPQLREAFA AGIDIHAATA SAMFGVPLDQ
MTPDLRRRAK TINFGIIYGI SAFGLADRLG IPQGEAAAFI KQYFERFPGI RAYIDDIKKT
CRDKGYVTTL FGRVCHYPQI RSNNPQERAS VERQAINAPI QGSAADIIRR AMVRMEGALA
AAGLTTRMLL QVHDELVFEA PDDEVERALP IIARVMEEAP HPAVQLRVPL AVEAKAATNW
QEAH