Gene Namu_3037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3037 
Symbol 
ID8448650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3331474 
End bp3334182 
Gene Length2709 bp 
Protein Length902 aa 
Translation table11 
GC content70% 
IMG OID645042121 
ProductDNA polymerase I 
Protein accessionYP_003202363 
Protein GI258653207 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000203838 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.011117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCCG ACGACGCCGA TGCGACGGAG CGGGTGCTGC TGTTGATGGA CGGGCATTCG 
TTGGCCTACC GCGCGTTCTA CGCGCTGCCC CCGGAGAACT TCTCCACCAC CACCGGCCAG
ACCACCAACG CCGTGTACGG CTTCACCTCG ATGCTGATCA ACCTGCTCCG CGACGAGCAG
CCCACCCACG TCGCGGTCGC GTTCGACCTG TCCCGGCAGA CCTGGCGGCG CGAGGAGTTC
GTCGATTACA AGGCCAACCG CAGCGCGTCG CCGTCGGAGT TCGCCGGGCA GATCGACCTG
ATCAAGGAAG TCCTGACGGC GATGCGGATC CCGTATCTGA CCGCGGAGAA CTACGAGGCC
GACGACATCA TCGCCACCCT GGCCACCCGC GCGGTCGCCG AGGGGCTCAA CGTCCGCATC
TGCACCGGCG ACCGGGACGC GCTGCAGCTG GTCTCCGACC AGGTGACCCT GCTCTACCTG
CAGCGCGGCG TCTCCGAGAT GGCCCGGTAC ACCCCGGCCG CGGTCGAGGC CAAGTACGGG
CTGACCCCGC TGCAGTACCC GGACTTCGCC GCCCTGCGCG GCGACCCGTC GGACAACCTG
CCCGGCATCC CCGGCGTGGG GGAGAAGACC GCGACCAAGT GGATCCGCGA GTTCGGCAGC
CTCACCGCGC TGGTCGATCG GGTGGACGAG GTCCGGGGCA AGGCGGGCGA CGCGTTGCGG
GAGGCCCTGC CGCACGTGCT GACCAACCGG CGGCTCACCG AACTGGTCCG CGAGGTGCCG
GTGGACGCCG ATCCGGCCAC CGACCTGGAG CGGCTGCCCT ACGACCGCGA GGCCGTGCAC
CACATCTTCG ACGACCTGCA GTTCCGGGTG CTGCGCGAGC GCCTGCTGGA TTACTTCGAG
CAGTCCGACG AGACCAGCAC CGAGGGGTTC GAGGTGGCCG GCAACCGGCT GGCCCCGGGC
ACCGTGCGGG CCTGGCTGGC CGAGCACGGC ACCGGCCGGG TCGGCCTGGT GGTCCGCGGC
ACCTGGGCGC CCGGTGGCGG CGACGTGCAC ACGCTGGCCT TGGCCGCCGC CGACGGTGAG
GCCGCGGTGG TCGACGTGGT CGACACCGAC CCGGACGACG AGGCGGCCCT GGCCGCCTGG
TTCGCCGACC CCACCCCGAC GAAGGTCGGG CACGACCTCA AGACGGCGGT CAACGCGCTG
ACCGCCCGCG GCTGGCCGGT CGGCGGCGTC GCCTGCGACA CCGCCCTGGC CGCCTACCTG
GCCCTGCCGG GGCAGCAGAC CTTCGACCTG GGCGATCTGG TCCAGCGCTA CCTGCACCGC
ACGCTGGATC CGGAGCACAG CACCAACGCC GGTCAGCAGC TCTCCTTGAT CCCGGAGGAG
AACGAGGGCG CGCAGACCGA GCAGGATTCG CGGGACATGG TCAGGGCCCG CGCCATCATC
GACCTGGCCG AGGCGCTCGA GCAGCACCTG GACTCGCTGG GTCAGAAGTC GCTGCTGGCC
GACATCGAGC TGCCGGTGAT GACGGTGCTC GGGGAGATGG AACGCGACGG CATCGCCGTC
GACGTCGACT ACCTGGACGA CCTGCAGTCG ACCTTCGCCG CCGAGGTGAC CAGCGCGGCC
AAGGCCTGCT ACGCCGAGAT CGGACGCGAG GTCAACCTGG GCTCGCCCAA GCAGCTGCAG
GTGGTGCTGT TCGACGAGCT GGGCATGCCC AAGACCAAGC GGACCAAGAC CGGCTACACC
ACCGACGCGG ACGCGCTGGT CAGCCTGCAC GAGCAGACCG GGCACCCCTT CCTCACCCAC
CTGCTGCGGC ACCGGGACGT CACCCGGCTC AAGGTGACCG TGGAGGGCCT GCGCAAGTCG
GTCGGCGACG ACGGCCGCAT CCACACCACC TTCCAGCAGA CGGTGGCCGC GACCGGACGG
CTCTCCAGCA CCGAACCCAA CCTGCAGAAC ATCCCGATCC GCACCGACGA GGGTCGGTTG
ATCCGCCGCG CCTTCGTCCC CGGCCCGCAG GCCGACCTGC TGCTCACCGC GGACTACTCG
CAGATCGAGA TGCGGATCAT GGCCACCCTG TCCGAGGACG AGGGCCTGAT CGAGGCGTTC
CGATCGGGCG AGGACCTGCA CACCTTCGTG GCCATGAAGG CGTTCGGGCT GCCGGCCGAA
CAGGTCACCC CGGAGTTGCG CCGGCGGATC AAGGCGATGT CCTACGGTCT GGCGTACGGG
CTGTCCGCCT ACGGCCTGTC CGGGCAGTTG AAGATCTCGG TCGACGAGGC CAAGGAACAG
ATGGAGGCCT ACTTCTCCCG CTTCGGCGGT GTGCGCGACT ACCTGCGCGA CACCGTGGCC
CGGGCCCGCA AGGACGGCTA CACCGAGACC ATCTTCGGGC GCCGCCGGTA CGTGCCCGAC
CTGAACAGCG ACAACCGGCA GAAGCGGGCG ATGGCCGAGC GGATCGCGTT GAACGCGCCC
ATCCAGGGCA GCGCCGCCGA CGTGATCAAG GTGGCCATGG TCAACGTGCA GCGCCGCATC
CGGGCCGAGG GTCTGCGGTC GCGGATGCTG CTGCAGGTGC ACGACGAGTT GGTCTGCGAA
GTGGTCGCCG ACGAGCTGGC GGTGATGACC GAGCTGCTCA AGCAGGAGAT GGGCGGCGCC
TACCCGCTGG CGGTGCCGCT GGAGGTCTCC GTCGGGTCGG GAGCCAACTG GGACGCGGCC
GCGCACTGA
 
Protein sequence
MPADDADATE RVLLLMDGHS LAYRAFYALP PENFSTTTGQ TTNAVYGFTS MLINLLRDEQ 
PTHVAVAFDL SRQTWRREEF VDYKANRSAS PSEFAGQIDL IKEVLTAMRI PYLTAENYEA
DDIIATLATR AVAEGLNVRI CTGDRDALQL VSDQVTLLYL QRGVSEMARY TPAAVEAKYG
LTPLQYPDFA ALRGDPSDNL PGIPGVGEKT ATKWIREFGS LTALVDRVDE VRGKAGDALR
EALPHVLTNR RLTELVREVP VDADPATDLE RLPYDREAVH HIFDDLQFRV LRERLLDYFE
QSDETSTEGF EVAGNRLAPG TVRAWLAEHG TGRVGLVVRG TWAPGGGDVH TLALAAADGE
AAVVDVVDTD PDDEAALAAW FADPTPTKVG HDLKTAVNAL TARGWPVGGV ACDTALAAYL
ALPGQQTFDL GDLVQRYLHR TLDPEHSTNA GQQLSLIPEE NEGAQTEQDS RDMVRARAII
DLAEALEQHL DSLGQKSLLA DIELPVMTVL GEMERDGIAV DVDYLDDLQS TFAAEVTSAA
KACYAEIGRE VNLGSPKQLQ VVLFDELGMP KTKRTKTGYT TDADALVSLH EQTGHPFLTH
LLRHRDVTRL KVTVEGLRKS VGDDGRIHTT FQQTVAATGR LSSTEPNLQN IPIRTDEGRL
IRRAFVPGPQ ADLLLTADYS QIEMRIMATL SEDEGLIEAF RSGEDLHTFV AMKAFGLPAE
QVTPELRRRI KAMSYGLAYG LSAYGLSGQL KISVDEAKEQ MEAYFSRFGG VRDYLRDTVA
RARKDGYTET IFGRRRYVPD LNSDNRQKRA MAERIALNAP IQGSAADVIK VAMVNVQRRI
RAEGLRSRML LQVHDELVCE VVADELAVMT ELLKQEMGGA YPLAVPLEVS VGSGANWDAA
AH