Gene Information |
Locus tag | Namu_3037 |
Symbol | |
ID | 8448650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3331474 |
End bp | 3334182 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042121 |
Product | DNA polymerase I |
Protein accession | YP_003202363 |
Protein GI | 258653207 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
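The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3331474
end_bp = 3334182

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2709 0 902
```

Under these assumptions the result agrees with the listed Gene Length (2709 bp) and Protein Length (902 aa).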
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000203838 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.011117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCCG ACGACGCCGA TGCGACGGAG CGGGTGCTGC TGTTGATGGA CGGGCATTCG TTGGCCTACC GCGCGTTCTA CGCGCTGCCC CCGGAGAACT TCTCCACCAC CACCGGCCAG ACCACCAACG CCGTGTACGG CTTCACCTCG ATGCTGATCA ACCTGCTCCG CGACGAGCAG CCCACCCACG TCGCGGTCGC GTTCGACCTG TCCCGGCAGA CCTGGCGGCG CGAGGAGTTC GTCGATTACA AGGCCAACCG CAGCGCGTCG CCGTCGGAGT TCGCCGGGCA GATCGACCTG ATCAAGGAAG TCCTGACGGC GATGCGGATC CCGTATCTGA CCGCGGAGAA CTACGAGGCC GACGACATCA TCGCCACCCT GGCCACCCGC GCGGTCGCCG AGGGGCTCAA CGTCCGCATC TGCACCGGCG ACCGGGACGC GCTGCAGCTG GTCTCCGACC AGGTGACCCT GCTCTACCTG CAGCGCGGCG TCTCCGAGAT GGCCCGGTAC ACCCCGGCCG CGGTCGAGGC CAAGTACGGG CTGACCCCGC TGCAGTACCC GGACTTCGCC GCCCTGCGCG GCGACCCGTC GGACAACCTG CCCGGCATCC CCGGCGTGGG GGAGAAGACC GCGACCAAGT GGATCCGCGA GTTCGGCAGC CTCACCGCGC TGGTCGATCG GGTGGACGAG GTCCGGGGCA AGGCGGGCGA CGCGTTGCGG GAGGCCCTGC CGCACGTGCT GACCAACCGG CGGCTCACCG AACTGGTCCG CGAGGTGCCG GTGGACGCCG ATCCGGCCAC CGACCTGGAG CGGCTGCCCT ACGACCGCGA GGCCGTGCAC CACATCTTCG ACGACCTGCA GTTCCGGGTG CTGCGCGAGC GCCTGCTGGA TTACTTCGAG CAGTCCGACG AGACCAGCAC CGAGGGGTTC GAGGTGGCCG GCAACCGGCT GGCCCCGGGC ACCGTGCGGG CCTGGCTGGC CGAGCACGGC ACCGGCCGGG TCGGCCTGGT GGTCCGCGGC ACCTGGGCGC CCGGTGGCGG CGACGTGCAC ACGCTGGCCT TGGCCGCCGC CGACGGTGAG GCCGCGGTGG TCGACGTGGT CGACACCGAC CCGGACGACG AGGCGGCCCT GGCCGCCTGG TTCGCCGACC CCACCCCGAC GAAGGTCGGG CACGACCTCA AGACGGCGGT CAACGCGCTG ACCGCCCGCG GCTGGCCGGT CGGCGGCGTC GCCTGCGACA CCGCCCTGGC CGCCTACCTG GCCCTGCCGG GGCAGCAGAC CTTCGACCTG GGCGATCTGG TCCAGCGCTA CCTGCACCGC ACGCTGGATC CGGAGCACAG CACCAACGCC GGTCAGCAGC TCTCCTTGAT CCCGGAGGAG AACGAGGGCG CGCAGACCGA GCAGGATTCG CGGGACATGG TCAGGGCCCG CGCCATCATC GACCTGGCCG AGGCGCTCGA GCAGCACCTG GACTCGCTGG GTCAGAAGTC GCTGCTGGCC GACATCGAGC TGCCGGTGAT GACGGTGCTC GGGGAGATGG AACGCGACGG CATCGCCGTC GACGTCGACT ACCTGGACGA CCTGCAGTCG ACCTTCGCCG CCGAGGTGAC CAGCGCGGCC AAGGCCTGCT ACGCCGAGAT CGGACGCGAG GTCAACCTGG GCTCGCCCAA GCAGCTGCAG GTGGTGCTGT TCGACGAGCT GGGCATGCCC AAGACCAAGC GGACCAAGAC CGGCTACACC ACCGACGCGG ACGCGCTGGT CAGCCTGCAC GAGCAGACCG GGCACCCCTT CCTCACCCAC 
CTGCTGCGGC ACCGGGACGT CACCCGGCTC AAGGTGACCG TGGAGGGCCT GCGCAAGTCG GTCGGCGACG ACGGCCGCAT CCACACCACC TTCCAGCAGA CGGTGGCCGC GACCGGACGG CTCTCCAGCA CCGAACCCAA CCTGCAGAAC ATCCCGATCC GCACCGACGA GGGTCGGTTG ATCCGCCGCG CCTTCGTCCC CGGCCCGCAG GCCGACCTGC TGCTCACCGC GGACTACTCG CAGATCGAGA TGCGGATCAT GGCCACCCTG TCCGAGGACG AGGGCCTGAT CGAGGCGTTC CGATCGGGCG AGGACCTGCA CACCTTCGTG GCCATGAAGG CGTTCGGGCT GCCGGCCGAA CAGGTCACCC CGGAGTTGCG CCGGCGGATC AAGGCGATGT CCTACGGTCT GGCGTACGGG CTGTCCGCCT ACGGCCTGTC CGGGCAGTTG AAGATCTCGG TCGACGAGGC CAAGGAACAG ATGGAGGCCT ACTTCTCCCG CTTCGGCGGT GTGCGCGACT ACCTGCGCGA CACCGTGGCC CGGGCCCGCA AGGACGGCTA CACCGAGACC ATCTTCGGGC GCCGCCGGTA CGTGCCCGAC CTGAACAGCG ACAACCGGCA GAAGCGGGCG ATGGCCGAGC GGATCGCGTT GAACGCGCCC ATCCAGGGCA GCGCCGCCGA CGTGATCAAG GTGGCCATGG TCAACGTGCA GCGCCGCATC CGGGCCGAGG GTCTGCGGTC GCGGATGCTG CTGCAGGTGC ACGACGAGTT GGTCTGCGAA GTGGTCGCCG ACGAGCTGGC GGTGATGACC GAGCTGCTCA AGCAGGAGAT GGGCGGCGCC TACCCGCTGG CGGTGCCGCT GGAGGTCTCC GTCGGGTCGG GAGCCAACTG GGACGCGGCC GCGCACTGA
|
Protein sequence | MPADDADATE RVLLLMDGHS LAYRAFYALP PENFSTTTGQ TTNAVYGFTS MLINLLRDEQ PTHVAVAFDL SRQTWRREEF VDYKANRSAS PSEFAGQIDL IKEVLTAMRI PYLTAENYEA DDIIATLATR AVAEGLNVRI CTGDRDALQL VSDQVTLLYL QRGVSEMARY TPAAVEAKYG LTPLQYPDFA ALRGDPSDNL PGIPGVGEKT ATKWIREFGS LTALVDRVDE VRGKAGDALR EALPHVLTNR RLTELVREVP VDADPATDLE RLPYDREAVH HIFDDLQFRV LRERLLDYFE QSDETSTEGF EVAGNRLAPG TVRAWLAEHG TGRVGLVVRG TWAPGGGDVH TLALAAADGE AAVVDVVDTD PDDEAALAAW FADPTPTKVG HDLKTAVNAL TARGWPVGGV ACDTALAAYL ALPGQQTFDL GDLVQRYLHR TLDPEHSTNA GQQLSLIPEE NEGAQTEQDS RDMVRARAII DLAEALEQHL DSLGQKSLLA DIELPVMTVL GEMERDGIAV DVDYLDDLQS TFAAEVTSAA KACYAEIGRE VNLGSPKQLQ VVLFDELGMP KTKRTKTGYT TDADALVSLH EQTGHPFLTH LLRHRDVTRL KVTVEGLRKS VGDDGRIHTT FQQTVAATGR LSSTEPNLQN IPIRTDEGRL IRRAFVPGPQ ADLLLTADYS QIEMRIMATL SEDEGLIEAF RSGEDLHTFV AMKAFGLPAE QVTPELRRRI KAMSYGLAYG LSAYGLSGQL KISVDEAKEQ MEAYFSRFGG VRDYLRDTVA RARKDGYTET IFGRRRYVPD LNSDNRQKRA MAERIALNAP IQGSAADVIK VAMVNVQRRI RAEGLRSRML LQVHDELVCE VVADELAVMT ELLKQEMGGA YPLAVPLEVS VGSGANWDAA AH
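The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for a sequence printed in space-separated 10-base blocks, as in this record. The function name `gc_percent` is hypothetical, and the database's own rounding rule is not stated, so the result may differ slightly from the listed value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Illustrative input: only the first two blocks of the gene sequence.
print(gc_percent("GTGCCCGCCG ACGACGCCGA"))  # 80.0
```

Passing the full gene sequence string in place of the two-block example gives the value to compare against the GC content field.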