Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_7357 |
Symbol | polA |
ID | 5149500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 7723733 |
End bp | 7726795 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640562018 |
Product | DNA polymerase I |
Protein accession | YP_001243127 |
Protein GI | 148258542 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.409782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAA ATCCCTACAT TGCGCACATG CCCAAAGCGC CCCAGAAAAC TGCCGCCACC AGCCCCGCCG CCCCTGTGCC TGCTGCCGCC ACCCAAGGCG CGGCCAAGGG AGCCACCAAG AGCGACCACG TGTTCCTGGT CGATGGCTCG TCCTACATCT TCCGTGCCTA TCACGCGCTG CCGCCGCTCA ACCGCAAGTC CGACGGGCTG AACGTCAATG CCGTGCTTGG CTTCTGCAAC ATGTTGTGGA AGCTGCTCCG CGACATGCCC GAGACCGACC GGCCGACGCA TCTGGCCATC GTGTTCGACA AGTCGGAAGT GACCTTCCGC AACAAGATCT ACGCCGATTA CAAGGCCCAC CGGCCGCCGG CGCCGGACGA CCTGATCCCG CAATTCGCGC TGATCCGCGA GGCGGTGCAC GCCTTCGATC TGCCCTGCCT GGAACAGGCA GGCTACGAGG CCGACGACCT GATCGCGACC TATGCGCGGC TCGCCTGCGA GCGCGGGGCC ACCACCACCA TCGTGTCGTC CGACAAGGAC CTGATGCAGC TCGTCAACGA CTGCGTCACG ATGTACGACA CGATGAAGGA CCGCCGCATC GGCACAGCCG AGGTGATCGA GAAATTCGGC GTGCCGCCGG AGAAGGTTGT CGAGGTGCAG GCGCTGGCGG GCGATTCCAC CGACAACGTG CCGGGCGTAC CCGGCATCGG CATCAAGACC GCGGCGCAGT TGATCAATGA TTACGGCGAT CTCGAAGGGC TGCTGACGCG CGCCGCCGAG ATCAAGCAGC CGAAACGCCG CGAGGCGCTG ATCGAGAACG CCGAGAAGGC GCGGATCTCG CGCCAGCTGG TGCTGCTCGA TGATCATGTC GCGCTGGAGG TGCCGCTCGA CGACCTCGCG GTGCACGAGC CCGATCCGCG CAAGCTGATC GCCTTCCTCA AGGCGATGGA GTTCACCACG CTGACGCGGA GGGTGGCCGA ATATTCGCAG ATCGATCCGG CCGACGTCGC GGCGAATCAG CAGCTCAAGA GCGGCGGCGG CACAGCCGTA CCCGCGCCCG CTGCTGCCAG CGCAGGGACG AACGCAGCGG GCAAGCCGGC GGCCGCTGCT CCCAAAACGG GGGATCCGCG CACCGACAAA TCGGCGATGA GCAAGGGCAC ACCGGTGAGC CTCGCCGAGG CGCGCGCGGA AGCAGCGAAG AGCGCGCGCA TCGATCGCGA AAAGTATCAG ACGATCCGCA GCCTGCCCGA GCTGAACGCC TGGATCGCGC GCATCCATGA TCTCGGCCGC GTCACTATCG AGGCCAAGGC GAATTCGATC GATCCGATGC AGGCGAGCCT TTGCGGCATC GCGCTGGCGC TGGCGCCCAA CGATGCCTGC TATATTCCGT TGGCGCATCG CCAGGCCGGC GACGGCGGCG GCCTGTTTGA TGCTGGCCTC GCGCCGGACC AGGTCAAGGA CCGCGACGCG CTGGCTGCGC TGAAGCCGGT GCTGGAATCG GCCGGCATTC TCAAGATCGG CTTCAACATC AAATTCGTCG CCGTGCTGCT GGCGCAACAC GGCATCACGC TCGCAAACGT CGATGACATC CAGCTGATGA GCTACGCGCT CGACGCGGGG CGGGGTTCGC AGAAATTCGA GTCGCTGGCA GAGCACGTGC TCGCGCACGC CGTCCTCGGG GAAGGCGAGT TGGTCGGCAG CGGCAAGAAC AAGCTGACCT TCGAGCAGGT CGCCATCGAT CGCGCGACAG CGCATGCGGC CGAGGCGGCC GACGTCATCC ATCGGCTGTG GCGGGTGCTG AAGCCGCGGC TGGTCGCCGA GCGCATGACC GCGGTCTACG AGACGCTGGA GCGGCCGCTG GTGACCGTGC TGGCGCAGAT GGAGCGCAGG GGCATCGCGA TCGACCGCCA GGTGCTGTCG CGGCTGTCCG GCGATTTTGC CCAGACCGCG GGACGCGTCG AGGCCGAACT GCAGGAGCTG GCCGGCGAGC CGATCAATGT CGGCAGTCCC AAGCAGATCG GCGACATCCT GTTCGGCAAG ATGGGCCTGC CGGGCGGCAC CAAGACCAAG ACCGGCGCGT GGTCGACCTC CGCCTCGATT CTCGATGACC TCGCCGAGCA GGGCAATGAT TTCGCGCGCA AAATCCTGGA ATGGCGCCAG GTCTCGAAGC TGAAATCGAC CTACACCGAT GCGCTGCCGA CCTATGTCAA TCCGCAGAGC CAGCGCGTGC ACACCACCTA CGCGCTCGCG GCGACCACGA CGGGACGGTT GTCGTCGAAC GAACCGAATC TGCAGAACAT TCCGGTGCGG ACCGAGGATG GCCGCAAGAT CCGCCGCGCC TTCATCGCCA CGCCAGGCCA CAAACTGGTG TCGGCGGATT ACTCGCAGAT CGAGCTGCGG CTGCTGGCGG AGATCGCCGA TATTCCGGTG CTGAAACAGG CCTTCAAGGA CGGGCTCGAC ATTCACGCAA TGACGGCGTC GGAGATGTTC GGCGTGCCGA TCAAGGACAT GCCGAGCGAG GTGCGCCGCC GGGCGAAGGC GATCAATTTC GGCATCATCT ACGGCATCTC CGCCTTCGGC CTTGCCAACC AGCTCGGCAT CGCCCGCGAA GAGGCGTCGG CCTATATCAA GCGCTATTTC GAGCGCTTCC CCGGCATCCG TGCCTATATG GACGAAACGC GCGAATTCTG TCGCAAGCAC GGCTATGTCA CCACGCTGTT CGGCCGCAAG TGCCACTATC CCGAGATCAA GGCGTCGAAC GCCTCGGTGC GCGCCTTCAA CGAACGCGCC GCGATCAACG CGCGGCTGCA GGGCACCGCC GCCGACATCA TCCGCCGCGC CATGACACGC GTCGAGGATG CGCTCGCGGA GAAGAAACTC TCCGCGCAGA TGCTGCTGCA AGTGCATGAC GAATTGATCT TCGAGGTGCC GGACGACGAA GTCGCTGCGA CCCTGCCGGT GGTGCAGCAC GTCATGCAGG ACGCCCCCTT CCCGGCCGTG CTGCTATCGG TGCCGCTGCA CGTCGACGCG CGGGCCGCTG ACAATTGGGA CGAGGCGCAT TGA
|
Protein sequence | MSENPYIAHM PKAPQKTAAT SPAAPVPAAA TQGAAKGATK SDHVFLVDGS SYIFRAYHAL PPLNRKSDGL NVNAVLGFCN MLWKLLRDMP ETDRPTHLAI VFDKSEVTFR NKIYADYKAH RPPAPDDLIP QFALIREAVH AFDLPCLEQA GYEADDLIAT YARLACERGA TTTIVSSDKD LMQLVNDCVT MYDTMKDRRI GTAEVIEKFG VPPEKVVEVQ ALAGDSTDNV PGVPGIGIKT AAQLINDYGD LEGLLTRAAE IKQPKRREAL IENAEKARIS RQLVLLDDHV ALEVPLDDLA VHEPDPRKLI AFLKAMEFTT LTRRVAEYSQ IDPADVAANQ QLKSGGGTAV PAPAAASAGT NAAGKPAAAA PKTGDPRTDK SAMSKGTPVS LAEARAEAAK SARIDREKYQ TIRSLPELNA WIARIHDLGR VTIEAKANSI DPMQASLCGI ALALAPNDAC YIPLAHRQAG DGGGLFDAGL APDQVKDRDA LAALKPVLES AGILKIGFNI KFVAVLLAQH GITLANVDDI QLMSYALDAG RGSQKFESLA EHVLAHAVLG EGELVGSGKN KLTFEQVAID RATAHAAEAA DVIHRLWRVL KPRLVAERMT AVYETLERPL VTVLAQMERR GIAIDRQVLS RLSGDFAQTA GRVEAELQEL AGEPINVGSP KQIGDILFGK MGLPGGTKTK TGAWSTSASI LDDLAEQGND FARKILEWRQ VSKLKSTYTD ALPTYVNPQS QRVHTTYALA ATTTGRLSSN EPNLQNIPVR TEDGRKIRRA FIATPGHKLV SADYSQIELR LLAEIADIPV LKQAFKDGLD IHAMTASEMF GVPIKDMPSE VRRRAKAINF GIIYGISAFG LANQLGIARE EASAYIKRYF ERFPGIRAYM DETREFCRKH GYVTTLFGRK CHYPEIKASN ASVRAFNERA AINARLQGTA ADIIRRAMTR VEDALAEKKL SAQMLLQVHD ELIFEVPDDE VAATLPVVQH VMQDAPFPAV LLSVPLHVDA RAADNWDEAH
|
| |