Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A0851 |
Symbol | polA |
ID | 3693549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 1111054 |
End bp | 1113825 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637731105 |
Product | DNA polymerase I |
Protein accession | YP_336009 |
Protein GI | 76818134 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.280687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCGAGCTAT CTGTATCGGG CTTACCATGC GATGCCTGAT TTGCGCGGCC CTGGCGGGGA GCCGACCGGA GCGCTCTACG GAATCGTCAA TATGTTGCGC CGCATGCGCA AGGATGTCAG TGCAGAGTAT AGCGCGTGCG TGTTCGACGC CAAGGGCAAA ACCTTCCGCG ACGATCTGTA TGCCGACTAC AAGGCGCACC GCCCGCCGAT GCCGCCCGAT CTCGCGCTGC AGATCGAGCC GATCCATACG GCCGTGCGCG CGCTCGGCTG GCCGCTGTTG ATGATCGAGG GCGTCGAGGC CGACGACGTG ATCGGCACGC TTGCGAAGCA GGCCGAACAA CACGGCATGA ACGTGATCGT CTCGACGGGC GACAAGGATC TCGCGCAACT CGTCACCGAT CGCGTCACGC TCATCAACAC GATGACGAAC GAGGCGCTCG ACCGCGACGG CGTGCTCGCG AAGTTCGGCG TGCCGCCGGA GCGGATCGTC GATTACCTGT CGCTCATCGG CGACACCGTC GACAACGTGC CGGGCGTCGA GAAGTGCGGG CCGAAGACGG CCGTCAAGTG GCTCACGCAG TACGGCTCGC TCGACGGCGT CGTCGAACAT GCGGGCGAGA TCAAGGGCGT CGTCGGCGAC AACCTGCGCC GTGCGCTCGA TTTCCTGCCG CTCGCGCGCA AGCTGGTGAC GGTCGAGACC GGCTGCGAGC TCGCGCCGCA CGTCGAATCG TTCGACGCGT CGCTCGCGAC GGACGGCGAG GGCCGCGACG CGCTGCGCGA GATCTTCGCG AAGTACGGCT TCAAGACGTG GCTGCGCGAG CTCGACAGCG AACCCGCCGC CGCGGCCGGT GCGGCCGCCG CCGGCGCCGC GCCGGATCCG GCGAGCGGCG CGACGGCCGA ACTGCCGCTC GCCACGGCGC GCAACTACGC GACCGTGCAG ACGTGGGAGC AGTTCGACGC GTGGCTCGCG AAGATCTCCG CCGCAGAGCT GACCGCGTTC GACACCGAGA CGACGTCGCT CGACCCGATG CGCGCACAAA TCGTCGGCCT GTCGTTCTCG GTGGAGCCGG GCCATGCCGC GTACGTGCCG GTTGCGCACC GCGGGCCGGA CATGCCCGCG CAACTGCCGA GCGACGAAGT GCTCGCGAAG CTCACGCCGT GGCTCGAGGA TGCGGGCAAG AAGAAGCTCG GCCAACACCT GAAGTACGAC GCGCAGGTGC TCGCGAACTA CGGCATCGCG CTGAACGGCA TCGAGCACGA CACGCTGCTC GAATCGTATG TGCTCGAATC GCACCGCACA CACGACATGG ACAGCCTTGC GCTGCGCCAT CTCGGCGTGA AGACGATCAA GTACGAGGAC GTGGCGGGCA AGGGCGCGCA GCAGATCGGC TTCGACGAGG TGCCGCTCGC GCAGGCGTCC GAGTATGCGG CGGAAGACGC CGACATCACG CTGCAGCTGC ATCACGCGCT GTATCCGCAG ATCGCGCGCG AGCCGGGCCT CACGCGCGTG TATCGCGACA TCGAGCTGCC GGTGTCGCTC GTGTTGCGCA AGATGGAGCG CACCGGCGTG CTGATCGACG GCGACAAGCT GAGCCGCCAG AGCGGCGAGA TCGCGACGCG GCTCGTCGCG CTCGAGCAGG AGGCGTACGC GCTCGCGGGC GGCGAATTCA ATCTCGGCTC GCCGAAGCAG ATCGGCCAGA TTTTCTTCGA GCGGCTGCAA TTGCCCGTCG TCAAGAAGAC GCCGAGCGGC GCGCCGTCGA CCGACGAGGA GGTGCTGCAA AAGCTCGCCG AGGACTATCC GCTGCCGAAG CTGTTGCTTG AGCACCGCGG CCTGTCGAAG CTGAAATCGA CCTATACCGA CAAGCTGCCG CGCATGGTCA ATCCGGACAC GGGCCGCGTG CACACGAACT ATGCGCAGGC GGTGGCGGTC ACCGGGCGGC TCGCGTCGAA CGATCCGAAC CTGCAGAACA TCCCGGTGCG CACGGCCGAA GGGCGGCGCA TCCGCGAGGC GTTCGTCGCG CCGCCGGGCA GCAAGATCGT GTCCGCCGAC TATTCGCAAA TCGAGCTGCG CATCATGGCG CACATCTCGG AGGACGAATC GTTGCTGCGC GCGTTCGCGC ACGGCGAGGA CATTCACCGC GCGACCGCGG CGGAGGTGTT CGGCGTGACG CCGCTCGAAG TGACGTCCGA TCAGCGGCGC ATCGCGAAGG TGATCAACTT CGGTCTGATC TACGGGATGA GTTCGTTCGG CCTCGCGTCG AATCTCGGCA TCACGCGCGA TGCGGCGAAG CTCTATATCG ACCGCTACTT CCTCCGCTAT CCGGGCGTCG CCCGCTACAT GGAGGAAACG CGCATGCGGG CGAAGGAGAA GGGCTACGTC GAAACCGTGT TCGGCCGCCG CCTGTGGCTG CCCGAGATCA ACGGCGGCAA CGGGCCGCGC CGCCAGGCCG CCGAGCGCGC GGCGATCAAC GCGCCGATGC AGGGCACGGC CGCCGATCTG ATCAAGCTGT CGATGATCGC GGTCGACGAC TGGCTCGAGC GCGGCGGCTT GCGCGCGCGA ATGATCATGC AGGTGCACGA CGAACTCGTG CTCGAGGTGC CGGAGGACGA GCTGTCCATC GTGCGCGAGA AGCTGCCGGA GATGATGTGC GGCGTCGCGA AACTGAAGGT GCCGCTCGTC GCCGAGGTCG GCGTCGGCGA GAACTGGGAA GAGGCGCACT GA
|
Protein sequence | MPEERNLEGK TLLLVDGSSY LYRAYHAMPD LRGPGGEPTG ALYGIVNMLR RMRKDVSAEY SACVFDAKGK TFRDDLYADY KAHRPPMPPD LALQIEPIHT AVRALGWPLL MIEGVEADDV IGTLAKQAEQ HGMNVIVSTG DKDLAQLVTD RVTLINTMTN EALDRDGVLA KFGVPPERIV DYLSLIGDTV DNVPGVEKCG PKTAVKWLTQ YGSLDGVVEH AGEIKGVVGD NLRRALDFLP LARKLVTVET GCELAPHVES FDASLATDGE GRDALREIFA KYGFKTWLRE LDSEPAAAAG AAAAGAAPDP ASGATAELPL ATARNYATVQ TWEQFDAWLA KISAAELTAF DTETTSLDPM RAQIVGLSFS VEPGHAAYVP VAHRGPDMPA QLPSDEVLAK LTPWLEDAGK KKLGQHLKYD AQVLANYGIA LNGIEHDTLL ESYVLESHRT HDMDSLALRH LGVKTIKYED VAGKGAQQIG FDEVPLAQAS EYAAEDADIT LQLHHALYPQ IAREPGLTRV YRDIELPVSL VLRKMERTGV LIDGDKLSRQ SGEIATRLVA LEQEAYALAG GEFNLGSPKQ IGQIFFERLQ LPVVKKTPSG APSTDEEVLQ KLAEDYPLPK LLLEHRGLSK LKSTYTDKLP RMVNPDTGRV HTNYAQAVAV TGRLASNDPN LQNIPVRTAE GRRIREAFVA PPGSKIVSAD YSQIELRIMA HISEDESLLR AFAHGEDIHR ATAAEVFGVT PLEVTSDQRR IAKVINFGLI YGMSSFGLAS NLGITRDAAK LYIDRYFLRY PGVARYMEET RMRAKEKGYV ETVFGRRLWL PEINGGNGPR RQAAERAAIN APMQGTAADL IKLSMIAVDD WLERGGLRAR MIMQVHDELV LEVPEDELSI VREKLPEMMC GVAKLKVPLV AEVGVGENWE EAH
|
| |