Gene BURPS1710b_A0851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0851 
SymbolpolA 
ID3693549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1111054 
End bp1113825 
Gene Length2772 bp 
Protein Length923 aa 
Translation table11 
GC content67% 
IMG OID637731105 
ProductDNA polymerase I 
Protein accessionYP_336009 
Protein GI76818134 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.280687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCGAGCTAT 
CTGTATCGGG CTTACCATGC GATGCCTGAT TTGCGCGGCC CTGGCGGGGA GCCGACCGGA
GCGCTCTACG GAATCGTCAA TATGTTGCGC CGCATGCGCA AGGATGTCAG TGCAGAGTAT
AGCGCGTGCG TGTTCGACGC CAAGGGCAAA ACCTTCCGCG ACGATCTGTA TGCCGACTAC
AAGGCGCACC GCCCGCCGAT GCCGCCCGAT CTCGCGCTGC AGATCGAGCC GATCCATACG
GCCGTGCGCG CGCTCGGCTG GCCGCTGTTG ATGATCGAGG GCGTCGAGGC CGACGACGTG
ATCGGCACGC TTGCGAAGCA GGCCGAACAA CACGGCATGA ACGTGATCGT CTCGACGGGC
GACAAGGATC TCGCGCAACT CGTCACCGAT CGCGTCACGC TCATCAACAC GATGACGAAC
GAGGCGCTCG ACCGCGACGG CGTGCTCGCG AAGTTCGGCG TGCCGCCGGA GCGGATCGTC
GATTACCTGT CGCTCATCGG CGACACCGTC GACAACGTGC CGGGCGTCGA GAAGTGCGGG
CCGAAGACGG CCGTCAAGTG GCTCACGCAG TACGGCTCGC TCGACGGCGT CGTCGAACAT
GCGGGCGAGA TCAAGGGCGT CGTCGGCGAC AACCTGCGCC GTGCGCTCGA TTTCCTGCCG
CTCGCGCGCA AGCTGGTGAC GGTCGAGACC GGCTGCGAGC TCGCGCCGCA CGTCGAATCG
TTCGACGCGT CGCTCGCGAC GGACGGCGAG GGCCGCGACG CGCTGCGCGA GATCTTCGCG
AAGTACGGCT TCAAGACGTG GCTGCGCGAG CTCGACAGCG AACCCGCCGC CGCGGCCGGT
GCGGCCGCCG CCGGCGCCGC GCCGGATCCG GCGAGCGGCG CGACGGCCGA ACTGCCGCTC
GCCACGGCGC GCAACTACGC GACCGTGCAG ACGTGGGAGC AGTTCGACGC GTGGCTCGCG
AAGATCTCCG CCGCAGAGCT GACCGCGTTC GACACCGAGA CGACGTCGCT CGACCCGATG
CGCGCACAAA TCGTCGGCCT GTCGTTCTCG GTGGAGCCGG GCCATGCCGC GTACGTGCCG
GTTGCGCACC GCGGGCCGGA CATGCCCGCG CAACTGCCGA GCGACGAAGT GCTCGCGAAG
CTCACGCCGT GGCTCGAGGA TGCGGGCAAG AAGAAGCTCG GCCAACACCT GAAGTACGAC
GCGCAGGTGC TCGCGAACTA CGGCATCGCG CTGAACGGCA TCGAGCACGA CACGCTGCTC
GAATCGTATG TGCTCGAATC GCACCGCACA CACGACATGG ACAGCCTTGC GCTGCGCCAT
CTCGGCGTGA AGACGATCAA GTACGAGGAC GTGGCGGGCA AGGGCGCGCA GCAGATCGGC
TTCGACGAGG TGCCGCTCGC GCAGGCGTCC GAGTATGCGG CGGAAGACGC CGACATCACG
CTGCAGCTGC ATCACGCGCT GTATCCGCAG ATCGCGCGCG AGCCGGGCCT CACGCGCGTG
TATCGCGACA TCGAGCTGCC GGTGTCGCTC GTGTTGCGCA AGATGGAGCG CACCGGCGTG
CTGATCGACG GCGACAAGCT GAGCCGCCAG AGCGGCGAGA TCGCGACGCG GCTCGTCGCG
CTCGAGCAGG AGGCGTACGC GCTCGCGGGC GGCGAATTCA ATCTCGGCTC GCCGAAGCAG
ATCGGCCAGA TTTTCTTCGA GCGGCTGCAA TTGCCCGTCG TCAAGAAGAC GCCGAGCGGC
GCGCCGTCGA CCGACGAGGA GGTGCTGCAA AAGCTCGCCG AGGACTATCC GCTGCCGAAG
CTGTTGCTTG AGCACCGCGG CCTGTCGAAG CTGAAATCGA CCTATACCGA CAAGCTGCCG
CGCATGGTCA ATCCGGACAC GGGCCGCGTG CACACGAACT ATGCGCAGGC GGTGGCGGTC
ACCGGGCGGC TCGCGTCGAA CGATCCGAAC CTGCAGAACA TCCCGGTGCG CACGGCCGAA
GGGCGGCGCA TCCGCGAGGC GTTCGTCGCG CCGCCGGGCA GCAAGATCGT GTCCGCCGAC
TATTCGCAAA TCGAGCTGCG CATCATGGCG CACATCTCGG AGGACGAATC GTTGCTGCGC
GCGTTCGCGC ACGGCGAGGA CATTCACCGC GCGACCGCGG CGGAGGTGTT CGGCGTGACG
CCGCTCGAAG TGACGTCCGA TCAGCGGCGC ATCGCGAAGG TGATCAACTT CGGTCTGATC
TACGGGATGA GTTCGTTCGG CCTCGCGTCG AATCTCGGCA TCACGCGCGA TGCGGCGAAG
CTCTATATCG ACCGCTACTT CCTCCGCTAT CCGGGCGTCG CCCGCTACAT GGAGGAAACG
CGCATGCGGG CGAAGGAGAA GGGCTACGTC GAAACCGTGT TCGGCCGCCG CCTGTGGCTG
CCCGAGATCA ACGGCGGCAA CGGGCCGCGC CGCCAGGCCG CCGAGCGCGC GGCGATCAAC
GCGCCGATGC AGGGCACGGC CGCCGATCTG ATCAAGCTGT CGATGATCGC GGTCGACGAC
TGGCTCGAGC GCGGCGGCTT GCGCGCGCGA ATGATCATGC AGGTGCACGA CGAACTCGTG
CTCGAGGTGC CGGAGGACGA GCTGTCCATC GTGCGCGAGA AGCTGCCGGA GATGATGTGC
GGCGTCGCGA AACTGAAGGT GCCGCTCGTC GCCGAGGTCG GCGTCGGCGA GAACTGGGAA
GAGGCGCACT GA
 
Protein sequence
MPEERNLEGK TLLLVDGSSY LYRAYHAMPD LRGPGGEPTG ALYGIVNMLR RMRKDVSAEY 
SACVFDAKGK TFRDDLYADY KAHRPPMPPD LALQIEPIHT AVRALGWPLL MIEGVEADDV
IGTLAKQAEQ HGMNVIVSTG DKDLAQLVTD RVTLINTMTN EALDRDGVLA KFGVPPERIV
DYLSLIGDTV DNVPGVEKCG PKTAVKWLTQ YGSLDGVVEH AGEIKGVVGD NLRRALDFLP
LARKLVTVET GCELAPHVES FDASLATDGE GRDALREIFA KYGFKTWLRE LDSEPAAAAG
AAAAGAAPDP ASGATAELPL ATARNYATVQ TWEQFDAWLA KISAAELTAF DTETTSLDPM
RAQIVGLSFS VEPGHAAYVP VAHRGPDMPA QLPSDEVLAK LTPWLEDAGK KKLGQHLKYD
AQVLANYGIA LNGIEHDTLL ESYVLESHRT HDMDSLALRH LGVKTIKYED VAGKGAQQIG
FDEVPLAQAS EYAAEDADIT LQLHHALYPQ IAREPGLTRV YRDIELPVSL VLRKMERTGV
LIDGDKLSRQ SGEIATRLVA LEQEAYALAG GEFNLGSPKQ IGQIFFERLQ LPVVKKTPSG
APSTDEEVLQ KLAEDYPLPK LLLEHRGLSK LKSTYTDKLP RMVNPDTGRV HTNYAQAVAV
TGRLASNDPN LQNIPVRTAE GRRIREAFVA PPGSKIVSAD YSQIELRIMA HISEDESLLR
AFAHGEDIHR ATAAEVFGVT PLEVTSDQRR IAKVINFGLI YGMSSFGLAS NLGITRDAAK
LYIDRYFLRY PGVARYMEET RMRAKEKGYV ETVFGRRLWL PEINGGNGPR RQAAERAAIN
APMQGTAADL IKLSMIAVDD WLERGGLRAR MIMQVHDELV LEVPEDELSI VREKLPEMMC
GVAKLKVPLV AEVGVGENWE EAH