Gene BURPS668_A2546 details

Gene Information

Locus tag: BURPS668_A2546
Symbol: polA
ID: 4886050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2458444
End bp: 2461215
Gene Length: 2772 bp
Protein Length: 923 aa
Translation table: 11
GC content: 67%
IMG OID: 640132483
Product: DNA polymerase I
Protein accession: YP_001063539
Protein GI: 126443368
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
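
The listed gene and protein lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed length is consistent with it) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2458444
end_bp = 2461215

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 2772 bp
print(protein_length, "aa")  # 923 aa
```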


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.809446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCGAGCTAT 
CTGTATCGGG CTTACCATGC GATGCCTGAT TTGCGCGGCC CTGGCGGGGA GCCGACCGGA
GCGCTCTACG GAATCGTCAA TATGTTGCGC CGCATGCGCA AGGATGTCAG TGCAGAGTAT
AGCGCGTGCG TGTTCGACGC CAAGGGCAAA ACCTTCCGCG ACGATCTGTA TGCCGACTAC
AAGGCGCACC GCCCGCCGAT GCCGCCCGAT CTCGCGCTGC AGATCGAGCC GATCCATACG
GCCGTGCGCG CGCTCGGCTG GCCGCTGTTG ATGATCGAGG GCGTCGAGGC CGACGACGTG
ATCGGCACGC TTGCGAAGCA GGCCGAACAA CACGGCATGA ACGTGATCGT CTCGACGGGC
GACAAGGATC TCGCGCAGCT CGTCACCGAT CGCGTCACGC TCATCAACAC GATGACGAAC
GAGGCGCTCG ACCGCGACGG CGTGCTCGCG AAGTTCGGCG TGCCGCCGGA GCGGATCGTC
GATTACCTGT CGCTCATCGG CGATACCGTC GACAACGTGC CGGGCGTCGA GAAGTGCGGG
CCGAAGACGG CCGTCAAGTG GCTCATGCAG TACGGCTCGC TCGACGGCGT CGTCGAACAT
GCGGGCGAGA TCAAGGGCGT CGTCGGCGAC AACCTGCGCC GTGCGCTCGA TTTCCTGCCG
CTCGCGCGCA AGCTGGTGAC GGTCGAGACC GGCTGCGAGC TCGCGCCGCA CGTCGAATCG
TTCGACGCGT CGCTCGCGAC GGACGGCGAG GGCCGCGACG CGCTGCGCGA GATCTTCGCG
AAGTACGGCT TCAAGACGTG GCTGCGCGAG CTCGACAGCG AACCCGCCGC CGCGGCCGGT
GCGGCCGCCG CCGGCGCCGC GCCGGATCCG GCGAGCGGCG CGACGGCCGA ACTGCCGCTC
GCCACGGCGC GCAACTACGC GACCGTGCAG ACGTGGGAGC AGTTCGACGC ATGGCTCGCG
AAGATCTCCG CCGCAGAGCT GACCGCGTTC GACACCGAGA CGACGTCGCT CGACCCGATG
CGCGCACAGA TCGTCGGCCT GTCGTTCTCG GTGGAGCCGG GCCATGCCGC GTACGTGCCG
GTTGCGCACC GCGGGCCGGA CATGCCCGCG CAACTGCCGC GCGACGAAGT GCTCGCGAAG
CTCACGCCGT GGCTCGAGGA TGCGGGCAAG AAGAAGCTCG GCCAACACCT GAAGTACGAC
GCGCAGGTGC TCGCGAACTA CGGCATCGCG CTGAACGGCA TCGAGCACGA CACGCTGCTC
GAATCGTATG TGCTCGAATC GCACCGCACG CACGACATGG ACAGCCTCGC GCTGCGCCAT
CTCGGCGTGA AGACGATCAA GTACGAGGAC GTGGCGGGCA AGGGCGCGCA GCAGATCGGC
TTCGACGAGG TGCCGCTCGC GCAGGCGTCC GAGTATGCGG CGGAAGACGC CGACATCACG
CTGCAGCTGC ATCACGCGCT GTATCCGCAG ATCGCGCGCG AGCCGGGCCT CACGCGCGTG
TATCGCGACA TCGAGCTGCC GGTGTCGCTC GTGTTGCGCA AGATGGAGCG CACCGGCGTG
CTGATCGACG GCGACAGGCT GAGCCGCCAG AGCGGCGAGA TCGCGACGCG GCTCGTCGCG
CTCGAGCAGG AGGCGTACGC GCTCGCGGGC GGCGAATTCA ATCTCGGCTC GCCGAAGCAG
ATCGGCCAGA TTTTCTTCGA GCGGCTGCAA TTGCCCGTCG TCAAGAAGAC GCCGAGCGGC
GCGCCGTCGA CCGACGAGGA GGTGCTGCAA AAGCTCGCCG AGGACTATCC GCTGCCGAAG
CTGTTGCTTG AGCACCGCGG CCTGTCGAAG CTGAAATCGA CCTATACCGA CAAGCTGCCG
CGCATGGTCA ATCCGGACAC GGGCCGCGTG CACACGAACT ATGCGCAGGC GGTGGCGGTC
ACCGGGCGGC TCGCGTCGAA CGATCCGAAC CTGCAGAACA TCCCGGTGCG CACGACCGAA
GGGCGGCGCA TCCGCGAGGC GTTCGTCGCG CCGCCGGGCA GCAAGATCGT GTCCGCCGAC
TATTCGCAAA TCGAGCTGCG CATCATGGCG CACATCTCGG AGGACGAATC GTTGCTGCGC
GCGTTCGCGC ACGGCGAGGA CATTCACCGC GCGACCGCGG CGGAGGTGTT CGGCGTGACG
CCGCTCGAAG TGACGTCCGA TCAGCGGCGC ATCGCGAAGG TGATCAACTT CGGTTTGATC
TACGGGATGA GTTCGTTCGG CCTCGCGTCG AATCTCGGCA TCACGCGCGA TGCGGCGAAG
CTCTATATCG ACCGTTACTT CCTCCGCTAT CCGGGCGTCG CCCGCTACAT GGAGGAAACG
CGCATGCGGG CGAAGGAGAA GGGCTACGTC GAAACCGTGT TCGGCCGCCG CCTGTGGCTG
CCCGAGATCA ACGGCGGCAA CGGGCCGCGC CGCCAGGCCG CCGAGCGCGC GGCGATCAAC
GCGCCGATGC AGGGCACGGC CGCCGATCTG ATCAAGCTGT CGATGATCGC GGTCGACGAC
TGGCTCGAGC GCGGCGGCTT GCGCGCGCGA ATGATCATGC AGGTGCACGA CGAACTCGTG
CTCGAGGTGC CGGAGGACGA GTTGTCCATC GTGCGCGAGA AGCTGCCGGA GATGATGTGT
GGCGTCGCGA AACTGAAGGT GCCGCTCGTC GCCGAGGTCG GCGTCGGCGA GAACTGGGAA
GAGGCGCACT GA
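
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The following is a rough sketch; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the listing is used here as an excerpt. Pasting the full listing in its place would give the figure for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing (excerpt only, not the full gene).
excerpt = "ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCGAGCTAT"
seq = join_blocks(excerpt)

# Prints the excerpt length and its GC percentage.
print(len(seq), f"{gc_fraction(seq):.0%}")
```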
 
Protein sequence
MPEERNLEGK TLLLVDGSSY LYRAYHAMPD LRGPGGEPTG ALYGIVNMLR RMRKDVSAEY 
SACVFDAKGK TFRDDLYADY KAHRPPMPPD LALQIEPIHT AVRALGWPLL MIEGVEADDV
IGTLAKQAEQ HGMNVIVSTG DKDLAQLVTD RVTLINTMTN EALDRDGVLA KFGVPPERIV
DYLSLIGDTV DNVPGVEKCG PKTAVKWLMQ YGSLDGVVEH AGEIKGVVGD NLRRALDFLP
LARKLVTVET GCELAPHVES FDASLATDGE GRDALREIFA KYGFKTWLRE LDSEPAAAAG
AAAAGAAPDP ASGATAELPL ATARNYATVQ TWEQFDAWLA KISAAELTAF DTETTSLDPM
RAQIVGLSFS VEPGHAAYVP VAHRGPDMPA QLPRDEVLAK LTPWLEDAGK KKLGQHLKYD
AQVLANYGIA LNGIEHDTLL ESYVLESHRT HDMDSLALRH LGVKTIKYED VAGKGAQQIG
FDEVPLAQAS EYAAEDADIT LQLHHALYPQ IAREPGLTRV YRDIELPVSL VLRKMERTGV
LIDGDRLSRQ SGEIATRLVA LEQEAYALAG GEFNLGSPKQ IGQIFFERLQ LPVVKKTPSG
APSTDEEVLQ KLAEDYPLPK LLLEHRGLSK LKSTYTDKLP RMVNPDTGRV HTNYAQAVAV
TGRLASNDPN LQNIPVRTTE GRRIREAFVA PPGSKIVSAD YSQIELRIMA HISEDESLLR
AFAHGEDIHR ATAAEVFGVT PLEVTSDQRR IAKVINFGLI YGMSSFGLAS NLGITRDAAK
LYIDRYFLRY PGVARYMEET RMRAKEKGYV ETVFGRRLWL PEINGGNGPR RQAAERAAIN
APMQGTAADL IKLSMIAVDD WLERGGLRAR MIMQVHDELV LEVPEDELSI VREKLPEMMC
GVAKLKVPLV AEVGVGENWE EAH
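
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical, and only the first and last blocks of the gene listing are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame DNA listing, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final 12 bases of the gene listing.
print(translate("ATGCCTGAAG AACGAAATCT GGAAGGTAAG"))  # MPEERNLEGK
print(translate("GAGGCGCACT GA"))                     # EAH
```

Both results match the start (MPEERNLEGK) and end (EAH) of the listed protein sequence. The final block is in frame because the preceding 46 lines hold 60 bases each, a multiple of three.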