Gene BURPS1106A_A2403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2403 
SymbolpolA 
ID4903970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2381236 
End bp2384007 
Gene Length2772 bp 
Protein Length923 aa 
Translation table11 
GC content67% 
IMG OID640145508 
ProductDNA polymerase I 
Protein accessionYP_001076435 
Protein GI126456399 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCGAGCTAT 
CTGTATCGGG CTTACCATGC GATGCCTGAT TTGCGCGGCC CTGGCGGGGA GCCGACCGGA
GCGCTCTACG GAATCATCAA TATGTTGCGC CGCATGCGCA AGGATGTCAG TGCAGAGTAT
AGCGCGTGCG TGTTCGACGC CAAGGGCAAA ACCTTCCGCG ACGATCTGTA TGCCGACTAC
AAGGCGCACC GCCCGCCGAT GCCGCCCGAT CTCGCGCTGC AGATCGAGCC GATCCATACG
GCCGTGCGCG CGCTCGGCTG GCCGCTGTTG ATGATCGAGG GCGTCGAGGC CGACGACGTG
ATCGGCACGC TTGCGAAGCA GGCCGAACAA CACGGCATGA ACGTGATCGT CTCGACGGGC
GACAAGGACC TCGCGCAGCT CGTCACCGAT CGCGTCACGC TCATCAACAC GATGACGAAC
GAGGCGCTCG ACCGCGACGG CGTGCTCGCG AAGTTCGGCG TGCCGCCGGA GCGGATCGTC
GATTACCTGT CGCTCATCGG CGACACCGTC GACAACGTGC CGGGCGTCGA GAAGTGCGGG
CCGAAGACGG CCGTCAAGTG GCTCACGCAG TACGGCTCGC TCGACGGCGT CGTCGAACAT
GCGGGCGAGA TCAAGGGCGT CGTCGGCGAC AACCTGCGCC GTGCGCTCGA TTTCCTGCCG
CTCGCGCGCA AGCTGGTGAC GGTCGAGACC GGCTGCGAGC TCGCGCCGCA CGTCGAATCG
TTCGACGCGT CGCTCGCGAC GGACGGCGAG GGCCGCGACG CGCTGCGCGA GATCTTCGCG
AAGTACGGCT TCAAGACGTG GCTGCGCGAG CTCGACAGCG AACCCGCCGC CGCGGCCGGT
GCGGCCGCCG CCGGCGCCGC GCCGGATCCG GCGAGCGGCG CGACGGCCGA ACTGCCGCTC
GCCACGGCGC GCAACTACGC GACCGTGCAG ACGTGGGAGC AGTTCGACGC GTGGCTCGCG
AAGATCTCCG CCGCAGAGCT GACCGCGTTC GACACCGAGA CGACGTCGCT CGACCCGATG
CGCGCACAAA TCGTCGGCCT GTCGTTCTCG GTGGAGCCGG GCCATGCCGC GTACGTGCCG
GTTGCGCACC GCGGGCCGGA CATGCCCGCG CAACTGCCGA GCGACGAAGT GCTCGCGAAG
CTCACGCCGT GGCTCGAGGA TGCGGGCAAG AAGAAGCTCG GCCAACACCT GAAGTACGAC
GCGCAGGTGC TCGCGAACTA CGGCATCGCG CTGAACGGCA TCGAGCACGA CACGCTGCTC
GAATCGTATG TGCTCGAATC GCACCGCACA CACGACATGG ACAGCCTTGC GCTGCGCCAT
CTCGGCGTGA AGACGATCAA GTACGAGGAC GTGGCGGGCA AGGGCGCGCA GCAGATCGGC
TTCGACGAGG TGCCGCTCGC GCAGGCGTCC GAGTATGCGG CGGAAGACGC CGACATCACG
CTGCAGCTGC ATCACGCGCT GTATCCGCAG ATCGCGCGCG AGCCGGGCCT CACGCGCGTG
TATCGCGACA TCGAGCTGCC GGTGTCGCTC GTGTTGCGCA AGATGGAGCG CACCGGCGTG
CTGATCGACG GCGACAGGCT GAGCCGCCAG AGCGGCGAGA TCGCGACGCG GCTCGTCGCG
CTCGAGCAGG AGGCGTACGC GCTCGCGGGC GGCGAATTCA ATCTCGGCTC GCCGAAGCAG
ATCGGCCAGA TTTTCTTCGA GCGGCTGCAA TTGCCCGTCG TCAAGAAGAC GCCGAGCGGC
GCGCCGTCGA CCGACGAGGA GGTGCTGCAA AAGCTCGCCG AGGACTATCC GCTGCCGAAG
CTGTTGCTTG AGCACCGCGG CCTGTCGAAG CTGAAATCGA CCTATACCGA CAAGCTGCCG
CGCATGGTCA ATCCGGACAC GGGCCGCGTG CACACGAACT ATGCGCAGGC GGTGGCGGTC
ACCGGGCGGC TCGCGTCGAA CGATCCGAAC CTGCAGAACA TCCCGGTGCG CACGGCCGAA
GGGCGGCGCA TTCGCGAGGC GTTCGTCGCG CCGCCGGGCA GCAAGATCGT GTCCGCCGAC
TATTCGCAAA TCGAGCTGCG CATCATGGCG CACATCTCGG AGGACGAATC GTTGCTGCGC
GCGTTCGCGC ACGGCGAGGA CATTCACCGC GCGACCGCGG CGGAGGTGTT CGGCGTGACG
CCGCTCGAAG TGACGTCCGA TCAGCGGCGC ATCGCGAAGG TGATCAACTT CGGTCTGATC
TACGGGATGA GTTCGTTCGG CCTCGCGTCG AATCTCGGCA TCACGCGCGA TGCGGCGAAG
CTCTATATCG ACCGCTACTT CCTCCGCTAT CCGGGCGTCG CCCGCTACAT GGAGGAAACG
CGCATGCGGG CGAAGGAGAA GGGCTACGTC GAAACCGTGT TCGGCCGCCG CCTGTGGCTG
CCCGAGATCA ACGGCGGCAA CGGGCCGCGC CGCCAGGCCG CCGAGCGCGC GGCGATCAAC
GCGCCGATGC AGGGCACGGC CGCCGATCTG ATCAAGCTGT CGATGATCGC GGTCGACGAC
TGGCTCGAGC GTGGCGGCTT GCGCGCGCGA ATGATCATGC AGGTGCACGA CGAACTCGTG
CTCGAGGTGC CGGAGGACGA GCTGTCCATC GTGCGCGAGA AGCTGCCGGA GATGATGTGC
GGCGTCGCGA AACTGAAGGT GCCGCTCGTC GCCGAGGTCG GCGTCGGCGA GAACTGGGAA
GAGGCGCACT GA
 
Protein sequence
MPEERNLEGK TLLLVDGSSY LYRAYHAMPD LRGPGGEPTG ALYGIINMLR RMRKDVSAEY 
SACVFDAKGK TFRDDLYADY KAHRPPMPPD LALQIEPIHT AVRALGWPLL MIEGVEADDV
IGTLAKQAEQ HGMNVIVSTG DKDLAQLVTD RVTLINTMTN EALDRDGVLA KFGVPPERIV
DYLSLIGDTV DNVPGVEKCG PKTAVKWLTQ YGSLDGVVEH AGEIKGVVGD NLRRALDFLP
LARKLVTVET GCELAPHVES FDASLATDGE GRDALREIFA KYGFKTWLRE LDSEPAAAAG
AAAAGAAPDP ASGATAELPL ATARNYATVQ TWEQFDAWLA KISAAELTAF DTETTSLDPM
RAQIVGLSFS VEPGHAAYVP VAHRGPDMPA QLPSDEVLAK LTPWLEDAGK KKLGQHLKYD
AQVLANYGIA LNGIEHDTLL ESYVLESHRT HDMDSLALRH LGVKTIKYED VAGKGAQQIG
FDEVPLAQAS EYAAEDADIT LQLHHALYPQ IAREPGLTRV YRDIELPVSL VLRKMERTGV
LIDGDRLSRQ SGEIATRLVA LEQEAYALAG GEFNLGSPKQ IGQIFFERLQ LPVVKKTPSG
APSTDEEVLQ KLAEDYPLPK LLLEHRGLSK LKSTYTDKLP RMVNPDTGRV HTNYAQAVAV
TGRLASNDPN LQNIPVRTAE GRRIREAFVA PPGSKIVSAD YSQIELRIMA HISEDESLLR
AFAHGEDIHR ATAAEVFGVT PLEVTSDQRR IAKVINFGLI YGMSSFGLAS NLGITRDAAK
LYIDRYFLRY PGVARYMEET RMRAKEKGYV ETVFGRRLWL PEINGGNGPR RQAAERAAIN
APMQGTAADL IKLSMIAVDD WLERGGLRAR MIMQVHDELV LEVPEDELSI VREKLPEMMC
GVAKLKVPLV AEVGVGENWE EAH