Gene BURPS1106A_0527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0527 
SymboldnaE2 
ID4902892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp489985 
End bp493203 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content70% 
IMG OID640133757 
Producterror-prone DNA polymerase 
Protein accessionYP_001064810 
Protein GI126453110 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCGG CATCCGGAAT CCTGCCCGAC TACGCGGAGC TGTTCTGCCG GTCGAACTTC 
TCGTTCCTGC ACGGCGCGTC GTCGGCGGAA GAGCTCGTCG AGCGCGCGGC GAAGCAGGGC
TATCGCGGCA TCGCGATCAC CGACGAATGC TCGCTCGCCG GCGCGCCGCG CATGCACGTC
GCGGCGAAGG CGGTGGGGCT GCCGCTCGTC GTCGGCGCGT ACTTCGGCGT GACGCCGGAC
GACGCCGCGC CGGGCCACGA TCCGGGCCCC GGCGCGTTCG GCCTCGTGCT GCTCGCGCAA
AACCGCGAGG GCTACGGGAA CCTGTCCGAG CTGATCTCCT GGCGGCGAAT GAACGCGCCC
AAAGGCACCT ACCGGCTCAC GCCGCGCATG CTCGCCGCGC CGCCGCGGGC GCTCGCGCAT
CTGCGCGGCG TGCCCGACTG CTTCGCGATT CTCGTGCCGA CGTATCCGGC GCGCGCCGAC
GTGCTCGACG CGCAGCTCGC CTGGTTCGAC GCGCTGTTCG GCGAGCGCGC GCGGCTCGGG
CTCGTGCAGC TGCAGCGCGC GCTCGACGGC GCGCATCGCG AGCAAGTCCG GGCGGCGGGC
GAGCGGCGCG GGATGCACAT CGTCGCGCTC GGCGACGTGA CGATGCACAT CCGCTCGTGC
AAGCCGCTGC AGGACACGAT GACGGCGATT CGGCTCGGGA TGCCGATCGC CGAATGCGGC
CATGCGCTCG CGCCGAACGG CGAACAGCAC TTGCGCACGC GCCAGCGGAT CGCGCAGCTG
TTTCCGGCCG ACGCGCTCGC GCAGACATGC CGGATGCTCG ACGCGTGCCA TTTCTCGCTT
GACGACCTGC GCTACGAATA TCCGCACGAA ATCGTCCCCG CGGGCCATAC GTCGACGAGC
TATCTGGCGC AGGAAACGTG GGCGGGCGCG CGCAGGCGCT ATCCCGACGG CGTGCCGGAC
ACGGTGAGGC AGAGAATCGA GTTCGAACTC GCGCTGATCG CCGACCTGAA ATACGAGCCG
TATTTCCTGA CGGTCTACGA TATCGTCAAA TACGCGCGCA GCAAGGACAT CCTGTGCCAG
GGGCGCGGCT CGGCGGCGAA CTCGGTCGTC TGCTATTGCC TCGGCGTCAC GGAGGTCAAT
CCGCAGCAGA GCACGCTGCT GTTCGAGCGC TTCCTCAGCC GCGAGCGCGG CGAGCCGCCC
GACATCGACG TCGACTTCGA ACACCAGCGG CGCGAGGAAG TCATCCAGTA TCTGTACGAA
AAGTACGGCC ACGATCGCGC GGCGCTCGCG GCGGCCGTAT CGACCTATCG CCCGCGCGGC
GCGCTGCGCG AGACCGGCAA GGCGCTCGGC GTCGATCCGA TGCTCGTCGA GCGGGTGGCG
AAGGAGCATC GCTGGTTCGA CGGCAGCCGC GATCTGCTCG CGCGCTTCGC GTCGGTCGGG
CTCGATCCGG AGGTGCCGCT GATCCGAACC TGGGCCGAGA TCGCCGCGCG GCTGCTGAAT
TTCCCTCGCC ATCTGTCGCA GCATTCGGGC GGCTTCGTGG TGAGCCGCGG CAAGCTCACG
CGGCTCGTGC CGGTCGAGAA CGCGGCGATG GAAGGGCGGC GCGTGATTCA GTGGGACAAG
GACGATCTGG AGGCCCTCGG GCTGATGAAG GTCGACGTGC TCGCGCTCGG CATGCTGTCC
GCGTTGCATC GCGCGTTCGA CATGATCACC GCGTGGCGCG GCCCGCCGCT GCCGGACGGC
CGGCCGTTCC GGCTGGAGCA CATTCCGCAG GATGACGAAG CGACCTACGA CATGATCTGC
CGCGCGGACA CGGTCGGGGT GTTCCAGATC GAGTCGCGCG CGCAGATGTC GATGCTGCCG
CGCCTGCGGC CGCGCGGCTA TTACGATCTG GTGGTCCAGG TATCGATCGT CCGGCCGGGG
CCGATCCAGG GCGGCGCCGT GCATCCGTAC CTGGAGCGGC GGCGGATCGC GGCCGGCGAG
GCGCACGGAG AGATCACCTA TCCGAGCGAG GCGCTCGAAC GCGTGCTCGA GCGCACGCTC
GGGATTCCGA TTTTCCAGGA GCAGGTGATG CAGATCGCGA TCGTCGCGGC GGGCTTCACG
CCCGGCGAGG CCGATGCGCT GCGCCGGGCG ATGGCGGCGT GGAAACGCAA GGGCGATCTC
GGCAAGTATC ACGAGCGGAT CGTCGCGGGG ATGCTCGAGC GCGGCTATTC CCGCGAATTC
GCCGAGCAGA TCTTCGAGCA GATCAAGGGC TTCGGCGAGT ATGGCTTTCC GGAAAGCCAT
GCGGCGAGCT TCGCGAAGCT CGCTTATGCG AGCAGCTGGC TCAAGCGTCA CGAGCCGGCG
ATCTTTCTCG CCGCGCTCCT GAACAGCCAG CCGATGGGCT TCTATCCGCC CGCGCAGCTC
GTGCAGGACG CGAAGCGCCA CGGCGTGACG GTGCTGCCGA TCGATGCGAC GAAGAGCGGC
TGGGAAGCGT CGCTCGAAGC GCAGCCCGGC GCGGCGCCGC CCGACGGCCG GCCGGCGGTG
CGGCTCGGCC TGTCGCTCGT GCGCGGGCTG GGCGAGGAAG CCGCGCGGCG CATCGGCGCG
GCGCGTGCGG CGGGGCCGTT TGCGAGCGTC GACGAACTCG CGCGCCGCGC GTGCCTCGAA
CGCCGCGATC TCGAGGCGCT CGCCGCCGCG AACGCGTTCG CGACGCTTGC CGGTAATCGC
CGCGATGCGC TGTGGCAGGC GGTTGCCGCC GCGCCGGAGC GCGGCCTGCT TGCCGCCGCG
CCGATCGACG AAGCGGTGAG GCCGGCGCTC GGCGCGCCCA CCGAGGCCGA CGACGTGTTC
GCCGACTATC GCACGATCGG CCTCACGCTG AACCGGCATC CGGTCGCGCT GCTGCGGCCC
GCGCTCGACG CGCGGCGGCT ATCGTCCGCC GCGGCGCTGC GCGACCGCCG CAACGGCCGG
CTCGCGCGCG CGTGCGGGCT CGTGACCGCG CGGCAGATGC CGGGCACGGC GAAGGGCGTG
TTGTTCGTCA CGCTCGAGGA CGAGACGGGG TGCGTGAACG TGATCGTGCG GCCGGAACTG
CTCGAGCGGC AGCGGCGCGA GACGCTCGAT TCGCAACTGC TCGCGGTATC GGGCGTCTGG
CAGTGCGAGA GCGACGTCCG GCATCTCGTC GCGCAATATC TCGAGGATCT GACGCCGCTG
ATCGCGGGCT TGCGCACCGA GAGCCGCGAA TTCCATTGA
 
Protein sequence
MDAASGILPD YAELFCRSNF SFLHGASSAE ELVERAAKQG YRGIAITDEC SLAGAPRMHV 
AAKAVGLPLV VGAYFGVTPD DAAPGHDPGP GAFGLVLLAQ NREGYGNLSE LISWRRMNAP
KGTYRLTPRM LAAPPRALAH LRGVPDCFAI LVPTYPARAD VLDAQLAWFD ALFGERARLG
LVQLQRALDG AHREQVRAAG ERRGMHIVAL GDVTMHIRSC KPLQDTMTAI RLGMPIAECG
HALAPNGEQH LRTRQRIAQL FPADALAQTC RMLDACHFSL DDLRYEYPHE IVPAGHTSTS
YLAQETWAGA RRRYPDGVPD TVRQRIEFEL ALIADLKYEP YFLTVYDIVK YARSKDILCQ
GRGSAANSVV CYCLGVTEVN PQQSTLLFER FLSRERGEPP DIDVDFEHQR REEVIQYLYE
KYGHDRAALA AAVSTYRPRG ALRETGKALG VDPMLVERVA KEHRWFDGSR DLLARFASVG
LDPEVPLIRT WAEIAARLLN FPRHLSQHSG GFVVSRGKLT RLVPVENAAM EGRRVIQWDK
DDLEALGLMK VDVLALGMLS ALHRAFDMIT AWRGPPLPDG RPFRLEHIPQ DDEATYDMIC
RADTVGVFQI ESRAQMSMLP RLRPRGYYDL VVQVSIVRPG PIQGGAVHPY LERRRIAAGE
AHGEITYPSE ALERVLERTL GIPIFQEQVM QIAIVAAGFT PGEADALRRA MAAWKRKGDL
GKYHERIVAG MLERGYSREF AEQIFEQIKG FGEYGFPESH AASFAKLAYA SSWLKRHEPA
IFLAALLNSQ PMGFYPPAQL VQDAKRHGVT VLPIDATKSG WEASLEAQPG AAPPDGRPAV
RLGLSLVRGL GEEAARRIGA ARAAGPFASV DELARRACLE RRDLEALAAA NAFATLAGNR
RDALWQAVAA APERGLLAAA PIDEAVRPAL GAPTEADDVF ADYRTIGLTL NRHPVALLRP
ALDARRLSSA AALRDRRNGR LARACGLVTA RQMPGTAKGV LFVTLEDETG CVNVIVRPEL
LERQRRETLD SQLLAVSGVW QCESDVRHLV AQYLEDLTPL IAGLRTESRE FH