Gene BMASAVP1_A1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1044 
SymboldnaE1 
ID4680648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1023751 
End bp1028811 
Gene Length5061 bp 
Protein Length1686 aa 
Translation table11 
GC content68% 
IMG OID639845316 
ProductDNA polymerase III, alpha subunit, form 1 
Protein accessionYP_992382 
Protein GI121600233 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.108134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGGCG CGGGCTCACA CGGCGCGCGC CGGTTCGGCG CCCGGCTCCG CGCGCCGGCC 
CGGCGTGCCG GAATCGCCCG GATGGCACGC GGGGCAGCAC TTGCCGGGCA CGTAGAGCGG
CGACTGTTGC GCATCGGCCG GCACGACCGC GCGGCACGCG AAGCACGTGA CGTCGGCTGT
CGGCGCGAGC TGCGGGTTGA GCGCGGTACG GTAATCGAAC ACGAAGCAGT CGCCGTGGTA
ATGGGCACCG CCGACTTCCT CGAAATACTT GAGGATCCCG CCCTCGAGCT GATAGACGTT
CTCGATGCCG ACGTCCTTCA TGTGGATCGC CGCCTTCTCG CAGCGGATGC CGCCCGTGCA
GAACGACACG ATCGTCTTGC CCTCGAGGTC CGCGCGGTTC GCCTCGATCA CGGCGGGAAA
CTCGCTGAAT TTGTCGATCC GGTAATCGAG CGCGCGATCG AACGTCCCCA CGTCGACTTC
GAACGCATTG CGCGTGTCGA GCATCACGAC GGGGCGGCCC GCGTCGTCGT GCCCCTGGTC
GAGCCACGCT TTGAGCGTGC GCGCGTCGAC GGACGGCGCG CGGCCGAGCT CCGGCTTGAT
CGCCGGCTTC TTCATCGTGA TGATCTCGCG CTTCAGGCGC ACGAGCATCC GGCGAAACGG
CTGCGAATCG GACAGGCTCT CCTTGAACGG CAGGTCCGCG AACTTGCCCT CGAAGAGCGG
ATCGTGGCGG ATGTAATCGA CGAACGCGTC GGTCGCCTCG CGTGGGCCCG CGATGAACAG
ATTGATGCCT TCGGGCGCGA GCAGGATCGT GCCGCGCAGC CCGAGCGTGT TGCAGCGCGC
GGCGACGAGC GGCCGCCATT GCTCGATCGA ATCGAGCGAG ACGAAACGGT AGGCGGCGAG
ATTGACGGTG GTCATGAGGT TCGGACGGTA AGGCGGCTGC GGCCCGGCGG CGAAGCCGCG
CGGCTCGGGG GCGGCGCGCG CGGCGAGCGC CTGGGCGCAA GCGGACGAAA ACGGGAAAAA
CGTGGAAAAG CCGTATTATC CCGCAAACCG CGCGGCTTTT TCGCAGTCGG CCGAACGGCC
GACGTGTTCG GCGCGCGCCG CGGCGCCGTC GCAAGCGCGC CGGCGCGCCG GCGTGTCGCG
GCCGGGCCCG GAGCCGGGGA GTCTCGCGCC GCGCGGTTGC GAAGGCGCGG TCGTCGGCGT
GTCGAACGGC GAGCCCACCG ACGTGTCGTT CGCCGCGTGG CCGATGCGTT CGCGGCCGCC
GGCCACGCCA TCCCGAAAAG GGGCGAAACG AGGGCGCGAA GCACGGACGC GGACGTTGCC
GCGTCGCCGC GAACGGGGCG CCGGCGGCCG GAACCCGTGA TCGAACCCGA CGATCCGAAT
CCGCGATCGG AGCCGGCCAC GCGCGCCGCG GCGAACTGCC GTCGTGCGCG GCTTCGCGGC
CCGCCGCCCG CGCCGGCCTT GGCCCGACGG CCAACGCGGC GATTTTTTGC CCGCAGCGAA
CGGTATCGGG TTCCCGGGCC GGCTACAATA GCGCCCATGT CAGATCCCCG TTTCGTTCAC
CTCCGCGTCC ACTCCGAATT CTCGATTGCC GACGGCATCG TGCGTCTCGA CGATATCGTC
AAGTCGGCGG CCGAAGACGG TCAGGGCGCG CTTGCACTGA CCGACCTCGG CAACGCGTTC
GGTCTCGTCC GTTTCTACAA GGAAGCCCGC GATGCGGGCA TCAAGCCGAT CGCCGGCTGC
GATGTCTGGA TCACCAACCA CGACGATCGC GACAAGCCGT CGCGGCTGCT GCTGCTCGTC
AAGGACAAGC GCGGCTACCT GAACCTCTGC GAGCTGCTGT CGAAGGCGTG GCTCACGAAC
CAATACCGCG GCCGCGCGGA GCTCGACGCG AGCTGGCTCG AAGGCGAGCT CGCCGAAGGG
CTGCTCGCGC TGTCAGGCGC GCAGCAGGGC GACATCGGCC TCGCGCTCGC GGCGGGCAAC
GAGGCGGCCG CGCGCCGCCA CGCGCAGCGC TGGGCGCGGG TGTTCCCGGG CGGTTTCTAT
ATCGAATTGC AGCGCTACGG CCAGCCGGGC GCGGAAGCGT ACATCCAGCA GGCGGTGACG
ATCGCGGCGG AGCTGAAGCT GCCCGTCGTC GCGACACATC CGCTGCAGTA CATGACGGCC
GACGATTTCA CCGCGCACGA GGCGCGCGTG TGCATCTCGG AAGGCGACAT CCTCGCGAAT
CCGCGCCGCC AGAAACGCTT CACGACCGAG CAGTTCTTCC GCACGCAGGG CGACATGGCC
GCGCTGTTCG CCGATCTGCC CTCGGCGCTC GCGAACACGG TCGAGATCGC CAAGCGCTGC
AACCTGACGC TCGAGCTCGG CAAGCCGAAG CTGCCGCGGT TCCCGACGCC CGACGGCATG
TCGCTCGACG ACTACCTCGT GCAGTTGTCG CAGGAAGGGC TCGACAAACG CCTCGTGCAG
CTCTACCCGG ACGAGGCCGA GCGCGAAGCG CAACGCGGGA AGTACAACCA GCGTCTCGAT
TTCGAGTGCG GCACGATCAA GAAGATGGGC TTTCCGGGCT ACTTCCTGAT CGTCGCGGAC
TTCATCAACT GGGCGAAGAA CAACGGCGTG CCGGTGGGCC CCGGGCGGGG CTCGGGCGCG
GGCTCGCTCG TCGCGTATTC GCTCGGCATC ACCGACCTCG ATCCGCTGCG CTACAACCTG
CTGTTCGAGC GCTTCCTGAA CCCGGAGCGG GTGTCGATGC CCGACTTCGA CATCGACTTC
TGCCAGCACG GCCGCGACCG CGTGATCCAG TACGTGAAGG AGAAGTACGG CGCGGACGCG
GTGTCGCAGA TCGCCACCTT CGGCACGATG GCCGCGAAGG CGGCCGTGCG GGATATCGGC
CGGGTGCTCG ATCTCGGCTA CATGTTCACC GACGGCGTCG CGAAGCTGAT CCCGTTCAAG
CCGGGCAAGC ACGTGACGAT CGCCGACGCG ATGAAGGAAG AGCCGCTCCT GCAGGAGCGC
TACGACAACG AGGACGAAGT CCATCAGTTG CTCGATCTCG CGCAGCGCGT GGAGGGCCTC
ACGCGCAACG TCGGGATGCA CGCGGGCGGC GTGCTGATCG CGCCCGGCAA GCTGACCGAT
TTCTGCCCCC TCTACACGCA GGGCGACGAA GGCGGCGTCG TCAGCCAGTA CGACAAGGAC
GACGTCGAAG CCGTCGGCCT CGTGAAGTTC GACTTTCTCG GCCTCACGAC GCTCACGATT
CTCGACTGGG CCGAGCGCTA CATTCGCCGT CTCGATCCGA GCAAGGAGAA CTGGTCGCTC
GCGCAGGTGC CGCTCGACGA TCCGACGTCG TTCCAGATCC TCAAGAAGGC CAACACGGTC
GCCGTGTTCC AGCTGGAAAG CCGCGGCATG CAGGGGATGC TGAAGGACGC GCAGCCCGAC
CGCTTCGAGG ACATCATCGC GCTCGTGTCG TTGTACCGGC CGGGCCCGAT GGACCTGATT
CCGAGCTTCT GCGCGCGCAA GCACGGGCGC GAGAAGGTCG ACTATCCGGA TCCGCGCGTC
GAGCCTGTCC TGAAAGAGAC CTACGGGATC ATGGTCTATC AGGAGCAGGT GATGCAGATG
GCGCAGATCA TCGGCGGCTA TTCGCTCGGC GGCGCGGACT TGCTGCGTCG CGCGATGGGC
AAGAAGAAGC CCGAGGAGAT GGCCAAGCAT CGCGAGATCT TCGCCGAGGG CGCCGCGAAG
AACGGCCTCA CGCGCGAGAA GTCCGACGAG ATCTTCGACC TGATGGAGAA GTTCGCGGGC
TACGGCTTCA ACAAGTCGCA CGCGGCCGCG TACGCGCTGC TCGCGTATTA CACCGCGTGG
CTGAAGGCGC ACCATCCGGC CGAATTCATG GCGGCCAACA TGACGCTCGC GATGGACGAC
ACCGACAAGG TGAAGATCCT GTTCGACGAT TGCCTCGTCA ACGGCCTCGC CGTGCTGCCG
CCCGACATCA ACCGTTCGAA CGATCGCTTC GAGCCCGTCG CCGAGGCCGA CGGCAAACGC
TCGCGCACGA TCCGCTACGG CCTCGGCGCG ATCAAGGGCA GCGGCCAGAA CGCGATCGAG
GAGATCCTGC GCGCACGCGA GGAAAAGCCG TTCGCCGATC TGTTCGATTT TTGCGAGCGG
ATCGACCGCC GCGTCGTGAA CCGCCGCACG ATCGAAGCGA TGATTCGCGC GGGCGCATTC
GATTCGCTGC ACGAGAATCG CGCGCAGTTG CTCGCATCGG TGCCGCTCGC GATGGAGGCC
GCCGAGCAGG CGGCCGCGAA CGCGCTGCAG GCGGGCCTGT TCGACATCGG CGGCGTGCCC
GCGCACCAGC ATGCGCTCGT CGACGAGCCG GCGTGGGACG ACAAGCGTCG CCTGCAGGAA
GAGAAGGGCG CGCTCGGCTT CTACCTGTCC GGCCACCTGT TCGACGCGTA TCGCGACGAG
GTGCGCCGTT TCGTGCGCCA GAAGCTCGGC GAGCTGAAGG AAGGGCGCGA CAAGGTGGTG
GCCGGCGTGA TCGCGTCGTT GCGCACGCAG ATGACGCAGC GCGGCAAGAT GGTGATCGCG
TTGCTCGACG ACGGCACCGG CCAGTGCGAA GTCACCGTGT TCAACGAGCA GTTCGACGCG
AACCGCGCGC TCTTCAAGGA GGACGAGCTG CTGATCGTCC AGGGGCAGGC GCGCAACGAC
GCGTTCACGG GCGGAATTCG CTTCACCGCC GAGTCGGTGA TGGACCTCGA GCGTGCGCGC
AGCCGCTACG CGCAGGCGGT GCGGATGACG ATGAACGGCA ACGCGGACGC GGCGGCGCTG
CGCCGCGTGC TGGAAGCGCA CGTCGCGAAA CCCGACGAGA CGCCGCCCGC CGCGGCGCCG
GCGCCGCGCG GCGGTCGCGA CGGCGGGCGG CGGGCGCAGG CGGCGATACC GAATGGTCTC
GCCGTGCGGA TCGCATACAG CAACGCGCGT GCGCAAGGCG AGATGCGCCT GGGCGACGCA
TGGCGCGTGA AGCCGAGCGA CGCGTTGCTC GCCGATCTGC GCGCGGCGTT CGGCGGCAGC
GTCGTCGAGA TCGTCTACTG A
 
Protein sequence
MGGAGSHGAR RFGARLRAPA RRAGIARMAR GAALAGHVER RLLRIGRHDR AAREARDVGC 
RRELRVERGT VIEHEAVAVV MGTADFLEIL EDPALELIDV LDADVLHVDR RLLAADAARA
ERHDRLALEV RAVRLDHGGK LAEFVDPVIE RAIERPHVDF ERIARVEHHD GAARVVVPLV
EPRFERARVD GRRAAELRLD RRLLHRDDLA LQAHEHPAKR LRIGQALLER QVRELALEER
IVADVIDERV GRLAWARDEQ IDAFGREQDR AAQPERVAAR GDERPPLLDR IERDETVGGE
IDGGHEVRTV RRLRPGGEAA RLGGGARGER LGASGRKREK RGKAVLSRKP RGFFAVGRTA
DVFGARRGAV ASAPARRRVA AGPGAGESRA ARLRRRGRRR VERRAHRRVV RRVADAFAAA
GHAIPKRGET RARSTDADVA ASPRTGRRRP EPVIEPDDPN PRSEPATRAA ANCRRARLRG
PPPAPALARR PTRRFFARSE RYRVPGPATI APMSDPRFVH LRVHSEFSIA DGIVRLDDIV
KSAAEDGQGA LALTDLGNAF GLVRFYKEAR DAGIKPIAGC DVWITNHDDR DKPSRLLLLV
KDKRGYLNLC ELLSKAWLTN QYRGRAELDA SWLEGELAEG LLALSGAQQG DIGLALAAGN
EAAARRHAQR WARVFPGGFY IELQRYGQPG AEAYIQQAVT IAAELKLPVV ATHPLQYMTA
DDFTAHEARV CISEGDILAN PRRQKRFTTE QFFRTQGDMA ALFADLPSAL ANTVEIAKRC
NLTLELGKPK LPRFPTPDGM SLDDYLVQLS QEGLDKRLVQ LYPDEAEREA QRGKYNQRLD
FECGTIKKMG FPGYFLIVAD FINWAKNNGV PVGPGRGSGA GSLVAYSLGI TDLDPLRYNL
LFERFLNPER VSMPDFDIDF CQHGRDRVIQ YVKEKYGADA VSQIATFGTM AAKAAVRDIG
RVLDLGYMFT DGVAKLIPFK PGKHVTIADA MKEEPLLQER YDNEDEVHQL LDLAQRVEGL
TRNVGMHAGG VLIAPGKLTD FCPLYTQGDE GGVVSQYDKD DVEAVGLVKF DFLGLTTLTI
LDWAERYIRR LDPSKENWSL AQVPLDDPTS FQILKKANTV AVFQLESRGM QGMLKDAQPD
RFEDIIALVS LYRPGPMDLI PSFCARKHGR EKVDYPDPRV EPVLKETYGI MVYQEQVMQM
AQIIGGYSLG GADLLRRAMG KKKPEEMAKH REIFAEGAAK NGLTREKSDE IFDLMEKFAG
YGFNKSHAAA YALLAYYTAW LKAHHPAEFM AANMTLAMDD TDKVKILFDD CLVNGLAVLP
PDINRSNDRF EPVAEADGKR SRTIRYGLGA IKGSGQNAIE EILRAREEKP FADLFDFCER
IDRRVVNRRT IEAMIRAGAF DSLHENRAQL LASVPLAMEA AEQAAANALQ AGLFDIGGVP
AHQHALVDEP AWDDKRRLQE EKGALGFYLS GHLFDAYRDE VRRFVRQKLG ELKEGRDKVV
AGVIASLRTQ MTQRGKMVIA LLDDGTGQCE VTVFNEQFDA NRALFKEDEL LIVQGQARND
AFTGGIRFTA ESVMDLERAR SRYAQAVRMT MNGNADAAAL RRVLEAHVAK PDETPPAAAP
APRGGRDGGR RAQAAIPNGL AVRIAYSNAR AQGEMRLGDA WRVKPSDALL ADLRAAFGGS
VVEIVY