Gene BURPS1106A_A0618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0618 
Symbol 
ID4903597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp599682 
End bp601904 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content71% 
IMG OID640143724 
Producthelix-hairpin-helix DNA-binding motif-containing protein 
Protein accessionYP_001074654 
Protein GI126456341 
COG category[L] Replication, recombination and repair 
COG ID[COG1796] DNA polymerase IV (family X) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATCC ACAATGCCGA TTTCGCGGCG GTCTTCGCGG AGATCGCCGA CTTGCTCGAG 
ATACAGGGGG CCAATCCGTT TCGCGTGCGT GCGTACCGCA ACGCGGCGCG CACCATCGGC
GGGCTCGGCC GTGACATCGG CGCGCTGATC GCGGCCGGCC GCAGCCTCGA CGATATCCCG
ACGATCGGCG CCGACCTCGC CGGCAAGCTG CGCGAGATCG CGACGACGGG CACCTGCGCG
TTGCAGCGGC AACTGCGCGG GGCGCTGCCG GCGGCGCTCG TCGAGTTGCT CGGCGTGCCG
GGGCTCGGCG CGAAACGCGT GCGCGCGCTG CACGACGCGC TCGGCGTCGA GACGCTCGAG
CAACTGAAGA CGGCGGCCGA GCACGGCAAG ATCCGCGGGC TGCCCGGCTT CGGCGAGAAA
ACCGAGGCGC ACATCGCGGA GGCGATCGGC GCGCGGCTGC GGCGCAAGTC GCAGCGGTTC
CTGCTGTCGT TCGCGACGCA GTACCTGACG CCGCTGCTCA CGTATCTGCG CGAAACGCCG
GGCGTGTCCG AGGCGGTGGC GGCGGGCAGC TTCCGCCGGC GGTGCGAGAG CGTCGGCGAT
CTCGACATCG TCGTTACGTC GGGCGATCCG GCGAAGGTCT CGGCGCGCTT CGTCGAGTAC
GGCGAAGTCG CGCGCGTGCT CGCGAGCGGC GATACGCGCT CGAGCGTCGT GCTTCGCTGC
GGCATTCAGG CCGATCTGCG CGTGGTGTCG CCGGCGGCGC TCGGCGCGGC GCTCGTCTAT
TTCACCGGCT CGAAGGCGCA TAACATCGCG ATGCGGCGCA TCGCGCAGGC GCGCGATCTG
AAGATCAACG AATACGGCGT GTTCGACGGC GAGCGGCGCA TCGCCGGCGC AACCGAGGAA
TCGGTCTACG CGTCGATCGG CCTCGCATGG GTGCCGCCCG AGCTGCGCGA GAACCGGGGC
GAGATCGAAG CCGCGCGCGA GGGCCGGCTG CCGGCCCTCG TCGAGCGCAA GCATCTGCGC
GGCGACCTGC ATGCGCACAC GAACGCGACC GACGGGCGCG ACAGCCTGCG GGACATGGCG
CTCGAGGCGC GCAAGCGCGG CCTCGATTAT CTGGCGATCA CCGATCATGC GCGCGGGCTC
GGCGTCGCGC ACGGCCTCGA CGCGGAGCGT CTCGCCAGAC AAATCGACGA GATCGACCGC
TTGAACGAGA CACTCGACGG CATCGTGCTC CCGAAGGGGA TCGAGGTCGA CATTCTGGAG
GACGGCAGCC TCGATCTGCC CGACGGCGTG CTCGCGCGGC TCGATCTCGT GGTCGGCGCG
GTTCACGGCC ATTTCGATTT GTCGCGCGCC GCGCAGACCG AGCGCGTGCT GCGCGCGATG
GACCATCCGT ATTTCTCGAT CCTCGCGCAT CCGTCGGGGC GGCTGCTCGG CGAGCGCGAC
GCGTGCGACA TCGATCTCGC CCGCGTGATC GAGCACGCTC GCGCTCGCGG CTGCCATCTG
GAGCTGAACG CGCAGCCGCA GCGGCTGGAC CTCGCCGATG TCTGGTGCCG GCATGCGGCC
GAGGCGGGCG TGCTCGTGTC GATCGATTCG GACGCGCATC GGCGCGAGGA TCTGGGCCAT
CTCGGGATCG GCGTCGATCA GGCGCGGCTG GCTGACGAAG GCGCAGGTGC TCAACACGCG
CACGCTCGCG CAGTTGCGGC CGCTGCTCGC GCGGACGATG GGCGGCGGCG CGATGTCGGT
GTCGGCGTCC GAGCCGGCGC CTGTTCCGGC GCCCGTGTCT GCGTCGAAAT CGGCATCGGC
ATCGACATCG ACATCGACAT CGACATCGAC ATCGACATCG ACATCGGCAT CGGCATCGGC
ATCGACGGGC GCTTCGCGAA AGCGTTCGTC CGGCAAGCGC GACACGGCGG GCAGCGCCGA
AGGCGGCGCC CGTCGCACGA AGAAGACGCG GCGCCCGCCC GCCTGAATGC GAAGCGGCGG
CGCCGGCGGT TGCGAGCGTC GATGCGCGCC TGCGGCCGGG CTCGTCCGTG CATCGATGCG
GCACCGTCAT GTTCGGCGCG CTTTCCGCAG CCGCGCGATG CGCGCCGCGC GCGAGCGAAC
GGCGGCGCCC TTCGCCGGCG CCGCATCCGC GCGAGCGCGC GGCCGGCCGT CGCGGCGGCG
CGCGCCGCCG CCATTCATTT CGCTGTGCAG AAAGCGCTTA TCGTTGCCGT TGCCGCATGT
TAG
 
Protein sequence
MPIHNADFAA VFAEIADLLE IQGANPFRVR AYRNAARTIG GLGRDIGALI AAGRSLDDIP 
TIGADLAGKL REIATTGTCA LQRQLRGALP AALVELLGVP GLGAKRVRAL HDALGVETLE
QLKTAAEHGK IRGLPGFGEK TEAHIAEAIG ARLRRKSQRF LLSFATQYLT PLLTYLRETP
GVSEAVAAGS FRRRCESVGD LDIVVTSGDP AKVSARFVEY GEVARVLASG DTRSSVVLRC
GIQADLRVVS PAALGAALVY FTGSKAHNIA MRRIAQARDL KINEYGVFDG ERRIAGATEE
SVYASIGLAW VPPELRENRG EIEAAREGRL PALVERKHLR GDLHAHTNAT DGRDSLRDMA
LEARKRGLDY LAITDHARGL GVAHGLDAER LARQIDEIDR LNETLDGIVL PKGIEVDILE
DGSLDLPDGV LARLDLVVGA VHGHFDLSRA AQTERVLRAM DHPYFSILAH PSGRLLGERD
ACDIDLARVI EHARARGCHL ELNAQPQRLD LADVWCRHAA EAGVLVSIDS DAHRREDLGH
LGIGVDQARL ADEGAGAQHA HARAVAAAAR ADDGRRRDVG VGVRAGACSG ARVCVEIGIG
IDIDIDIDID IDIDIGIGIG IDGRFAKAFV RQARHGGQRR RRRPSHEEDA APARLNAKRR
RRRLRASMRA CGRARPCIDA APSCSARFPQ PRDARRARAN GGALRRRRIR ASARPAVAAA
RAAAIHFAVQ KALIVAVAAC