Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0710 |
Symbol | |
ID | 4888111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 673928 |
End bp | 675856 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640130650 |
Product | DNA polymerase X family protein |
Protein accession | YP_001061709 |
Protein GI | 126444286 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.963352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATCC ACAATGCCGA TTTCGCGGCG GTCTTCGCGG AGATCGCCGA CTTGCTCGAG ATACAGGGGG CCAATCCGTT TCGCGTGCGT GCGTACCGCA ACGCGGCGCG CACCATCGGC GGGCTCGGCC GCGACATCGG CGCGCTGATC GCGGCCGGCC GCAGCCTCGA CGATATCCCG ACGATCGGCG CCGACCTCGC CGGCAAGCTG CGCGAGATCG CGACGACGGG CACCTGCGCG TTGCAGCGGC AACTGCGCGG GGCGCTGCCG GCGGCGCTCG TCGAGTTGCT CGGCGTGCCG GGGCTCGGCG CGAAACGCGT GCGCGCGCTG CACGACGCGC TCGGCGTCGA GACGCTCGAG CAACTGAAGA CGGCGGCCGA GCACGGCAAG ATCCGCGGGC TGCCCGGCTT CGGCGAGAAA ACCGAGGCGC ACATCGCGGA GGCGATCGGC GCGCGGCTGC GGCGCAAGTC GCAGCGGTTC CTGCTGTCGT TCGCGACGCA GTACCTGACG CCGCTGCTCA CGTATCTGCG CGAAACGCCG GGCGTGTCCG AGGCGGTGGC GGCGGGCAGC TTCCGCCGGC GGCGCGAGAG CGTCGGCGAT CTCGACATCG TCGTCACGTC GGGCGATCCG GCGAAGGTCT CGGCGCGCTT CGTCGAGTAC GGCGAAGTCG CGCGCGTGCT CGCGAGCGGC GATACGCGCT CGAGCGTCGT GCTTCGCTGC GGCATTCAGG CCGATCTGCG CGTGGTGTCG CCGGCGGCGC TCGGCGCGGC GCTCGTCTAT TTCACCGGCT CGAAGGCGCA TAACATCGCG ATGCGGCGCA TCGCGCAGGC GCGCGATCTG AAGATCAACG AATACGGCGT GTTCGACGGC GAGCGGCGCA TCGCCGGCGC AACCGAGGAA TCGGTCTACG CGTCGATCGG CCTCGCATGG GTGCCGCCCG AGCTGCGCGA GAACCGGGGC GAGATCGAAG CCGCGCGCGA GGGCCGGCTG CCGGCCCTCG TCGAGCGCAA GCATCTGCGC GGCGACCTGC ATGCGCACAC GAACGCGACC GACGGGCGCG ACAGCCTGCG GGACATGGCG CTCGAGGCGC GCAAGCGCGG CCTCGATTAT CTGGCGATCA CCGATCATGC GCGCGGGCCC GGTGTCGCGC ACGGCCTCGA CGCGGAGCGT CTCGCCAGAC AAATCGACGA GATCGACCGC TTGAACGAGA CACTCGACGG CATCGTGCTC CTGAAGGGGA TCGAGGTCGA CATTCTGGAG GACGGCAGCC TCGATCTGCC CGACGGCGTG CTCGCGCGGC TCGATCTCGT GGTCGGCGCG GTTCACGGCC ATTTCGATTT GTCGCGCGCC GCGCAGACCG AGCGCGTGCT GCGCGCGATG GACCATCCGT ATTTCTCGAT CCTCGCGCAT CCGTCGGGGC GGCTGCTCGG CGAGCGCGAC GCGTGCGACA TCGATCTCGC CCGCGTGATC GAGCACGCTC GCGCTCGCGG CTGCCATCTG GAGCTGAACG CGCAGCCGCA GCGGCTGGAC CTCGCCGATG TCTGGTGCCG GCATGCGGCC GAGGCGGGCG TGCTCGTGTC GATCGATTCG GACGCGCATC GGCGCGAGGA TCTGGGCCAT CTCGGGATCG GCGTCGATCA GGCGCGGCGC GGCTGGCTGA CGAAGGCGCA GGTGCTCAAC ACGCGCACGC TCGCGCAGTT GCGGCCGCTG CTCGCGCGGA CGATGGGCGG CGGCGCGATG TCGGTGTCGG CGTCCGAGTC GGCGTCTGTT CCGGCGCCCG TGTCTGCGTC GAAATCGGCA TCGACATCGG CATCGACATC GGCATCGACG GGCGCTTCGC GAAAGCGTTC GTCCGGCAAG CGCGACACGG CGGGCAGCGC CGAAGGCGGC GCCCGTCGCA CGAAGAAGAC GCGGCGCCCG CCCGCGTGA
|
Protein sequence | MPIHNADFAA VFAEIADLLE IQGANPFRVR AYRNAARTIG GLGRDIGALI AAGRSLDDIP TIGADLAGKL REIATTGTCA LQRQLRGALP AALVELLGVP GLGAKRVRAL HDALGVETLE QLKTAAEHGK IRGLPGFGEK TEAHIAEAIG ARLRRKSQRF LLSFATQYLT PLLTYLRETP GVSEAVAAGS FRRRRESVGD LDIVVTSGDP AKVSARFVEY GEVARVLASG DTRSSVVLRC GIQADLRVVS PAALGAALVY FTGSKAHNIA MRRIAQARDL KINEYGVFDG ERRIAGATEE SVYASIGLAW VPPELRENRG EIEAAREGRL PALVERKHLR GDLHAHTNAT DGRDSLRDMA LEARKRGLDY LAITDHARGP GVAHGLDAER LARQIDEIDR LNETLDGIVL LKGIEVDILE DGSLDLPDGV LARLDLVVGA VHGHFDLSRA AQTERVLRAM DHPYFSILAH PSGRLLGERD ACDIDLARVI EHARARGCHL ELNAQPQRLD LADVWCRHAA EAGVLVSIDS DAHRREDLGH LGIGVDQARR GWLTKAQVLN TRTLAQLRPL LARTMGGGAM SVSASESASV PAPVSASKSA STSASTSAST GASRKRSSGK RDTAGSAEGG ARRTKKTRRP PA
|
| |