Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0618 |
Symbol | |
ID | 4903597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 599682 |
End bp | 601904 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640143724 |
Product | helix-hairpin-helix DNA-binding motif-containing protein |
Protein accession | YP_001074654 |
Protein GI | 126456341 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.138492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATCC ACAATGCCGA TTTCGCGGCG GTCTTCGCGG AGATCGCCGA CTTGCTCGAG ATACAGGGGG CCAATCCGTT TCGCGTGCGT GCGTACCGCA ACGCGGCGCG CACCATCGGC GGGCTCGGCC GTGACATCGG CGCGCTGATC GCGGCCGGCC GCAGCCTCGA CGATATCCCG ACGATCGGCG CCGACCTCGC CGGCAAGCTG CGCGAGATCG CGACGACGGG CACCTGCGCG TTGCAGCGGC AACTGCGCGG GGCGCTGCCG GCGGCGCTCG TCGAGTTGCT CGGCGTGCCG GGGCTCGGCG CGAAACGCGT GCGCGCGCTG CACGACGCGC TCGGCGTCGA GACGCTCGAG CAACTGAAGA CGGCGGCCGA GCACGGCAAG ATCCGCGGGC TGCCCGGCTT CGGCGAGAAA ACCGAGGCGC ACATCGCGGA GGCGATCGGC GCGCGGCTGC GGCGCAAGTC GCAGCGGTTC CTGCTGTCGT TCGCGACGCA GTACCTGACG CCGCTGCTCA CGTATCTGCG CGAAACGCCG GGCGTGTCCG AGGCGGTGGC GGCGGGCAGC TTCCGCCGGC GGTGCGAGAG CGTCGGCGAT CTCGACATCG TCGTTACGTC GGGCGATCCG GCGAAGGTCT CGGCGCGCTT CGTCGAGTAC GGCGAAGTCG CGCGCGTGCT CGCGAGCGGC GATACGCGCT CGAGCGTCGT GCTTCGCTGC GGCATTCAGG CCGATCTGCG CGTGGTGTCG CCGGCGGCGC TCGGCGCGGC GCTCGTCTAT TTCACCGGCT CGAAGGCGCA TAACATCGCG ATGCGGCGCA TCGCGCAGGC GCGCGATCTG AAGATCAACG AATACGGCGT GTTCGACGGC GAGCGGCGCA TCGCCGGCGC AACCGAGGAA TCGGTCTACG CGTCGATCGG CCTCGCATGG GTGCCGCCCG AGCTGCGCGA GAACCGGGGC GAGATCGAAG CCGCGCGCGA GGGCCGGCTG CCGGCCCTCG TCGAGCGCAA GCATCTGCGC GGCGACCTGC ATGCGCACAC GAACGCGACC GACGGGCGCG ACAGCCTGCG GGACATGGCG CTCGAGGCGC GCAAGCGCGG CCTCGATTAT CTGGCGATCA CCGATCATGC GCGCGGGCTC GGCGTCGCGC ACGGCCTCGA CGCGGAGCGT CTCGCCAGAC AAATCGACGA GATCGACCGC TTGAACGAGA CACTCGACGG CATCGTGCTC CCGAAGGGGA TCGAGGTCGA CATTCTGGAG GACGGCAGCC TCGATCTGCC CGACGGCGTG CTCGCGCGGC TCGATCTCGT GGTCGGCGCG GTTCACGGCC ATTTCGATTT GTCGCGCGCC GCGCAGACCG AGCGCGTGCT GCGCGCGATG GACCATCCGT ATTTCTCGAT CCTCGCGCAT CCGTCGGGGC GGCTGCTCGG CGAGCGCGAC GCGTGCGACA TCGATCTCGC CCGCGTGATC GAGCACGCTC GCGCTCGCGG CTGCCATCTG GAGCTGAACG CGCAGCCGCA GCGGCTGGAC CTCGCCGATG TCTGGTGCCG GCATGCGGCC GAGGCGGGCG TGCTCGTGTC GATCGATTCG GACGCGCATC GGCGCGAGGA TCTGGGCCAT CTCGGGATCG GCGTCGATCA GGCGCGGCTG GCTGACGAAG GCGCAGGTGC TCAACACGCG CACGCTCGCG CAGTTGCGGC CGCTGCTCGC GCGGACGATG GGCGGCGGCG CGATGTCGGT GTCGGCGTCC GAGCCGGCGC CTGTTCCGGC GCCCGTGTCT GCGTCGAAAT CGGCATCGGC ATCGACATCG ACATCGACAT CGACATCGAC ATCGACATCG ACATCGGCAT CGGCATCGGC ATCGACGGGC GCTTCGCGAA AGCGTTCGTC CGGCAAGCGC GACACGGCGG GCAGCGCCGA AGGCGGCGCC CGTCGCACGA AGAAGACGCG GCGCCCGCCC GCCTGAATGC GAAGCGGCGG CGCCGGCGGT TGCGAGCGTC GATGCGCGCC TGCGGCCGGG CTCGTCCGTG CATCGATGCG GCACCGTCAT GTTCGGCGCG CTTTCCGCAG CCGCGCGATG CGCGCCGCGC GCGAGCGAAC GGCGGCGCCC TTCGCCGGCG CCGCATCCGC GCGAGCGCGC GGCCGGCCGT CGCGGCGGCG CGCGCCGCCG CCATTCATTT CGCTGTGCAG AAAGCGCTTA TCGTTGCCGT TGCCGCATGT TAG
|
Protein sequence | MPIHNADFAA VFAEIADLLE IQGANPFRVR AYRNAARTIG GLGRDIGALI AAGRSLDDIP TIGADLAGKL REIATTGTCA LQRQLRGALP AALVELLGVP GLGAKRVRAL HDALGVETLE QLKTAAEHGK IRGLPGFGEK TEAHIAEAIG ARLRRKSQRF LLSFATQYLT PLLTYLRETP GVSEAVAAGS FRRRCESVGD LDIVVTSGDP AKVSARFVEY GEVARVLASG DTRSSVVLRC GIQADLRVVS PAALGAALVY FTGSKAHNIA MRRIAQARDL KINEYGVFDG ERRIAGATEE SVYASIGLAW VPPELRENRG EIEAAREGRL PALVERKHLR GDLHAHTNAT DGRDSLRDMA LEARKRGLDY LAITDHARGL GVAHGLDAER LARQIDEIDR LNETLDGIVL PKGIEVDILE DGSLDLPDGV LARLDLVVGA VHGHFDLSRA AQTERVLRAM DHPYFSILAH PSGRLLGERD ACDIDLARVI EHARARGCHL ELNAQPQRLD LADVWCRHAA EAGVLVSIDS DAHRREDLGH LGIGVDQARL ADEGAGAQHA HARAVAAAAR ADDGRRRDVG VGVRAGACSG ARVCVEIGIG IDIDIDIDID IDIDIGIGIG IDGRFAKAFV RQARHGGQRR RRRPSHEEDA APARLNAKRR RRRLRASMRA CGRARPCIDA APSCSARFPQ PRDARRARAN GGALRRRRIR ASARPAVAAA RAAAIHFAVQ KALIVAVAAC
|
| |