Gene Information |
Locus tag | BTH_II0605 |
Symbol | |
ID | 3845035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 705300 |
End bp | 708080 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637837910 |
Product | DNA polymerase I |
Protein accession | YP_438805 |
Protein GI | 83716432 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
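The coordinate and length fields above are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon. Both assumptions fit the values in this record, but the record itself does not state them. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(705300, 708080)
print(length)                  # 2781
print(protein_length(length))  # 926
```

Both results match the Gene Length (2781 bp) and Protein Length (926 aa) rows of the table.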
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.503998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCAAGCTAT CTGTATCGGG CTTACCATGC GATGCCTGAT TTGCGCGGCC CTGGCGGGGA GCCGACCGGA GCGCTCTACG GAATCATCAA TATGCTGCGC CGCATGCGCA AGGATGTCAG TGCAGAGTAT AGCGCGTGCG TGTTCGACGC CAAGGGCAAA ACTTTCCGCG ATGATCTGTA CGCCGATTAC AAGGCGCACC GCCCGCCGAT GCCGCCCGAT CTCGCGCTGC AGATCGAGCC GATCCACTCG GCCGTGCGCG CGCTCGGCTG GCCGCTCCTG ATGATCGAGG GCGTCGAGGC CGACGACGTG ATCGGCACGC TCGCGAAGCG GGCCGAACAG CACGGCATGA ACGTGATCGT ATCGACGGGC GACAAGGACC TCGCGCAACT CGTCACCGAT CGCGTCACGC TCATCAACAC GATGACGAAC GAGGCGCTCG ATCGCGACGG CGTGCTTGCG AAGTTCGGCG TGCCGCCGGA GCGGATCGTC GATTACCTGT CGCTCATCGG CGACACCGTC GACAATGTGC CGGGCGTCGA GAAGTGCGGG CCGAAGACGG CCGTCAAGTG GCTGACGCAG TACGGCTCCC TCGACGGCGT CGTCGAGCAT GCCGGCGAGA TCAAGGGCGT CGTCGGCGAC AACCTGCGCC GCGCGCTCGA TTTCCTGCCG CTCGCTCGCA AGCTCGTAAC GGTCGAGACG GCGTGCGAGC TCGCGCCGCA CGTCGAATCG TTCGACGCGT CGCTCGCGAC GGACGGCGAG GGCCGCGACG CGCTGCGCGA GATCTTCTCG ACGTACGGCT TCAAGACGTG GCTGCGCGAG CTCGACAGCG AGCCCGCCGC GAACGGCGCG GCCGCCGCGG CCGCTATGGC CGGCGCGGCG CAAGACCCGG CGGGCGGCGC GCCGGCCGAG CTGCCGCTTG CGATGGCGCG CGATTACATG ACCGTGCAGA CGTGGGAGCA GTTCGACGCG TGGCTCGCGA AGATTTCCGC GGCCGAGCTG ACCGCGTTCG ATACCGAGAC GACGTCGCTC GATCCGATGC TCGCGCAAAT CGTCGGCCTG TCGTTCTCGG TGGAGCCGGG CCACGCCGCG TACGTGCCGG TCGCGCATCG CGGCCCCGAC ATGCCCGCGC AACTGCCGCG CGACGAGGTG CTCGCGAAGC TCACGCCGTG GCTCGAGGAT GCGAGCAAGA AGAAGCTCGG CCAGCATCTG AAATACGATG CGCAGGTGCT CGCGAACTAC GGGATCGCGC TGAACGGCAT CGAGCACGAC ACGCTGCTCG AGTCGTACGT GCTCGAATCG CACCGCACGC ACGACATGGA CAGTCTCGCG CTGCGCCATC TCGGCGTGAG GACGATCAAG TACGAGGATG TCGCGGGCAA GGGCGCGCAG CAGATCGGCT TCGACGAGGT GCCGCTCGAG CAGGCGTCCG AATATGCGGC CGAGGACGCG GACATCACGC TGCAGCTGCA TCACGCGCTG TATCCGCAGA TCGCGCGCGA GCCGGGTCTC TCGCGCGTGT ATCGCGACAT CGAGATGCCG GTGTCGCTCG TGCTGCGCAA GATGGAGCGC ACCGGCGTGC TGATCGACAG CGACCGGCTG GGCCGTCAGA GCAGCGAGAT CGCGACGCGG CTCATCGAGC TCGAGCAGCA GGCGTACGGG CTTGCGGGCG GCGAATTCAA TCTCGGCTCG CCGAAGCAGA TCGGCCAGAT CTTCTTCGAG CGGCTGCAAC TGCCCGTCGT CAAGAAGACG 
CCGAGCGGCG CGCCGTCGAC CGACGAAGAG GTGCTGCAAA AGCTAGCCGA GGACTATCCG CTGCCCAAGC TGCTGCTCGA GCATCGCGGC TTGTCGAAGC TGAAGTCGAC CTACACCGAC AAGCTGCCGC GAATGGTCAA CCCGAACACG GGCCGCGTGC ACACGAACTA TGCGCAGGCG GTGGCGGTCA CGGGGCGGCT CGCTTCGAAT GATCCGAACC TGCAGAACAT TCCGGTGCGC ACGGCGGAAG GGCGGCGCAT CCGCGAGGCG TTCATCGCGC CGCCGGGCAG CAAGATCGTG TCGGCCGACT ATTCGCAGAT CGAACTGCGC ATCATGGCGC ACATTTCCGA GGACGAGTCG CTGCTGCGCG CGTTCGCGCA CGGCGAGGAC ATTCACCGCG CGACCGCGGC CGAGGTGTTC GGCGTGACGC CGCTCGAAGT GACGTCCGAT CAGCGGCGCA TCGCGAAGGT GATCAACTTC GGCCTGATCT ATGGAATGAG CTCGTTCGGC CTCGCGTCTA ATCTCGGCAT CACGCGGGAT GCGGCGAAGC TCTACATCGA CCGCTACTTC CTTCGCTATC CGGGCGTTGC CCGCTACATG GAGGAAACGC GCGCGCGCGC GAAGGAGAAG GGCTACGTCG AGACGGTGTT CGGCCGCCGC CTGTGGCTGC CCGAGATCAA CGGCGGCAAC GGGCCGCGCC GGCAGGCCGC CGAGCGCGCG GCGATCAACG CGCCGATGCA GGGCACGGCC GCCGATCTGA TCAAGCTGTC GATGATCGCG GTCGACGACT GGCTCGAACG CGGCGGCTTG CGCGCGCGGA TGATCATGCA GGTGCACGAC GAACTCGTGC TCGAGGTGCC GGAAAGCGAA CTGTCGATCG TGCGCGAGAA GCTGCCCGAG ATGATGTGCG GCGTCGCGAA ACTGAAGGTG CCGCTCGTCG CCGAGGTCGG CGCGGGCGAG AACTGGGAAG AGGCGCACTG A
|
Protein sequence | MPEERNLEGK TLLLVDGSSY LYRAYHAMPD LRGPGGEPTG ALYGIINMLR RMRKDVSAEY SACVFDAKGK TFRDDLYADY KAHRPPMPPD LALQIEPIHS AVRALGWPLL MIEGVEADDV IGTLAKRAEQ HGMNVIVSTG DKDLAQLVTD RVTLINTMTN EALDRDGVLA KFGVPPERIV DYLSLIGDTV DNVPGVEKCG PKTAVKWLTQ YGSLDGVVEH AGEIKGVVGD NLRRALDFLP LARKLVTVET ACELAPHVES FDASLATDGE GRDALREIFS TYGFKTWLRE LDSEPAANGA AAAAAMAGAA QDPAGGAPAE LPLAMARDYM TVQTWEQFDA WLAKISAAEL TAFDTETTSL DPMLAQIVGL SFSVEPGHAA YVPVAHRGPD MPAQLPRDEV LAKLTPWLED ASKKKLGQHL KYDAQVLANY GIALNGIEHD TLLESYVLES HRTHDMDSLA LRHLGVRTIK YEDVAGKGAQ QIGFDEVPLE QASEYAAEDA DITLQLHHAL YPQIAREPGL SRVYRDIEMP VSLVLRKMER TGVLIDSDRL GRQSSEIATR LIELEQQAYG LAGGEFNLGS PKQIGQIFFE RLQLPVVKKT PSGAPSTDEE VLQKLAEDYP LPKLLLEHRG LSKLKSTYTD KLPRMVNPNT GRVHTNYAQA VAVTGRLASN DPNLQNIPVR TAEGRRIREA FIAPPGSKIV SADYSQIELR IMAHISEDES LLRAFAHGED IHRATAAEVF GVTPLEVTSD QRRIAKVINF GLIYGMSSFG LASNLGITRD AAKLYIDRYF LRYPGVARYM EETRARAKEK GYVETVFGRR LWLPEINGGN GPRRQAAERA AINAPMQGTA ADLIKLSMIA VDDWLERGGL RARMIMQVHD ELVLEVPESE LSIVREKLPE MMCGVAKLKV PLVAEVGAGE NWEEAH
|
| |
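The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code). As a minimal sketch of that relationship, the snippet below translates the first and last few codons of the gene sequence and compares them with the ends of the protein sequence. The codon table is built from the standard amino-acid string in T, C, A, G base order, which table 11 shares with the standard code for elongation. Alternative start codons are not handled, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # The record prints sequences in space-separated blocks of 10, so strip whitespace.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence in this record.
print(translate("ATGCCTGAAG AACGAAATCT GGAAGGTAAG"))  # MPEERNLEGK
print(translate("GAAGAGGCGCACTGA"))                   # EEAH
```

The two outputs agree with the first ten and last four residues of the listed protein sequence.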