Gene Information |
Locus tag | BTH_I2003 |
Symbol | |
ID | 3848868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2266852 |
End bp | 2268153 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637841672 |
Product | phage SPO1 DNA polymerase domain-containing protein |
Protein accession | YP_442527 |
Protein GI | 83721489 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
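The coordinate, gene-length, and protein-length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2266852
end_bp = 2268153

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as stated in the record) and ends in a
# stop codon that is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" fields in the record.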
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.262884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCATTGG CTGAAGCGGC GCTCGAAGAG CTGGGACTCG CGCCCATGTG GGTGCGGCGC GGCGCGGCGC GCGCCGGCGG CGTGAACGAG GATGCGACGG GCGCGGCGCG GGAGACCGAC GTCGCGGCGG TTGCGCTAGG GGCGCAACCG GTGTCGGATG CGGCGCGCCG GATGTCGCAT GACGGCGCGC AAGGCAGGGG GCGCGGTGGT GCGGGGGCGC CGGCCGCGTC GACGTCTGCG GACGACGCAT CGGCCGACGG CGCCGCGCAC GCGGAATCGA TCGGGACCGC CGCGCTCGCC GACGGTCGCG CGGCGCCTGC GCGGCAGGCT GCCGACGCGC GCGGCCAAGC GCGGCAAGCG GCGGCCGAAT CCGGCATGCG GGCGGCGGAT GCGCCTGCGG CGTTCGAGTC GGGCGCGCGA AATTCGCGGG ACTCGACGAT CGCGCGCGCG GCTTCGACGG TCGAGCCGGT CGCCGCGGGC GGCGCGCGTC GCGTGCCGCC GCACGCCGCC GTCGCCGCCG CTGCCGCGGA ATCGGCGGCC TTCGAGCAGG ATGCGGCGCA GCCGGCACGT TCGTCGGCGG CGTCCGCGCC GGCCGCACGA ACCGGGGGCG ACGCCGGCGC GGCGGCAGCC GACGAAGACA TGTCGTGGTT CGATCTCGAG CCGGGGGTCG AGCCCGCGCC GCCTGACGTC GCCGCGGAGC CCGCCGCTCG CGCGCCGTCC GTCGCCGAGC TCGGCTGGGA CGAATTGCGC GCGCGCGTGG CCGACTGCGA GCGCTGCCGT CTTTGCGAGA AGCGCACGAA CACGGTGTTC GGCGTCGGCG ACGAGCGCGC GGACTGGATG CTCGTCGGCG AGGCGCCGGG CGAGAACGAG GACAAGCAGG GCGAGCCGTT CGTCGGCCAG GCGGGCAAGC TGCTCGACAA CATGCTGCGC GCGCTCGCGC TCAAGCGCGG CGAGAACGTC TATATCGCGA ACGTGATCAA GTGCCGGCCG CCCGGCAACC GCAATCCCGA GCCCGACGAG GTCGCGCGCT GCGAGCCGTA TCTGCAGCGA CAAGTCGCGC TCGTGAAGCC GAAGCTGATC GTCGCGCTCG GCCGCTTCGC CGCGCAGACG CTTCTCAAGA CGGACGGAAG CATCGCCTCG ATGCGCGGGC GCGTGCACGA GTACGAAGGC GTGCCCGTGA TCGTCACGTA CCATCCGGCG TATCTGCTGC GCAGCCTGCA GGACAAGGCG AAAGCCTGGT CCGATCTGTG CCTCGCGAAC GATACCTACC GGAGTGCCGC GCCGGCCGCC GATCCGCAAT GA
|
Protein sequence | MALAEAALEE LGLAPMWVRR GAARAGGVNE DATGAARETD VAAVALGAQP VSDAARRMSH DGAQGRGRGG AGAPAASTSA DDASADGAAH AESIGTAALA DGRAAPARQA ADARGQARQA AAESGMRAAD APAAFESGAR NSRDSTIARA ASTVEPVAAG GARRVPPHAA VAAAAAESAA FEQDAAQPAR SSAASAPAAR TGGDAGAAAA DEDMSWFDLE PGVEPAPPDV AAEPAARAPS VAELGWDELR ARVADCERCR LCEKRTNTVF GVGDERADWM LVGEAPGENE DKQGEPFVGQ AGKLLDNMLR ALALKRGENV YIANVIKCRP PGNRNPEPDE VARCEPYLQR QVALVKPKLI VALGRFAAQT LLKTDGSIAS MRGRVHEYEG VPVIVTYHPA YLLRSLQDKA KAWSDLCLAN DTYRSAAPAA DPQ
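The sequence fields are stored in space-separated blocks of ten residues. As a hedged illustration, the sketch below strips that spacing, defines a GC-percentage helper, and checks that the gene begins with ATG and ends with a stop codon, using only the first and last blocks copied from the record. The function names `clean_sequence` and `gc_percent` are hypothetical, and the GC content reported in the record may have been computed by the source database with a different rounding rule.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-residue blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First two and last two blocks of the gene sequence, copied verbatim from the record.
first_blocks = clean_sequence("ATGGCATTGG CTGAAGCGGC")
last_blocks = clean_sequence("GATCCGCAAT GA")

starts_with_atg = first_blocks.startswith("ATG")
ends_with_stop = last_blocks[-3:] in STOP_CODONS
```

To check the reported GC content, paste the full "Gene sequence" value into `clean_sequence` and pass the result to `gc_percent`.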