Gene Information |
Locus tag | Bphyt_2481 |
Symbol | |
ID | 6283997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | - |
Start bp | 2800157 |
End bp | 2801401 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642622035 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_001896101 |
Protein GI | 187924459 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
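The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates (an assumption, though it agrees with the reported gene length) and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 2800157
END_BP = 2801401


def gene_length(start: int, end: int) -> int:
    """Return the length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Return the residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1245 414
```

Both results match the Gene Length (1245 bp) and Protein Length (414 aa) fields in the record.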
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.256124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000867118 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGCGCTGC ATGAATCGGT TCTGGAAGAG TTCGGACTCA CACCGCTGTG GGTGCGTCGT GGGATGGCAG CGGCGCCGGA GCCGGTTGCT GGACAACCGC AAGCGCAAGC CGCAGCCGAG GTCGGGCAGG GTGCGGGTGC GCATCGTGAT GAGGCGCGTG GCGTGGCATC AGCGATGCCG CCGGCTGCTC GCGCGCAGGA AACCTCGCAG CCCGCGATGC GCGCGGCCGG CGTTCAACAA GCCAACGCCG CGCCGGCGCC GGGCGAACGC GGCACGTCTG GCCGTCAGGA TCAGGCATCG CCCGTAGCCG GCGCACGCGA CCGGTTCGCT GATCGCCAGT TGGCGGCAGA GAGGCGCTCA GACGAGTCGC CGCGTTCCGA GCTGAACGGT GAACAGGCTG AAGGTCAAAG GCAGCCTTCA CAAGGCCCAA CGCGCGTCCC AACGCAAGGC CCAGCGCAAC CACCGACGCA AGCCCGGATG CAAGCCTCGG TCGAGGCAAA CGAGCCGCAG ACACCGCAGT CATCGCGTCC CCAGCAACCC CGCAGCCCCG AGCCGCCGAC ATTCGACGCG CCACCCAGCG ACGATTTCGC ATGGTTCGAC GATCTGCCGA CTCAAGCACC CGCAGACGCA CGCCTCGAAG CCCCCGCACC GCCGGCAATC CACACGCTCG ACTGGGACGC ACTGAGCGAG CGCGTCGCCG CCTGCCAGCT TTGCCGTCTG TGCGAGAAGC GCACGAACAC CGTATTCGGC GTCGGCGATC GCGGCGCCGA CTGGATGCTG ATCGGCGAAG CGCCGGGCGA GAATGAGGAT CGTCTGGGCG AGCCGTTCGT CGGCCAGGCC GGCAAGCTGC TGGACAACAT GCTGCGTTCG CTGACGCTCG CGCGCGACAC CAACGTCTAT ATCGCCAACG TGATCAAATG CCGGCCGCCC GGCAACCGCA ATCCCGAACC GGACGAAGTC GCGCGTTGCG AGCCATATCT GCAACGTCAG GTCGCGCTGG TCAAGCCGAA GCTGATCGTC GCGCTCGGCC GCTTCGCGGC GCAGAGTTTG CTGAAGACCG AGGCGAGCAT TTCGTCGCTG CGCGGCCGCG TGCACGAATA CGAAGGCGTG CCCGTGATCG TCACCTATCA CCCGGCGTAT CTGTTGCGCA GCCTGCCCGA TAAGGCGAAG GCCTGGGCCG ACCTGTGCCT CGCCCGCGAC ACGTGGCGGG CGGCGGGCGG TGCGCCGTCG AACGCGCCGA AGTGA
Protein sequence | MALHESVLEE FGLTPLWVRR GMAAAPEPVA GQPQAQAAAE VGQGAGAHRD EARGVASAMP PAARAQETSQ PAMRAAGVQQ ANAAPAPGER GTSGRQDQAS PVAGARDRFA DRQLAAERRS DESPRSELNG EQAEGQRQPS QGPTRVPTQG PAQPPTQARM QASVEANEPQ TPQSSRPQQP RSPEPPTFDA PPSDDFAWFD DLPTQAPADA RLEAPAPPAI HTLDWDALSE RVAACQLCRL CEKRTNTVFG VGDRGADWML IGEAPGENED RLGEPFVGQA GKLLDNMLRS LTLARDTNVY IANVIKCRPP GNRNPEPDEV ARCEPYLQRQ VALVKPKLIV ALGRFAAQSL LKTEASISSL RGRVHEYEGV PVIVTYHPAY LLRSLPDKAK AWADLCLARD TWRAAGGAPS NAPK
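The sequences above are printed in space-separated blocks of 10, so they need normalizing before analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated for comparison with the protein sequence. The helper names `normalize`, `gc_content`, and `translate` are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. The sketch does not handle alternative start codons; that does not matter here, since this gene starts with ATG. Only the first three blocks of the gene sequence are used as the example input.

```python
from itertools import product

# Standard codon-to-amino-acid assignment, bases ordered T, C, A, G.
# Translation table 11 uses the same assignment for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(raw: str) -> str:
    """Remove whitespace between the 10-character blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
excerpt = normalize("ATGGCGCTGC ATGAATCGGT TCTGGAAGAG")
print(translate(excerpt))  # MALHESVLEE
```

The translated excerpt matches the first block of the protein sequence. Running the same functions on the full gene sequence should reproduce the GC content, protein length, and protein sequence fields of the record.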