Gene Information |
Locus tag | Bcep18194_A5352 |
Symbol | |
ID | 3750563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 2412050 |
End bp | 2413060 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637763651 |
Product | Phage SPO1 DNA polymerase-related protein |
Protein accession | YP_369590 |
Protein GI | 78066821 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
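The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes one terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2412050
END_BP = 2413060
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1011
print(expected_protein_length(GENE_LENGTH_BP))  # 336
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.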
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.142834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCATGGG CTGAAGCGGC GCTCGAGGAA ATGGGCCTCG CGCAGATCTG GGTGCGGCGC GGGAAAGGCG CGGAGGCGGG CACGGACGAG GACGCGGCAG CCGAAGCGCA CGCCGCACCG GCCGTCACGG CCGAACGACC TGCGCGGATC GCACGTACAC CGGTACAGGA TGAGGCGCCG CCGGCAGTGG CGCGAAGCGT CGATGCTTCA CCGGCTCGCG AGCCGGCCGC CGCCCCGTCG CGCCGCGTCG ATGACGGCGA TCGCGCGCCA GCACCGGCCA CACCGGTCGC GTCGACCGAC ACGATGCCGC CAATGGACGA CATGCCGCCT GCCGGACCCG ACGATTTCGC GTGGTTCGAT GCGGCGCCGC CGGGTGATCC CGTGCCGGTG GCCGAATCGC GCCCCGTCGG TACGCCGGTT GCCGCGCTCG ACTGGGACGC GCTGGCCGCG CGCGTGTCGG ACTGTACACT TTGCCGGCTG TGCGAGAAGC GGACCAACAC CGTGTTTGGC GTCGGCGACC GTGAAGCAGA CTGGATGCTG ATCGGCGAAG CGCCGGGCGA GAACGAGGAC AAGCAGGGCG AGCCGTTCGT CGGCCAGGCC GGCAAGCTGC TCGACAACAT GCTGCAGTCG CTGTCGCTCA AGCGCGGCGA TAACGTGTAC ATCGCGAACG TGATCAAGTG TCGCCCGCCC GGCAACCGCA ATCCGGAGCC GGACGAGGTC GCGAGTTGCG AGCCCTATCT GCAGCGCCAG GTCGCGCTCG TGAAGCCGAA GCTGATCGTC GCGCTCGGCC GCTTCGCCGC GCAGACGCTG CTGAAGACGG ACGCGAGCAT TGCGTCGCTG CGCGGCCGCG TGCATGCGTA CGAAGGCGTG CCGGTGATCG TGACCTACCA CCCGGCGTAC CTGCTGCGCA GCCTGCAGGA CAAGTCGAAG GCATGGGCCG ACCTGTGCCT CGCGCGCGAT ACGTTCCAGC GTGCCGAGGG CGCCGACGCG AACGGACCGG CCGGACAATG A
Protein sequence | MAWAEAALEE MGLAQIWVRR GKGAEAGTDE DAAAEAHAAP AVTAERPARI ARTPVQDEAP PAVARSVDAS PAREPAAAPS RRVDDGDRAP APATPVASTD TMPPMDDMPP AGPDDFAWFD AAPPGDPVPV AESRPVGTPV AALDWDALAA RVSDCTLCRL CEKRTNTVFG VGDREADWML IGEAPGENED KQGEPFVGQA GKLLDNMLQS LSLKRGDNVY IANVIKCRPP GNRNPEPDEV ASCEPYLQRQ VALVKPKLIV ALGRFAAQTL LKTDASIASL RGRVHAYEGV PVIVTYHPAY LLRSLQDKSK AWADLCLARD TFQRAEGADA NGPAGQ
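The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the sequence is given in the coding orientation, as it appears in the record. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Only the first three 10-base blocks of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 0 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGGCATGGG CTGAAGCGGC GCTCGAGGAA"
print(translate(first_blocks))  # MAWAEAALEE
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.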