Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_5994 |
Symbol | |
ID | 6280505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | + |
Start bp | 2221842 |
End bp | 2223482 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642617055 |
Product | Uracil-DNA glycosylase superfamily |
Protein accession | YP_001889698 |
Protein GI | 187920666 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.263417 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCA TTGCTATCGA ACCGTCTTTT GCCGCCTGGC GCCATGCGGC GCGGGAACTT CTGCGGCAAG GCGTCGAGCC GGAGCGGATC GAGTGGGTTG AATGCGATCC GTGCGATTCG GCCGGCTCGG ATAGCGGCAA CGCAAGCGCT CAACAAGGTT CCGCCACCGC TCAGGCAACT CCCACGCCCG CGATTCCTCG TGAGCTGCTC GCCTGGCTGA AAACGGCGGC CTGCTATTAC GCGCCGGACC GTTGGCCGCT GCTCTATCGC ATCCTGTGGC GTTGGACACA CGGCGAACGC CACGTCCTCG ATCTGCAAGA CGTGGACGGC GCGTTGCTCG ATCAGCGCAT TCAGTCGGTC GAGCATGAAA TCAGTGATCT GGTGGCGCTC ACCCTGTTCA GGCGACGCGA TCCGTCCATG GGAGCGCCGG AATTCGTCGG CTGGTACGAG CCGCATCATG ATCTGCTGGC GCAGGCCGCC GAGCGTTTTG CCGAGCGCAT GGGCGATTCC ACGTGGATGC TGGCGACGCC GCAAGGCGCG GCGTTCTGGA ACGGCATGCT GCTGCAAATC AGCCGGCCGG CGACGGAAGA AAACGGGCAC GTCACGCACG TTTTGCCGGC CGGTCAGTTG AACGGGCCAG CCACGCCAGC ATTGCCGCCG GGCCAGCGGA AAGGCCCGAC TACTCCGACA TTGCCGAAGA GCCAGCCGAA CGGGCCGACT GCTCCGACAT TGCCGCCGAG CCAGCGGAAC GGCCCAACCA CGCCGACATT ACCGCCGAGC CAGCGGAGCG AGCCGACCAC TCCGACATTG CCCCGCGCCG CCATGGCTGG CGAAGCCACG ACCAGTGAAC CCACCGAGGC CCTCTGGCTC GCCTACTACG CCAGCGTATT CAATGGCGCG CCGACGCCCA TGCCGCTGCG TTACTGGAAA ACACCCCCCG CTGGTCCGCC GCTCCCCGCG CGACTCGCGC GCGAACGGAG CCGTCTCGGC GCGCAGAGTG GCACTGTCAC CATTCCTGAG ACACCGCCGC TCGAATATTC GGCGGTGACG CCACCCTTGC GCGAGCCCAC CGGCCCGCTG CCCACCTGCC GACGCTGCGC GTTGTGGCGC AACGCGAAGC AGGCCGTCGC GGGCGCCGGT CCGGCTCGGG CCGCGCTCAT GGTGGTCGGC GAACAACCCG GCGAGCATGA GAACCAGCAC GGTGTGCCCT TCACGGGCCC CGCCGGTCAA CTGCTCGACA CGGTGCTCGT GCGCGCCGGC CTCGAACGCT CGGCGCTCTA TCTGACCTAT GCCGTCAAGC ATTACAAGTG GGAAACGCTG GAACAGCGGC GCGTCCATCG CACACCCGCG CGGCGCGAGG TCGAGGCTTG CCAGTACTGG CTGGACCACG AGCTCGCGCA GGTCGCGCCG CGCGTGGTCG TCACGCTCGG CGCAACGGCG CTGAAGGCGT TGACGGGCGC GCACGTCAAT CTGTCGGAGT ATCTGGGGCA GACCATCGAT CACGGCGGCC GGCTGATCGT GCCGACGTGG CATCCATCGT ATGCGCTGAA AATGGCCGAT GGACGGTTGC GCGAGGATAT CGTCGCGGGC ATGGTGGCAG CGTTCAGGCA CGCGGCAGGG TTGGCGGCGG AAGGGGCGTA G
|
Protein sequence | MKRIAIEPSF AAWRHAAREL LRQGVEPERI EWVECDPCDS AGSDSGNASA QQGSATAQAT PTPAIPRELL AWLKTAACYY APDRWPLLYR ILWRWTHGER HVLDLQDVDG ALLDQRIQSV EHEISDLVAL TLFRRRDPSM GAPEFVGWYE PHHDLLAQAA ERFAERMGDS TWMLATPQGA AFWNGMLLQI SRPATEENGH VTHVLPAGQL NGPATPALPP GQRKGPTTPT LPKSQPNGPT APTLPPSQRN GPTTPTLPPS QRSEPTTPTL PRAAMAGEAT TSEPTEALWL AYYASVFNGA PTPMPLRYWK TPPAGPPLPA RLARERSRLG AQSGTVTIPE TPPLEYSAVT PPLREPTGPL PTCRRCALWR NAKQAVAGAG PARAALMVVG EQPGEHENQH GVPFTGPAGQ LLDTVLVRAG LERSALYLTY AVKHYKWETL EQRRVHRTPA RREVEACQYW LDHELAQVAP RVVVTLGATA LKALTGAHVN LSEYLGQTID HGGRLIVPTW HPSYALKMAD GRLREDIVAG MVAAFRHAAG LAAEGA
|
| |