Gene Information |
Locus tag | Bphyt_5231 |
Symbol | |
ID | 6280559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | - |
Start bp | 1408690 |
End bp | 1409745 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642616317 |
Product | 2OG-Fe(II) oxygenase |
Protein accession | YP_001888960 |
Protein GI | 187919929 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
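The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Bphyt_5231.
length = gene_length(1408690, 1409745)
print(length, expected_protein_length(length))
```

With the values listed in the record, this reproduces the stated gene length of 1056 bp and protein length of 351 aa.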
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCAA CCCATACGAG CTTTCAGCAA CTACCCGTCG TCGACGTGAG CGGCCTGTTC AGCGATGACG ACGCCCAGCG GCTCGCCACC GCACGCGAAC TCGATCGCGC GGCACGCGAG GCCGGCTTCT TCTATGTCAC CGGCCACCAG GTGTCACGCG CGCAACAAAG CGCGTTGATC GAGCAGGCCA AACGCTTCTT CGCAGCCGAA CACGAATGGA AGATGCGCTA CTACATCGGC AAGTCGACGG CGCATCGCGG CTACGTGCCC GAAGGCGAGG AAGTCTTCGC GGGCGGCAAG CGCGACAGGA AGGAAGCCTT CGACACCGGC CGCGAACTGC CCGCCGACGA TCCCGACGTG CGTGCCGGCA CGCCGATGCT CGGCCCGAAT AGCTGGCCGG AACAGGCGGG TTTTCGCGAA GCGGTGGGCG GCTATTACAA AGCGGCGTTC GAACTCGGCC GCGCCCTGTT TCGAGGTTTC TCGCTCGCGC TCGGCTTGCC TGAACAGCAC TTCGATCAGT ATCTACGCAA GCCGCCGAGC CAGTTGCGTC TGATCCACTA TCCGCTCGAT CCGTCGGCGG AAGACCGGCC CGGCATCGGC GCGCATACCG ACTACGAGTG CTTCACGATC CTGCTGCCGA CCGCGCCAGG ACTCGAAGTG ATGAACGGCG AGGGCGAGTG GATCGATGCG CCGCCGGTCG AGAACGCATT CGTCGTGAAC ATCGGCGACA TGCTGGAAGT GTGGACGGGC GGGACCTATG TCGCGACGTC GCATCGCGTG CGCAAGGTGA GGGAGGAGCG CTACTCGTTT CCGCTCTTCT TTGCCTGCGA CTATCACACG GTCGTTGCGC CGCTGCCGCA ATTCGCGACG CCCGAAGCGG TGGCGAAATA CCCGCCCGTA TCGGCGGGCG ACCATCTGTT CGCGCAGACG GCGCAGAGCT TCACTTATCT GAAGGAACGT CTTCAACGAG GAGAACTACT GTTGCCCGAT GGTTCGAAAG CGTTGGCAAG CTTCGGCCAA CAGGCGCGCT ATGCGAGTGC GGAGATGGAC ACTTGA
|
Protein sequence | MNPTHTSFQQ LPVVDVSGLF SDDDAQRLAT ARELDRAARE AGFFYVTGHQ VSRAQQSALI EQAKRFFAAE HEWKMRYYIG KSTAHRGYVP EGEEVFAGGK RDRKEAFDTG RELPADDPDV RAGTPMLGPN SWPEQAGFRE AVGGYYKAAF ELGRALFRGF SLALGLPEQH FDQYLRKPPS QLRLIHYPLD PSAEDRPGIG AHTDYECFTI LLPTAPGLEV MNGEGEWIDA PPVENAFVVN IGDMLEVWTG GTYVATSHRV RKVREERYSF PLFFACDYHT VVAPLPQFAT PEAVAKYPPV SAGDHLFAQT AQSFTYLKER LQRGELLLPD GSKALASFGQ QARYASAEMD T
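The sequences above are displayed in space-separated blocks of 10 characters. The sketch below, which is not part of the original record, shows one way to strip that spacing, compute GC content, and translate the gene sequence. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TGA), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI "TCAG" order; table 11 shares these assignments
# with the standard code (it differs only in permitted start codons).
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGAACCCAA CCCATACGAG CTTTCAGCAA"
print(translate(prefix))
```

Applied to the first three blocks of the gene sequence, `translate` yields the first ten residues of the listed protein sequence. Running `gc_percent` and `translate` on the full gene sequence would allow a comparison with the GC content and protein sequence fields in the record.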