Gene Bphyt_5231 details

Gene Information

Locus tag: Bphyt_5231
Symbol:
ID: 6280559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010676
Strand:
Start bp: 1408690
End bp: 1409745
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 63%
IMG OID: 642616317
Product: 2OG-Fe(II) oxygenase
Protein accession: YP_001888960
Protein GI: 187919929
COG category: [R] General function prediction only
COG ID: [COG3491] Isopenicillin N synthase and related dioxygenases
TIGRFAM ID:
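
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1408690
end_bp = 1409745
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted as a residue in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # protein length matches
```

Under those assumptions all three checks agree with the values listed in the record.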


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAA CCCATACGAG CTTTCAGCAA CTACCCGTCG TCGACGTGAG CGGCCTGTTC 
AGCGATGACG ACGCCCAGCG GCTCGCCACC GCACGCGAAC TCGATCGCGC GGCACGCGAG
GCCGGCTTCT TCTATGTCAC CGGCCACCAG GTGTCACGCG CGCAACAAAG CGCGTTGATC
GAGCAGGCCA AACGCTTCTT CGCAGCCGAA CACGAATGGA AGATGCGCTA CTACATCGGC
AAGTCGACGG CGCATCGCGG CTACGTGCCC GAAGGCGAGG AAGTCTTCGC GGGCGGCAAG
CGCGACAGGA AGGAAGCCTT CGACACCGGC CGCGAACTGC CCGCCGACGA TCCCGACGTG
CGTGCCGGCA CGCCGATGCT CGGCCCGAAT AGCTGGCCGG AACAGGCGGG TTTTCGCGAA
GCGGTGGGCG GCTATTACAA AGCGGCGTTC GAACTCGGCC GCGCCCTGTT TCGAGGTTTC
TCGCTCGCGC TCGGCTTGCC TGAACAGCAC TTCGATCAGT ATCTACGCAA GCCGCCGAGC
CAGTTGCGTC TGATCCACTA TCCGCTCGAT CCGTCGGCGG AAGACCGGCC CGGCATCGGC
GCGCATACCG ACTACGAGTG CTTCACGATC CTGCTGCCGA CCGCGCCAGG ACTCGAAGTG
ATGAACGGCG AGGGCGAGTG GATCGATGCG CCGCCGGTCG AGAACGCATT CGTCGTGAAC
ATCGGCGACA TGCTGGAAGT GTGGACGGGC GGGACCTATG TCGCGACGTC GCATCGCGTG
CGCAAGGTGA GGGAGGAGCG CTACTCGTTT CCGCTCTTCT TTGCCTGCGA CTATCACACG
GTCGTTGCGC CGCTGCCGCA ATTCGCGACG CCCGAAGCGG TGGCGAAATA CCCGCCCGTA
TCGGCGGGCG ACCATCTGTT CGCGCAGACG GCGCAGAGCT TCACTTATCT GAAGGAACGT
CTTCAACGAG GAGAACTACT GTTGCCCGAT GGTTCGAAAG CGTTGGCAAG CTTCGGCCAA
CAGGCGCGCT ATGCGAGTGC GGAGATGGAC ACTTGA
 
Protein sequence
MNPTHTSFQQ LPVVDVSGLF SDDDAQRLAT ARELDRAARE AGFFYVTGHQ VSRAQQSALI 
EQAKRFFAAE HEWKMRYYIG KSTAHRGYVP EGEEVFAGGK RDRKEAFDTG RELPADDPDV
RAGTPMLGPN SWPEQAGFRE AVGGYYKAAF ELGRALFRGF SLALGLPEQH FDQYLRKPPS
QLRLIHYPLD PSAEDRPGIG AHTDYECFTI LLPTAPGLEV MNGEGEWIDA PPVENAFVVN
IGDMLEVWTG GTYVATSHRV RKVREERYSF PLFFACDYHT VVAPLPQFAT PEAVAKYPPV
SAGDHLFAQT AQSFTYLKER LQRGELLLPD GSKALASFGQ QARYASAEMD T
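
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, translate a DNA sequence into protein, and compute a GC fraction. It is an illustration only, not the tool that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Table 11 differs in the alternative start codons it allows, which does not matter here because this gene begins with ATG.

```python
# Standard codon assignments in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence from this record.
first_row = "ATGAACCCAA CCCATACGAG CTTTCAGCAA CTACCCGTCG TCGACGTGAG CGGCCTGTTC"
print(translate(first_row))  # MNPTHTSFQQLPVVDVSGLF
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` can be used in the same way to compare against the Protein Length and GC content fields.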