Gene Bphyt_0780 details


Gene Information

Locus tag: Bphyt_0780
Symbol:
ID: 6283298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010681
Strand:
Start bp: 857359
End bp: 858705
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 64%
IMG OID: 642620344
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_001894427
Protein GI: 187922785
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
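
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Bphyt_0780
start_bp, end_bp = 857359, 858705
gene_length_bp, protein_length_aa = 1347, 448

print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the record: the coordinate span is 1347 bp, and 1347 / 3 = 449 codons, one of which is the stop codon, leaving 448 residues.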


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.90704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.708148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACCC CAACCCGTAC GCCGCACAAC GGCCGCTCGC GGCTCGAAAT CGAGCCCGGC 
TATCAGTCGG GCTTCGCGAA CGAATTCGCC ACCGAGGCCT TGCCGGGCGC GTTGCCACAA
GGCCGCAACT CGCCGCAGCG CGCGGCTTAT GGTCTGTACG CCGAACAACT GTCGGGCACG
GCGTTCACTG CGCCGCGCGG CCATAATCGC CGCTCGTGGC TGTACCGGAT TCGGCCGGCG
GCAGTGCATA TGCCGTTCAC GCCACTGCCC TCGGAGCGGC TCGTCGCCAA CTTCGCCGAG
GTGCCGCCGA CGCCGCCCAA CCAGTTGCGC TGGGACGCAT TGCCGATGCC GGCCGAGCCA
ACCGACTTCA TCGACGGCTG GGTGACGATG GCGGGCAACG GCGCGGCCGA GTCGATGACC
GGCTGCGCGA TTCACGTGTA CGCGGCGAAC CGCTCGATGC AGGACCGCTT CTTCTACACC
GCCGACGGCG AATTGCTGAT CGTGCCGCAG GAAGGACGCC TGCACATCGC CACCGAGTTC
GGCAAGCTCG ACGTCGGGCC CTTCGAAATC GCGGTGATTC CGCGCGGCGT GCGTTTCGCG
GTGAGTCTGC CGGACGGCGT GGCGCGCGGC TACATCTGCG AGAACTTCGG GGCGTTGCTG
CGTTTACCGG ATCTTGGTCC GATCGGCTCA AACGGCCTCG CCAATCCGCG TGATTTCCTC
ACGCCGCACG CGGCCTACGA AGACCGTGAA GGCGATTTCG AACTCGTCGC CAAGATGAAC
GGCAACCTGT GGCGCGCGGA CATCGGCCAT TCGCCGCTCG ACGTGGTCGC GTGGCACGGC
AATTACGCGC CCTACAAGTA CGATCTGCGC CGCTTCAACA CGATCGGCTC GATCAGCTTC
GACCATCCGG ATCCGTCGAT CTTTCTCGTG CTGCAATCGC AAACCGACAC GCCGGGTGTC
GATAGCATCG ACTTCGTGAT CTTCCCGCCG CGCTGGCTCG CCGCCGAAGA TACGTTCCGT
CCGCCCTGGT TCCATCGCAA TGTGGCAAGC GAGTTCATGG GCCTCGTGCA CGGCGTCTAC
GACGCCAAGG CCGAAGGTTT CGTGCCGGGC GGCGCGTCGT TGCATAACTG CATGTCGGGT
CATGGTCCCG ATGCGGAGAC GTTCGAGAAA GCCTCGCACA GCGATACGTC GACGCCGAAG
AAAGTCGGCG ACACGATGGC CTTCATGTTC GAAACCCGCA CGCTGATCAA GCCGACCCGC
TTCGCGCTCG AAACGGCGCA ATTGCAGGCG CATTACTACG AGTGCTGGCA AGGTCTCACG
AAACACTTCA ACCCGGAGCA ACGATGA
 
Protein sequence
MDTPTRTPHN GRSRLEIEPG YQSGFANEFA TEALPGALPQ GRNSPQRAAY GLYAEQLSGT 
AFTAPRGHNR RSWLYRIRPA AVHMPFTPLP SERLVANFAE VPPTPPNQLR WDALPMPAEP
TDFIDGWVTM AGNGAAESMT GCAIHVYAAN RSMQDRFFYT ADGELLIVPQ EGRLHIATEF
GKLDVGPFEI AVIPRGVRFA VSLPDGVARG YICENFGALL RLPDLGPIGS NGLANPRDFL
TPHAAYEDRE GDFELVAKMN GNLWRADIGH SPLDVVAWHG NYAPYKYDLR RFNTIGSISF
DHPDPSIFLV LQSQTDTPGV DSIDFVIFPP RWLAAEDTFR PPWFHRNVAS EFMGLVHGVY
DAKAEGFVPG GASLHNCMSG HGPDAETFEK ASHSDTSTPK KVGDTMAFMF ETRTLIKPTR
FALETAQLQA HYYECWQGLT KHFNPEQR
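
As a quick consistency check between the gene and protein sequences above, the first and last codons of the gene can be translated and compared with the ends of the protein. This is a minimal sketch: the `translate` helper is hypothetical, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (the two differ only in permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 27 nucleotides, copied from the gene sequence above
head = "ATGGATACCC CAACCCGTAC GCCGCACAAC"
tail = "AAACACTTCA ACCCGGAGCA ACGATGA"

print(translate(head))  # MDTPTRTPHN
print(translate(tail))  # KHFNPEQR*
```

The two fragments translate to `MDTPTRTPHN` and `KHFNPEQR*`, matching the first ten and last eight residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.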