Gene Bcep18194_C7045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C7045 
Symbol 
ID3734497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp612763 
End bp613707 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content67% 
IMG OID637760747 
Productcatechol 1,2-dioxygenase 
Protein accessionYP_366734 
Protein GI78060159 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3485] Protocatechuate 3,4-dioxygenase beta subunit 
TIGRFAM ID[TIGR02439] catechol 1,2-dioxygenase, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.274631 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCA AAGTTTTCGA GTCCCGGGAA GTGCAGGATC TGCTGAAGGC CGCGTCGAAC 
GCGGGCGCGG ACAGCGCGAA GGGCGGCAAC GCGCGCACGC AGCAGGTCGT GCTGCGGTTG
CTGGGCGACC TGTTCAAGGC GATCGACGAT CTCGACATCA CGCCCGACGA AGTGTGGGCG
GGCGTCAACT ACCTGAACAA GCTCGGCCAG GACGGCGAAG CGGCGCTGCT CGCGGCCGGC
CTCGGCCTCG AGAAGTTTCT CGACATCCGG ATGGATGCCG CGGACAAGGC GGTCGGCCTC
GACGGCGGCA CGCCGCGCAC GATCGAAGGG CCGCTGTATG TGGCCGGCGC ACCGGTGCGC
GACGGCGTGT CGAAGATCGA CCTCGACGCG GATGACGGCG CGGGCCCGCT CGTGATCCAC
GGCACAGTCA CCGGCCTCGA CGGCAAGCCG ATCGCGGGCG CGCTGGTCGA ATGCTGGCAC
GCGAACTCGC ACGGCTTCTA TTCGCACTTC GACCCGACCG GCAAGCAGAG CGATTTCAAC
CTGCGCGGCG CGGTTAAGAC GGGCGCGGAC GGCAAGTACG AATTCCGCAC GCTGATGCCG
GTCGGCTACG GCTGCCCGCC GCACGGCGCG ACGCAGCAAC TGCTGAACGG TCTCGGCCGC
CACGGCAACC GTCCGGCGCA CGTGCACTTC TTCGTCGACA GCAACGACCA CCGCAAGCTG
ACGACGCAGT TCAACATCGA CGGCGATCCG CTGATCTGGG ATGACTTCGC GTATGCGACA
CGCGAGGAGT TGATCCCGCC CGTGGTCGAC AAGACCGGCG GCACGGCGCT CGGCATGAAG
GCCGATGCGT ACCAGGACAT CGAGTTCAAC TTTGTCCTGA CGCCGCTGGT GCAGGGCAAG
GACAACCAGG TCGTCCACCG CCTGCGCGCA GCCGCGACGG CGTAA
 
Protein sequence
MSVKVFESRE VQDLLKAASN AGADSAKGGN ARTQQVVLRL LGDLFKAIDD LDITPDEVWA 
GVNYLNKLGQ DGEAALLAAG LGLEKFLDIR MDAADKAVGL DGGTPRTIEG PLYVAGAPVR
DGVSKIDLDA DDGAGPLVIH GTVTGLDGKP IAGALVECWH ANSHGFYSHF DPTGKQSDFN
LRGAVKTGAD GKYEFRTLMP VGYGCPPHGA TQQLLNGLGR HGNRPAHVHF FVDSNDHRKL
TTQFNIDGDP LIWDDFAYAT REELIPPVVD KTGGTALGMK ADAYQDIEFN FVLTPLVQGK
DNQVVHRLRA AATA