Gene Bcep18194_C7046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C7046 
Symbol 
ID3734615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp613807 
End bp615180 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content64% 
IMG OID637760748 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_366735 
Protein GI78060160 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.296981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGGAG CCAACATGTC CGCCACGATC GACCAAGCCA CCGACCTGGA CCACCTGCTC 
GCCACCGCCG TGCAGGACGA CAAGGAGGCC GGCGTATTCC GCTGCCGCCG CGACATCTTC
ACGAACGCCG AGCTGTACGA GCTCGAGATG AAGCACATCT TCGAGAGCAA CTGGGTGTAC
CTGGCGCACG AAAGCCAGAT CCCGGCGAAC AACGATTACT ACACGACGTG GATCGGCCGC
CAGCCGATCG TGATCACGCG CGACAAGACC GGCGAGCTGC ACGCGGTGAT CAATGCCTGC
GCGCACAAGG GCGCGATGCT GTGCCGCCGC AAGCACGGCA ACAAGGGCAG CTTCACGTGC
CCGTTCCACG GCTGGACCTT CTCGAACACC GGCAAGCTGC TGAAGGTGAA GGACGAGAAA
ACGACCGAGT ATCCGGTGCA GTTCAACACG CACGGCTCGC ACGACCTGAA GAAGGTCGCG
CGTTTCGAGA ACTATCGCGG CTTCCTGTTC GGCAGCCTCA GCGCCGACGT GCTGCCGCTC
GAGGAATACC TCGGCGAAGC GCGCGTGATC ATCGACCAGA TCGTCGACCA GGCGCCGAAC
GGGCTCGAAG TGCTGCGCGG CAACTCGTCC TACATCTACG AAGGCAACTG GAAGATGCAG
ATGGAGAACG GCTGCGACGG CTACCACGTC AGCACCGTGC ACTGGAACTA CGCGGCGACG
ATGGGCCGCC GCAAGGAAGA CGGCACCAAG GCCGTCGATG CGAACAGCTG GAGCAAGTCG
GTCGCGGGCG TGTACGGCTT CGACAACGGC CACATCCTGC TGTGGACGCA GACGATGAAC
CCGGAAGTGC GGCCCGTGTA CCAGCACCGT GAAGAGATCA AGGCGCGCGT CGGCGACGTG
CAGGCCGACT TCATCGTCAA CCAGACCCGC AACCTGTGCG TGTACCCGAA CGTGTTCCTG
ATGGACCAGT TCAGCACGCA GATCCGCGTC GTGCGGCCGC TCGGCGTCGA CAAGACCGAA
GTCACGATCT TCTGCTTCGC GCCGAAGGGC GAGAGCGAGA CCGATCGCAC GATCCGGATC
CGCCAGTACG AGGATTTCTT CAACGTGACG GGCATGGGCA CGGCCGACGA TCTCGAGGAG
TTCCGCGCAT GCCAGGCCGG CTATGCGGGC ATCACGGCGA TGTGGAACGA CCTGTCGCGC
GGCGCGCCGC TGTGGGTCGA CGGCCCGGAC GAGAACGCGA AGAAGATGGG GCTGAACCCG
CGCATTTCCG GCGAGCGCAG CGAGGACGAA GGGCTGTTCG TGTGCCAGCA CGAACACTGG
GTGCATGTGA TGCGCGATGC GCTGAAGAAG GAACGCGGGG AGGTGGCAGC ATGA
 
Protein sequence
MGGANMSATI DQATDLDHLL ATAVQDDKEA GVFRCRRDIF TNAELYELEM KHIFESNWVY 
LAHESQIPAN NDYYTTWIGR QPIVITRDKT GELHAVINAC AHKGAMLCRR KHGNKGSFTC
PFHGWTFSNT GKLLKVKDEK TTEYPVQFNT HGSHDLKKVA RFENYRGFLF GSLSADVLPL
EEYLGEARVI IDQIVDQAPN GLEVLRGNSS YIYEGNWKMQ MENGCDGYHV STVHWNYAAT
MGRRKEDGTK AVDANSWSKS VAGVYGFDNG HILLWTQTMN PEVRPVYQHR EEIKARVGDV
QADFIVNQTR NLCVYPNVFL MDQFSTQIRV VRPLGVDKTE VTIFCFAPKG ESETDRTIRI
RQYEDFFNVT GMGTADDLEE FRACQAGYAG ITAMWNDLSR GAPLWVDGPD ENAKKMGLNP
RISGERSEDE GLFVCQHEHW VHVMRDALKK ERGEVAA