Gene Bcep18194_C7702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C7702 
Symbol 
ID3734589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp1349194 
End bp1350336 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content64% 
IMG OID637761403 
Productputative dioxygenase 
Protein accessionYP_367390 
Protein GI78060815 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAT GTCTCGCGCA GCAGGACCTC GAGATCGCCG GCCGTGCATT CAATGCGAAG 
GACGTCGCCG CCTCGTCGAC GCTGAATGCG AAGTGCTACA CGGACCCGAA GTACTTTGAC
GCGGAACTGG ACGCCGTTTT CCGCCGGTCG TGGCAGTGGG TGTGTCATGC GGAAAAGGTG
CGCGAGCCGG GCGCCTACTA TGTGGCCGAC GTGGCCGGCC GGAGCATCGC CGTAGTGCGC
GACCGGGCCG GCAGCCTGCG CGCGTTCTAC AACGTCTGCA AGCACCGCGC GCATCACCTG
CTTGCCGGCG AAGGCAAGGC CCGCGTGATC ACGTGCCCGT ATCACGCGTG GAGCTACAAC
CTCGACGGCG GGCTGCGCCA GGCGCCGATG ACGGAGACGC TGATCGACTT CAGGAAAGAG
GAGATCTGCC TGTCGCAAGT GCAGGTCGAG GAATTCTGCG GGTTCGTGTA CGTGAACATG
GACCCGGCCG CCGCGTCGCT CGCGTCGCAA AGCGGGGACC TGAAAGCCGA GATCGACCGG
TTCGCGCCGG ACGTCGGCAC GCTGACGTTC GCGCGTCGTC TGCAGTTCGA TATCCGGTCG
AACTGGAAGA ACGTCGTCGA CAACTTCCTC GAGTGCTACC ACTGCCCGAC CGCCCACAAG
GATTTCGTGA GCCTGCTGCA GTTCGACACC TACAAGGTGA CGACGCACGG GATCTACTCG
AGCCACATGG CGAAAGGCGG CAACGCCGGC AACACCGCGT ACAGCGTGGA AGGCGCGACC
TGCGACGACC ACGCGGTGTG GTGGGTCTGG CCCAACACGT GCCTGCTGCG TTATCCGGGC
CGCGCGAATT TCCTCGTGCT GAACATCATC CCGGTGGCGC CCGATCGCAC GATCGAGACC
TACGACTTCT ACTTCGAATC CGCCGAGCCG ACGCCGCAGG AGATCGAGGC GATCAACTAC
GTCCGCGACG TGCTGCAGCA GGAGGACATC GACCTGGTCG AAAGCGTGCA GCGCGGCATG
AGCACGCCCG CGTTCGAAAG CGGCCGGATC GTGAGCGATC CGCAGCAGTC GGGCATGAGC
GAGCACGCGC TGCATCACTT CCACGGGCTG GTGCTGAAGG CCTATCAGGA CGCACTGAGG
TAA
 
Protein sequence
MSECLAQQDL EIAGRAFNAK DVAASSTLNA KCYTDPKYFD AELDAVFRRS WQWVCHAEKV 
REPGAYYVAD VAGRSIAVVR DRAGSLRAFY NVCKHRAHHL LAGEGKARVI TCPYHAWSYN
LDGGLRQAPM TETLIDFRKE EICLSQVQVE EFCGFVYVNM DPAAASLASQ SGDLKAEIDR
FAPDVGTLTF ARRLQFDIRS NWKNVVDNFL ECYHCPTAHK DFVSLLQFDT YKVTTHGIYS
SHMAKGGNAG NTAYSVEGAT CDDHAVWWVW PNTCLLRYPG RANFLVLNII PVAPDRTIET
YDFYFESAEP TPQEIEAINY VRDVLQQEDI DLVESVQRGM STPAFESGRI VSDPQQSGMS
EHALHHFHGL VLKAYQDALR