Gene Bcep18194_C6764 details


Gene Information

Locus tag: Bcep18194_C6764
Symbol:
ID: 3734548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007509
Strand:
Start bp: 283610
End bp: 284854
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 62%
IMG OID: 637760468
Product: putative dioxygenase
Protein accession: YP_366455
Protein GI: 78059880
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
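
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship; it assumes 1-based inclusive coordinates and a single terminal stop codon, and the function names are illustrative rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(283610, 284854)
print(length, protein_length(length))  # 1245 414
```

Both values agree with the Gene Length and Protein Length fields of this record.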


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.963222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATC CGATGGATTT CAACGGAAGC TTCACCGAAT CGGTTGGCCT GGAAACCGGC 
GCCGTGCCGA CGTCGATTTA TACCGACGCC GCACAGTACG AAGTCGAGCG CGAGCGGATC
TTTCGTCGCG CGTGGCTGAT GGTCGGGCGC GTCGAGCGCA TCCCGAAGCC GGGCGATTTC
TTTGTGAAGC AGCTTGCGGT GCTGAACGCA TCGGTTATCG TCGCGCGCTC GGAAGACGGC
ACGATTCGCG CATTTCACAA CGTCTGCGCA CACCGTGCGA ACATCGTCGA GCACCGCGCG
TCCGGCAATG CGAAGCGCTT CGTGTGCCGA TACCACAGCT GGGGATACGA CAATGCCGGG
CAACTGGCGC ATGTGCCCGA CGAATCCGGG TTCTTCGGCC TCGACAAGAA GAAATGCGGG
CTGACACCGG TTGCCCTCGA GATCTGGGAA GGCTGGATCT TCATCAACCT CGCGACGGAA
CCGGAACTCA GCCTCGCGGA ATTCCTCGGG CCGATTGCCG ACGCGCTCAC CGGCATCGAG
TACCAGAACC CGGATCACCC GATCGTGCTG CGCGGAGAAT TCAACGCGAA CTGGAAGGCT
GTCGCCGAGA ATTTCAGCGA GGCCTATCAC GTGGCATCGA TTCACCCGAA GACGCTGGCG
CCGTTCTACA TCGGGCAACA GAACCCGTTC GGCAGGCCGA TCAGCGTCCG GCTTCACGGC
CCGCATCGCT CGATGTCGTG GTGGCTCAAC CCGATCGCGC AGCCGTTCGC GAAATCGAAG
GTCGCGCAAT GGCTGTTCTC GGCCGACCAT TCGGTGACGG GCACCCGCAA GGCGGGGCAG
GTCAACTCGG TCGTCGAGCA CAAGGGCGTG AACCCGACGA AGCACGAGCA CTGGGCGAGC
GACGTCAACT GGTTCTTCCC GAACTGGCAC CTGCAGATCT CGGCCAATCA GTTCTGGACG
CACGAGTTCT GGCCGACGTC GGCGTCGACG ACCGTCTGGG AGGCGCGCTT CTACTACAAG
AAGGCGACGA CGGTCCGCGA GCGCCTGCAG CTCGAGCATT TCACGTCCCA CATCACCGAT
TCGATGCTCG AGGACCTGGG CAACATCGAG AGCATGCAGG TCGGCATGGC GTCCGGCGCG
AAGCCGGTCA TCTACTTCAA CGAGAGCGAG GTCCTGTGCC GTCACGGGAT CGAGCAGGTC
GTCAAATGGT CGGCGGCGCC GACGGCGAAA GGCGCGCTCG ACTGA
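
The GC content reported for this gene is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown here, in space-separated blocks of 10 bases; `gc_content` is a hypothetical helper name, and only the first line of the sequence is used as sample input.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence only; pass the full sequence
# to compare against the GC content field of the record.
first_line = "ATGAATAATC CGATGGATTT CAACGGAAGC TTCACCGAAT CGGTTGGCCT GGAAACCGGC"
print(f"{gc_content(first_line):.1%}")
```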
 
Protein sequence
MNNPMDFNGS FTESVGLETG AVPTSIYTDA AQYEVERERI FRRAWLMVGR VERIPKPGDF 
FVKQLAVLNA SVIVARSEDG TIRAFHNVCA HRANIVEHRA SGNAKRFVCR YHSWGYDNAG
QLAHVPDESG FFGLDKKKCG LTPVALEIWE GWIFINLATE PELSLAEFLG PIADALTGIE
YQNPDHPIVL RGEFNANWKA VAENFSEAYH VASIHPKTLA PFYIGQQNPF GRPISVRLHG
PHRSMSWWLN PIAQPFAKSK VAQWLFSADH SVTGTRKAGQ VNSVVEHKGV NPTKHEHWAS
DVNWFFPNWH LQISANQFWT HEFWPTSAST TVWEARFYYK KATTVRERLQ LEHFTSHITD
SMLEDLGNIE SMQVGMASGA KPVIYFNESE VLCRHGIEQV VKWSAAPTAK GALD
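
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact amino-acid string used for the standard code; translation table 11 is assumed to share these codon assignments and to differ only in which codons may act as initiators, which does not matter here because the gene starts with ATG. The helper `translate` is hypothetical and does not handle ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAATAATC CGATGGATTT CAACGGAAGC"))  # MNNPMDFNGS
```

The printed residues match the first block of the protein sequence, and the final 15 bases of the gene sequence translate to GALD followed by a TGA stop codon.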