Gene Bcep18194_A4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4103 
Symbol 
ID3749291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1032124 
End bp1033338 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content66% 
IMG OID637762383 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_368344 
Protein GI78065575 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.274585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.294261 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG CCGCGGAAAC CGGCGCGCGC GTTGTCGCGA CCTTCACGTC GCCCGGCCAG 
GTGCGGCTCA CGTCGCTGCC GCCGCTCGCG CTGTACGTGC ATTTCCCGTG GTGCGTGCGC
AAGTGCCCGT ACTGCGATTT CAACTCGCAC GAGTGGAAGG GCGAGCGGTT TCCGGAAACC
GAGTATCTCG ACGCGCTGCG CGCCGATCTC GAGCAGGCGC TGCCGCTCGT GTGGGGCCGG
CAGGTGCATA CGATATTCAT CGGCGGCGGC ACGCCGAGCC TGCTGTCGGC GGCCGGTCTC
GACCGGATGC TGTCCGACGT GCGCGCGCTG CTGCCGCTCG ACGCCGATGC GGAGATCACG
CTCGAGGCCA ATCCGGGCAC GTTCGAGGCC GCGAAGTTCG CGCAGTTCCG CGCGAGCGGC
GTGAATCGCC TGTCGGTCGG CATCCAGAGC TTCAACGAGA CGCACCTGAA GGCGCTCGGC
CGGATTCACG ACACCACGCA GGCACGCGCC GCTGTCGAGA TCGCCGCGAA GAACTTCGAC
AACTTCAACC TCGACCTGAT GTTCGCGCTG CCGAACCAGA CGCTCGACGA ATGCCGCACC
GACGTCGAAA CCGCGCTGTC GTACGCGCCG CCGCATCTGT CGCTGTATCA CCTGACGCTC
GAGCCGAATA CGCTGTTCGC GAAGTTCCCG CCGGTCGTGC CCGACGACGA CGCGTCGGCC
GACATGCAGG AATGGATTCA CGCGCGCACG GCCGAGGCCG GTTACGGACA CTACGAAGTC
TCCGCGTATG CGAAGCCGAA TCATCAGTGC AAGCACAACC TGAACTACTG GCGCTTCGGC
GACTATCTCG GAATCGGTGC GGGCGCACAC ACGAAGCTGT CGTTCCCGAA CCGGATCCTG
CGGCAGGCAC GCTACAAGCA TCCGGCAACC TTCATCGAGC AGGCGATGGC CGGCACGGCC
GTGCAGGAAG AGCGTGAAGT CGGCGCGCGC GACCTGCCGT TCGAGTTCAT GCTGAACACG
CTGCGGCTCG TCGAGGGCTT CCCCGTGCAC AACTTCGCCG AACGCACGGG CCTGCCGATG
AGCACGATCG AGCCGGCGCT GCAGGAAGCG GAACGACGCG GGCTGATCGC GCGCGATTTC
GCGCAGATCG CGCCGACGCC GCTGGGCCAG CGTTTCCTCA ACGACCTGCA GGAATTGTTC
CTGCGCGACG ATTGA
 
Protein sequence
MSQAAETGAR VVATFTSPGQ VRLTSLPPLA LYVHFPWCVR KCPYCDFNSH EWKGERFPET 
EYLDALRADL EQALPLVWGR QVHTIFIGGG TPSLLSAAGL DRMLSDVRAL LPLDADAEIT
LEANPGTFEA AKFAQFRASG VNRLSVGIQS FNETHLKALG RIHDTTQARA AVEIAAKNFD
NFNLDLMFAL PNQTLDECRT DVETALSYAP PHLSLYHLTL EPNTLFAKFP PVVPDDDASA
DMQEWIHART AEAGYGHYEV SAYAKPNHQC KHNLNYWRFG DYLGIGAGAH TKLSFPNRIL
RQARYKHPAT FIEQAMAGTA VQEEREVGAR DLPFEFMLNT LRLVEGFPVH NFAERTGLPM
STIEPALQEA ERRGLIARDF AQIAPTPLGQ RFLNDLQELF LRDD