Gene Bphy_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_1042 
Symbol 
ID6242540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010622 
Strand
Start bp1171552 
End bp1172562 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID642592821 
Productaminodeoxychorismate lyase 
Protein accessionYP_001857277 
Protein GI186475807 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0148163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCC TGAAGAAATG TCTCATCGCC TGCGTCCTGC TCGCGATGCT GATCGGTGCC 
GCGGCATACG GCGCATACCG CTGGGCTACG ACACCCGTCC AACTCGCCAC ACCGCAACTC
GACGTCACGA TCAAGCCGCA TAGCAGTCTG CGCAGCGTCA CGACGCAGCT CAATCGCGGC
GGCGTACCCG TCGAGCCCGA ACTGTTCGTG CTCATGACGC GCGTGCTCGG CCTCCAGACG
GCGCTCAAAT CGGGCAACTA CGAGTTCAAG CAAGGCATCA CGCCGTACGA CGTGCTGCAA
AAGATCGCGC GCGGCGACGT GAACGAATAC GTGGCCACCA TCATCGAGGG CTGGACGTTC
AAGCATATGC GCGCCGAACT GGATGCCAAC CCGGCTTTGA AGCACGACAC GGCGGGCATG
GCGGACGCGG ATCTCCTGAA GGCGATCGGC GCCCCCGAAA CGCCGACGGG CACGGGCGAG
GGGCTGTTCT TCCCCGACAC CTATCTGTTC GACAAGGACA CGAGCGACCT CGACGTCTAT
CGCCGCGCGT ACCGGCTGAT GAAGCTGCGC ATCGACGAAG CGTGGGCGGC GCGTGCGCCC
GGTCTGCCGT ACAAGACGCC GTACGACGCG CTGACGATGG CGTCGATCGT CGAAAAGGAA
ACGGGCAAGG CGTCCGACCG CGCGATGGTC GCGGGCGTGT TCGCCAACCG TCTGCGCGCG
GGCATGCCGC TGCAGACGGA CCCGACCGTC ATCTACGGAA TGGGCGACAG CTACACGGGG
CATCTGCGCA AGAAGGACCT GCAGACGGAC ACTCCCTACA ATACCTATAT GCGAATGGGC
CTGCCGCCTT CGCCGATTGC GCTGCCCGGC GTCGCATCGC TGCAGGCCGC GCTCAATCCG
GCGCCGACCA GCGCGCTGTA TTTCGTGTCG CGCGGCGACG GCAGCAGCAT CTTTTCTGAC
ACGCTCGGCG ATCACAACAA GGCCGTCGAC AAGTACATTC GAGGGCAATG A
 
Protein sequence
MSLLKKCLIA CVLLAMLIGA AAYGAYRWAT TPVQLATPQL DVTIKPHSSL RSVTTQLNRG 
GVPVEPELFV LMTRVLGLQT ALKSGNYEFK QGITPYDVLQ KIARGDVNEY VATIIEGWTF
KHMRAELDAN PALKHDTAGM ADADLLKAIG APETPTGTGE GLFFPDTYLF DKDTSDLDVY
RRAYRLMKLR IDEAWAARAP GLPYKTPYDA LTMASIVEKE TGKASDRAMV AGVFANRLRA
GMPLQTDPTV IYGMGDSYTG HLRKKDLQTD TPYNTYMRMG LPPSPIALPG VASLQAALNP
APTSALYFVS RGDGSSIFSD TLGDHNKAVD KYIRGQ