Gene Bcep18194_B1789 details


Gene Information

Locus tag: Bcep18194_B1789
Symbol:
ID: 3753554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 2031729
End bp: 2032724
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 67%
IMG OID: 637766638
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_372547
Protein GI: 78062639
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
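
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes the start and end coordinates are 1-based and inclusive (an assumption, though it agrees with the stated 996 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(2031729, 2032724)
    print(length, "bp")                  # gene length from the coordinates
    print(protein_length(length), "aa")  # protein length from the gene length
```

With the values from this record, the two functions give 996 bp and 331 aa, matching the Gene Length and Protein Length fields.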


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.47034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: 0.532052
Num covering fosmid clones: 12
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATTCC ACTGGAAGAC GCTCGCACTG GCCATGGCGA GCGCGGCGCT GGTCGGCGGG 
GTACCCGCCC ATGCCGCCGA CCCCGCCGAC GTGAAAATCG GCTTCCTCGT CAAACAGCCG
GACGATCCGT GGTTCCAGGA CGAATGGCGC TTCGCCGACC AGGCGGCGAA GGAAAAGCAC
TTCACGCTGA TCAAGATCGC GACGCCGAGC GGTGAAAAGG TGTCGACCGC ACTCGACAGC
CTCGCCGCGC AAAAGGCCCA AGGCGTGATC ATCTGCGCAC CCGACGTGAA GCTCGGCCCC
GGCATCGCGG CGAAGGCGAA GCGCGCGGGG ATCAAGCTGA TGTCGGTCGA CGACCAGCTC
GTCGACGGCC GCGGCGCGCC GCTCCAGGGC GTGCCGCACA TGGGCATCTC CGCGTACAAG
ATCGGCCAGC AGGTCGGCCA GGCGATCGCG GACGAAACGA AGCGGCGCGG CTGGAATCCG
GCCGAGGTCG GCATCATCCG GATCGCGTAC GACCAGTTGC CGACCGCGAA AGAGCGCACC
ACCGGCGCGG TCGACGCGCT CAAGGCTGCC GGCTTTCCGG CCGCGAACGT GATCGATGCC
CCCGAAATGA CGGCCGATAC CGAAGGCGCG TTCAACGCAG CGAACATCGC GCTCACCAAG
CACGCGAATT TCAAGCGCTG GGTGGCATTC GGCTCGAACG ACGACACGAC GGTCGGTGCG
GTACGTGCCG CCGAGGGGCG CGGCATCGGC GCCGACGCGA TGGTCGCGGT CGGCATCAAC
GGCAGCCAGG TCGCGCTCAA CGAATTCGCG AAGCCGAAGC CAACGGGCTT CTACGGCTCG
ATCCTGCTGA ATCCGCGCCA GCACGGCTAC CAGACGAGCA TCAACATGTA TGACTGGATC
ACGAAGAACC AGGCGCCGCC GCCGCTCGTG CTGACGTCCG GCACGCTGAT CACGCGCGAC
AACGAAAAGC AGGCGCGCGC CGCGCTCGGC CTGTGA
 
Protein sequence
MRFHWKTLAL AMASAALVGG VPAHAADPAD VKIGFLVKQP DDPWFQDEWR FADQAAKEKH 
FTLIKIATPS GEKVSTALDS LAAQKAQGVI ICAPDVKLGP GIAAKAKRAG IKLMSVDDQL
VDGRGAPLQG VPHMGISAYK IGQQVGQAIA DETKRRGWNP AEVGIIRIAY DQLPTAKERT
TGAVDALKAA GFPAANVIDA PEMTADTEGA FNAANIALTK HANFKRWVAF GSNDDTTVGA
VRAAEGRGIG ADAMVAVGIN GSQVALNEFA KPKPTGFYGS ILLNPRQHGY QTSINMYDWI
TKNQAPPPLV LTSGTLITRD NEKQARAALG L
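
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal Python sketch under stated assumptions: the sequences are pasted in as strings with the 10-character spacing still present, and the codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as starts). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGCGATTCC ACTGGAAGAC GCTCGCACTG"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree; `gc_content` on the same input can be compared with the GC content field.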