Gene Bcep18194_B1552 details


Gene Information

Locus tag: Bcep18194_B1552
Symbol:
ID: 3753317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 1750684
End bp: 1752375
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 67%
IMG OID: 637766401
Product: extracellular solute-binding protein
Protein accession: YP_372310
Protein GI: 78062402
COG category: [R] General function prediction only
COG ID: [COG4533] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
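
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database record.

```python
# Values copied from the record; the helper names are illustrative only.
START_BP = 1750684
END_BP = 1752375


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1692, matching "Gene Length"
print(protein_length(length_bp))  # 563, matching "Protein Length"
```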


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.851364
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACTGA TCGACCAGTT TCATCGGCTT GGCGAGTTTC TCGACGAGCA GGGCGGCCAG 
CCCGGGTTGC CGGCGCTTGC GCGGGCATTG AACTGCACCG AGCGCAACGT GCGGAGCCTG
CTGCGCAAGA TGGAGGCGCA AGGGTGGTTG CGGTGGGAGG CGGCGCGTGG CCGCGGCCAC
TTCTCGAAGC TGACGATGCT GGTTGCCCCG CAGCATGCGG TGCTCGATCG CCTGTCCTGC
CTGCTGGCGG ACGGTGAACT GGAGCAGGCG TTCGCGAGCC TCGGCGATGA GCAGCGCCAG
CAATTGTTGA AGCGGCTTCC GGATTTCCTC GGAATCCATC CGGCCGGCTC TCACGGCCAT
CGCTTGCGCA TTCCCCTCTA TCGGGCGGTG GACGAACTCG ATCCCTATCG GGTGATCAGC
CGGCTGGAGG CCCATCTGGT GCGGCAGATA TTTTCACGAC TGACCGAATT CGACAGGCAC
ACGCAGCGCG TCAGGCCGGC GCTGGCGCAT CACTGGGAGC CGGAAGAGGC AGGGCGGGTC
TGGCACTTCT GGCTCAGGCC GAACGTCCGT TTCCACGATG GCAGGCTTCT GGAGCCGGAA
GATGTGCGGT ACACCCTGCT GCGCATGCGC GACGAGCCGA GCCATTTCCA GCGCCTGTAC
CGGCATCTGC TCGACGTGGA AATCGGCGAA GGGCGGCGGA TCGTATGTCG TCTCGGCGAC
GTCGATCATC TCTGGCCGCA GCGCGTCGCG GCGGCCAACG CGTCGATCGT GCCACGCCGC
CGGAACGCCG ACTTTGCGCG CATGCCGGTC GGCACGGGGC CGTTCAGGCT GACGCGCCAC
AGCGACTACC GGATCACGTT GTCGGCATTC GGCGATCACT ATCGCGAGCG TGCGTTGCTC
GACGAGCTGG ATCTCTGGTT CCTGCCATCG GCCGAGCAGC CGGACGGATT CGATCTCCGA
TTCGGGTACT CCGCTTCTCA TGCGCCGGAG GAGAAGGGCA TCGTGCGCGT GCAGGCGGGC
TGTACGTACC TGGTCTGCAA CGCCACGCGC GAAGCGTTCC GCGAGCGTGC CGACCGGCTG
GCGCTGGCGG ATTGGCTCGC GCCAGCCCGC TTGTTCGGTC ACGACGATCC CGCGAGGCGG
CCGGCGGCCG GGCTGCTGCC GGCGTGGCGG CATCGCGTCG CGACACCAGC CGCCGAACCC
TTCGTGCCGC AATACACCGA GCTCACGCTG GTCACGGGGC AGACCGACGA TGAACGGGGC
CTGGCCCGTG CAATCGAGGC CAGATTGCGC GACGCGAATA TCCGGCTGAG CGTGTTGGCG
CTGCCTTATG CCGAGCTGAT CCGGCGCGAC TGGCGGGATT CGGCCGACCT GATGCTGGGC
AGCGAGATCC TGCACGACGA CGAGGATTTC GGCTGCTTCG AATGGTTCGG GGCCGACAGC
ATGTTCCGGC AATGGATGTC GGAACATGCC GCGCTCGAAC TGGACCGCCG GCTGCATGCG
GTCCAGGCGC AAGCCGATCC GCGCGCGCGG ATGGCGGACT ATGAGGTCAT CGGCAAGGAA
CTGGTCGATG CGGCGTGGTT GATCCCGATC TCGCACGAGC ACCAGCATGT CGAGCTGGCA
TCGCATGTTG CCGGTGTCGA CGAGGCCGCG CCGCTGGGGT TCGTGTCGTT CGCCGAGCTG
TGGGTGCGTT GA
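
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any length or GC calculation. The sketch below is one possible way to do that; the helper names are hypothetical, and the demo uses only the first line of the sequence. Applied to the full sequence, the cleaned length should match the 1692 bp given in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used here only as a short demo.
first_line = "ATGCGACTGA TCGACCAGTT TCATCGGCTT GGCGAGTTTC TCGACGAGCA GGGCGGCCAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```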
 
Protein sequence
MRLIDQFHRL GEFLDEQGGQ PGLPALARAL NCTERNVRSL LRKMEAQGWL RWEAARGRGH 
FSKLTMLVAP QHAVLDRLSC LLADGELEQA FASLGDEQRQ QLLKRLPDFL GIHPAGSHGH
RLRIPLYRAV DELDPYRVIS RLEAHLVRQI FSRLTEFDRH TQRVRPALAH HWEPEEAGRV
WHFWLRPNVR FHDGRLLEPE DVRYTLLRMR DEPSHFQRLY RHLLDVEIGE GRRIVCRLGD
VDHLWPQRVA AANASIVPRR RNADFARMPV GTGPFRLTRH SDYRITLSAF GDHYRERALL
DELDLWFLPS AEQPDGFDLR FGYSASHAPE EKGIVRVQAG CTYLVCNATR EAFRERADRL
ALADWLAPAR LFGHDDPARR PAAGLLPAWR HRVATPAAEP FVPQYTELTL VTGQTDDERG
LARAIEARLR DANIRLSVLA LPYAELIRRD WRDSADLMLG SEILHDDEDF GCFEWFGADS
MFRQWMSEHA ALELDRRLHA VQAQADPRAR MADYEVIGKE LVDAAWLIPI SHEHQHVELA
SHVAGVDEAA PLGFVSFAEL WVR
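
The protein sequence follows from the gene sequence by codon translation. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in which start codons they allow, and this gene starts with ATG). The names CODON_TABLE and translate are illustrative and do not come from the record.

```python
from itertools import product

# Standard codon assignments, ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence.
print(translate("ATGCGACTGA TCGACCAGTT TCATCGGCTT"))  # MRLIDQFHRL
print(translate("TGGGTGCGTT GA"))                     # WVR
```

The two demo outputs match the first ten and last three residues of the protein sequence listed above.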