Gene Bmul_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBmul_4021 
Symbol 
ID5769850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia multivorans ATCC 17616 
KingdomBacteria 
Replicon accessionNC_010086 
Strand
Start bp996962 
End bp997942 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content69% 
IMG OID641318324 
Productaliphatic sulfonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001583996 
Protein GI161520569 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.391639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.714624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGTT TCCCCCGCTG GATCGCCCGC ACCGTCGCCA CCGCACTCGT CGCCCTGTCG 
GCCGCGTCCG TCTGCGCGCC GGCGAGCGCG GGCCAGGTCG TACGCATCGG CTATCAGAAG
GCCGGGCTGC TCGCGATCAT CCACGCGCAG CATTCGCTCG AAGCGCGGCT GAAGCCGCTC
GGCTACGACG TGCAATGGTT CGAGTTTCCG GCCGGCCCGC AGCTGCTCGA AGCGCTGAAC
GCGAACGGCA TCGACTTCGG CTATACGGGC GCGCCGCCGC CTGTGTTCGC GCAGGCGGCC
GGCGTGCGCT TCGTGTACGT CGGCGCGGAG CCGCCGGCGC CGCACAACGA GGCGGTGTTC
GTGAAGGCCG ATTCGCCGAT CCGCTCGGTG GCCGAGCTGC GCGGCAAGCG CGTCGCGCTG
CAGAAAGGCT CGAGCGCGAA CTACCTGCTG CTCGAAGCGC TGAACAAGGC CGGCGTGCGC
TACGACGAGA TCCGCCCGGT CTACCTGCCG CCCGCCGATG CGCGTGCCGC CTTCGAAAGC
GGGCACGTCG ACGCGTGGGC CGTCTGGGAC CCGTATTACG CGGCCGCGCA AAACGCGTTG
AAGATCCGCA CGCTGTCCGA TTACACGGGC CTCACGCCGA CCAACAACTT CTACGAGGCG
ACGCGCGATT TCGCGCAGCA GCATCCCGAC GTGGTTGCCG CGATTCTCGC GCAGGCGCGC
GAGACCGGCG CATGGGTGAA CGGTCATCCG GCCGAGACGG CCGCGCTGAT CGCGCCGACT
GTCGGCATGC CGGCGCCGCT CGTCGAAACC TGGATCAAGC GCGTGCCGTT CGGCGCGGTG
CCGGTCGACG AGAAGATCGT CGCGGTTCAG CAGCGTGTCG CCGATGCGTT CCTCGCGGCG
AAGCTGATTC CGCAGAAGCT GAATGTCGCC GACAACGCGT GGATCGACCG CCGCGTCGCG
GCCGCGCTCG CCGCGAAATA G
 
Protein sequence
MIRFPRWIAR TVATALVALS AASVCAPASA GQVVRIGYQK AGLLAIIHAQ HSLEARLKPL 
GYDVQWFEFP AGPQLLEALN ANGIDFGYTG APPPVFAQAA GVRFVYVGAE PPAPHNEAVF
VKADSPIRSV AELRGKRVAL QKGSSANYLL LEALNKAGVR YDEIRPVYLP PADARAAFES
GHVDAWAVWD PYYAAAQNAL KIRTLSDYTG LTPTNNFYEA TRDFAQQHPD VVAAILAQAR
ETGAWVNGHP AETAALIAPT VGMPAPLVET WIKRVPFGAV PVDEKIVAVQ QRVADAFLAA
KLIPQKLNVA DNAWIDRRVA AALAAK