Gene Bcep18194_B1152 details

Gene Information

Locus tag: Bcep18194_B1152
Symbol:
ID: 3752917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 1294575
End bp: 1295555
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 67%
IMG OID: 637766001
Product: ABC transporter, substrate-binding protein, aliphatic sulphonates
Protein accession: YP_371910
Protein GI: 78062002
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
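
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: the function name `cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    return gene_len, codons - 1


gene_len, protein_len = cds_lengths(1294575, 1295555)
print(gene_len, protein_len)  # prints: 981 326
```

Under these assumptions the result agrees with the listed gene length (981 bp) and protein length (326 aa).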


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.107849
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.542589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGTT TTACCCGCTG GATCGCCCGC ACCGCCGCCG TCACGCTCGT CGCGCTGTCC 
GCCACGCCGG CCTTCGCGCA AGGCGCCGCC GACAAGGTCG TGCGCATCGG CTACCAGAAA
GCCGGCCTGC TGTCGGTCAT CAAGGCGCAA GGTTCGCTCG AAGCGCGGCT CAAGCCGCTC
GGCTATGGCG TGCAGTGGTT CGAATTCCCG GCCGGCCCGC AACTGCTCGA GGCGCTGAAC
GCGAACAGCA TCGACTTCGG CTATACGGGT GCGCCGCCGC CCGTGTTCGC GCAGGCGGCC
GGCGTGCACT TCGTCTATGT CGGCGCGGAA CCGCCGGCAC CGCACAACGA AGCCGTGGTC
GTGAAGGCCG ATTCGCCGAT CCGCTCGGTC GCCGGCTTGC GCGGCAAGAA GATCGCGTTG
CAGAAGGGCT CGAGCGCGAA CTACCTGCTG CTCGAAGCAC TGAAGAAAGC CGGCGTGCGC
TACGACGAAA TCCGCCCGGT GTACCTTGCG CCGGCCGACG CGCGCGCCGC GTTCGAGAGC
GGCAACGTCG ATGCCTGGGT CGTCTGGGAT CCGTACTATG CAGCCGCGCA AAACGCGCTG
AAGGTGCGCA CGCTGTCCGA CTACACGGGC CTCGCGGCGA CCAACAACTT CTACGAAGCG
ACGCGAGACT TCGCGCAGCA GCATCCCGAC GTGGTCGGCG CGATCCTGAA GCAGGCGCGC
GAGACGGGCC AGTGGGTCAA CACGCATCCG GCCGACACGG CCGCGCTGAT CGCACCGAAG
GTCGGCCTGC CGCAGCCGCT CGTCGAAACG TGGATCAAGC GTGTCCCGTT CGGCGCGGTG
CCGCTCGACG ACAAGATCGT CGCGGCCCAG CAAGGTGTCG CCGATGCGTT TTATGCAGCG
AAGCTAATTC CGCAGAAGCT GAGCGTGGCC GACAACGCAT GGGTCGACAA AAGTGGGGCG
AGTGCGCTCG CGGCGAAATA G
 
Protein sequence
MIRFTRWIAR TAAVTLVALS ATPAFAQGAA DKVVRIGYQK AGLLSVIKAQ GSLEARLKPL 
GYGVQWFEFP AGPQLLEALN ANSIDFGYTG APPPVFAQAA GVHFVYVGAE PPAPHNEAVV
VKADSPIRSV AGLRGKKIAL QKGSSANYLL LEALKKAGVR YDEIRPVYLA PADARAAFES
GNVDAWVVWD PYYAAAQNAL KVRTLSDYTG LAATNNFYEA TRDFAQQHPD VVGAILKQAR
ETGQWVNTHP ADTAALIAPK VGLPQPLVET WIKRVPFGAV PLDDKIVAAQ QGVADAFYAA
KLIPQKLSVA DNAWVDKSGA SALAAK
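
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative assumption-laden example rather than the method used by the database: the helper names `translate_cds` and `gc_fraction` are hypothetical, the codon table is built by hand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate_cds("ATGATTCGTT TTACCCGCTG GATCGCCCGC"))  # prints: MIRFTRWIAR
print(translate_cds("AGTGCGCTCG CGGCGAAATA G"))           # prints: SALAAK
```

The two outputs match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value comparable to the GC content field, assuming that field is computed over the same span.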