Gene Bcep18194_A4685 details


Gene Information

Locus tag: Bcep18194_A4685
Symbol:
ID: 3749890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 1679536
End bp: 1680525
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 68%
IMG OID: 637762979
Product: ABC transporter, substrate-binding protein, aliphatic sulphonates
Protein accession: YP_368924
Protein GI: 78066155
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
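
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the gene record.
START_BP = 1679536
END_BP = 1680525
GENE_LENGTH_BP = 990
PROTEIN_LENGTH_AA = 329


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon is excluded."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks are consistent with the record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```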


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00029583
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.149517
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCGA TGAACCGCCG CACATTTGCG CGCGTGATGC TGGCCGCCGG CCTGACTGCC 
GCCGGCGTCC GAACGCACGC CGAGAACGCA CCCGAGGTAT TGCGCATCGG CTACCAGAAG
TCGTCGACGC TCATCACACT GCTCAAGACG CGCGGCGCGC TCGAGCAGGC ACTGGCACCG
CTCGGCCTGC GCGTGTCGTG GCATGAATTC GCGAGCGGAT TGCCGCTGAC CGAAGCGCTC
AACGTCGGCG CCGTCGATTT CAGTGCCGAC GTGGCCGATA CGGTTCCGGT CTTCGCGCAA
GCCGCACGCG CGCGTTTCGT CTACGTCGCG CAGGAGGCGC CTTCGCCGAA GGCACAGGCG
ATCGTCGTCA AGCAGGACAG CGCGCTGCGC ACGCTCGCCG ATCTCAAGGG CAAGCGCATT
GCGGTCACGA AGGCGGCCGG CAGTCATTAC CTGCTGCTTG CCGCGCTCGC ACGCGCGAAG
CTCGGGCCGG CCGATGCGGC GATCCACTAC CTGACACCCG CGGACGGCCG CGCGGCGTTC
GAGCGTGGCA GCGTGGACGC CTGGATCACG TGGGACCCCT ATGTCGCGTC GGTCGACCGG
AATCCCGACG TGCGAATTCT GGCCGACGGC AACGGACTCG CGTCGTATCA GCGCTACTAC
CTCGCATCCA GCAGTTTCGC CGCCGCGCGG CCCGACGTCG TCCAGATCGT GTTCGACCAG
CTGTCGCAGG CCGGCGCATG GCTGCGCGAT CACCCGCAGG AGGCCGCGAA TACGCTCGCG
CCGATCTGGG GGCTCGACGC GGCGACGATC GCGCGCGCAA ACGCACGGCG CAGCTATGTC
GTTCGTGCTG TGGTCGCGCA AAACTTCGGC GAACAGCAGA AGATCGCCGA CACGTTCCTC
GCAGCCGGGC TACTGCCTGC CCGCGTCGAC ACGAGCCAGG CGCAGCGCTG GAATTTCGTT
GCGAAACGTG CTGATCCGGT CGGCGCGTGA
 
Protein sequence
MTSMNRRTFA RVMLAAGLTA AGVRTHAENA PEVLRIGYQK SSTLITLLKT RGALEQALAP 
LGLRVSWHEF ASGLPLTEAL NVGAVDFSAD VADTVPVFAQ AARARFVYVA QEAPSPKAQA
IVVKQDSALR TLADLKGKRI AVTKAAGSHY LLLAALARAK LGPADAAIHY LTPADGRAAF
ERGSVDAWIT WDPYVASVDR NPDVRILADG NGLASYQRYY LASSSFAAAR PDVVQIVFDQ
LSQAGAWLRD HPQEAANTLA PIWGLDAATI ARANARRSYV VRAVVAQNFG EQQKIADTFL
AAGLLPARVD TSQAQRWNFV AKRADPVGA
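
As a rough way to relate the gene sequence to the protein sequence, the following sketch translates space-separated 10-base blocks like those shown in this record. It is a minimal illustration, not the database's own method. It assumes the codon assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for the example.

```python
from itertools import product

# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; stop codons are shown as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence from this record.
first_blocks = "ATGACGTCGA TGAACCGCCG CACATTTGCG"
print(translate(first_blocks))  # MTSMNRRTFA
```

Passing the full gene sequence to `translate` should give the protein sequence followed by a trailing `*` for the stop codon, and `gc_fraction` on the full sequence can be compared against the GC content field.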