Gene Bcep18194_B1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B1016 
Symbol 
ID3752781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp1143603 
End bp1144853 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID637765865 
Productmajor facilitator transporter 
Protein accessionYP_371774 
Protein GI78061866 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.180185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGGC TCGACTACAA GTTGCTCGGC CCCTGCCTGC TCGCCATCGC GATCGACGCG 
ATGGGTTTCG GGCTCGTGTA TCCGATGATG TCCGCGATCT TCAGCGATCC GCACGCGGGC
ATCCTGCCGG CCGACGCCGG CGCGCATGCG CGCAATTTCT ACCTGGGTCT CGGCTACGGG
ATCTACCCGT TGTGCATGTT CTTCGGCTCG TCGCTGATGG GCGACCTGTC GGATCGCTAC
GGCCGCCGCA GGATCCTGCT GCTGTGCGTG CTCGGTCTCG CGGCCGGCTA CGCGATGATG
GCGGCCGGCG CGTGGCATGC GAGCGTCGCG CTGCTGCTGG CCGGCCGTGG GCTTACGGGC
TTGATGGCCG GCTGCCAGGG CATCGCGCAG GCGGCGATCA CCGACCTGAG CACGCCGGAG
AACAAGGCGT ACAACATGAG CATCATGTCG CTCGCATTCA GCGCGGGCGT GATCGTCGGC
CCCGTGCTCG GCGGCGTGAC GTCCGACCGC ACGATTTCGC CGCTGTTCGA CTACGGCACG
CCGTTCGTAT TGGTCGCCGC GCTGTCGCTG GTCTGCGCGT TCTGGACCTG GGCGTCGTAT
CGCGATTCGG CCGCACCGCG CGGCGACACG CGCATCGACC CACTGCTGCC GCTGCGGATC
ATCGGGGAGG CCGCCCGCCA GCGCAACGTC GCGTTCCTGT CGGTGGTGTT CTTCCTGATG
CAGGTCGGCT ACGGGCTCTA CCTGCAGACC ATCATGCTGC TGCTGCAGGC GAAATTCGGC
TACACGAGCG CGCGGCTCGG GCTGTTCAGC GGCGTAATCG GGCTCTGCTT CGTGTTCGGG
CTGCTGTTCG TCGTGCGGCT GATGCTGCGC GTGTGGCGCG TGGTCGACAT CGCGAAGGCG
GGCCTGCTCG TCGCGGGCAT CGGCCAGGTC CTGTCCGCGC TGTTCCCGCA CGAGCCGGTG
CTGTGGGCGC TCGCGATGGT CGTCGGCTGC TTCGACATGG TCGCGTATAC GACGATGTAT
ACGGCGTTTT CCGATGCGGT CAGCGAGGAC CGGCAAGGCT GGGTGCTCGG CGTCGCGGGT
TCGGTGATGG CCGTCGCGTG GGTCGTCACG GGCGCGTTGA CGAACCTGCT GCCCGTGTTC
GGCGAGATCG GCCTGCTGCT GATCGGTGGC GTGGGCTTCC TGCTCAGCTT CGCGATGATG
GTCGCGTACG GCCGCACGCA GCCGGGTGCG CGAACCGCCG CGCTGTCGTA G
 
Protein sequence
MSRLDYKLLG PCLLAIAIDA MGFGLVYPMM SAIFSDPHAG ILPADAGAHA RNFYLGLGYG 
IYPLCMFFGS SLMGDLSDRY GRRRILLLCV LGLAAGYAMM AAGAWHASVA LLLAGRGLTG
LMAGCQGIAQ AAITDLSTPE NKAYNMSIMS LAFSAGVIVG PVLGGVTSDR TISPLFDYGT
PFVLVAALSL VCAFWTWASY RDSAAPRGDT RIDPLLPLRI IGEAARQRNV AFLSVVFFLM
QVGYGLYLQT IMLLLQAKFG YTSARLGLFS GVIGLCFVFG LLFVVRLMLR VWRVVDIAKA
GLLVAGIGQV LSALFPHEPV LWALAMVVGC FDMVAYTTMY TAFSDAVSED RQGWVLGVAG
SVMAVAWVVT GALTNLLPVF GEIGLLLIGG VGFLLSFAMM VAYGRTQPGA RTAALS