Gene Bcep18194_B0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B0047 
SymbolaraH 
ID3751901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp52529 
End bp53548 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content67% 
IMG OID637764893 
ProductL-arabinose transporter permease protein 
Protein accessionYP_370808 
Protein GI78060900 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.149615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG CAATGCAACC CCAACGCACG TCCCCGTCCG CCGATGCCGC CGCCGTACCC 
GCGCGAGCGC GCGGCGGCGT GTGGCAGCTG ATCAACCGCT CCGGCATCGT GATGGTGTTT
CTCGTGCTGT TCGCGACGCT GTCGCTGACC GTGCCGGACT TCCTCACGCC GCGCAACATC
CAGGGCCTGC TGCTGTCGGT CACGCTGATC GGCTCGATCG CGGTGACGAT GATGTTCGTG
CTCGCGCTCG GCGAGGTCGA CCTGTCGGTC GCGTCGATCG TCGCGTTCTC GGGCGTCGTC
GCGTCGACGC TGATCACCGC GACGCACAGC GTCGTGCTCG GCATCGCGGG CGGCGTGCTC
GCGGGCGGTG CGGTCGGGCT CGTCAACGGC GTGCTGATCG CGCGCTGGCG GATCAACTCG
CTGATCGTCA CGCTCGCGAT GATGGAAGTC GTGCGCGGAC TCGCGTTCAT CACGTCGAAC
GGCGACGCGG TGATGATCTC CGAGGAGCGC TTCTTCGATC TCGGCGGCGG GTCGTTTCTC
GGCATCTCGT ATCCGATCTG GAGCAACATC GTCGGCTTCG TCGTGTTCGG CTTCCTGCTG
CGCAAGACGG TGTTCGGCAA GAACGTGCTG GCCGTCGGCG GCAACGGCGA GGCCGCGCTG
CTCGCGGGGC TGCCGGTGAT GCGCATCAAG ATCACCGTGT TCGTGCTGCA GGGGCTCGTG
ACGGGCTTCG CGGGCGTGAT GCTCGCGTCG CGGATGAGCC TCGGCGACCC GAAGACGTCG
GTCGGGCTCG AACTCGGCGT GATCTCCGCG TGCGTGCTCG GCGGCGTATC GCTGACGGGC
GGCGTCGCGA CGATCTCCGG CGTGCTGGTC GGCGTGCTGA TCATGGGCTC TGTCCAGGAT
GCGATGAGCC TGCTGAACGT GCCGACGTTT TACCAATATT TGATACGCGG CGGGATTCTG
TTGCTCGCGG TGCTGTTCGA CCAGTATCGT CGCAACCAGC GGCGCGCGAT GAAGCTCTGA
 
Protein sequence
MSQAMQPQRT SPSADAAAVP ARARGGVWQL INRSGIVMVF LVLFATLSLT VPDFLTPRNI 
QGLLLSVTLI GSIAVTMMFV LALGEVDLSV ASIVAFSGVV ASTLITATHS VVLGIAGGVL
AGGAVGLVNG VLIARWRINS LIVTLAMMEV VRGLAFITSN GDAVMISEER FFDLGGGSFL
GISYPIWSNI VGFVVFGFLL RKTVFGKNVL AVGGNGEAAL LAGLPVMRIK ITVFVLQGLV
TGFAGVMLAS RMSLGDPKTS VGLELGVISA CVLGGVSLTG GVATISGVLV GVLIMGSVQD
AMSLLNVPTF YQYLIRGGIL LLAVLFDQYR RNQRRAMKL