Gene Bcep18194_A3704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3704 
SymbolaraG 
ID3748888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp592414 
End bp593925 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content68% 
IMG OID637761984 
ProductL-arabinose transporter ATP-binding protein 
Protein accessionYP_367949 
Protein GI78065180 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.727825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGCGG CACTGCGTTT TGACAATATC GGCAAGGTCT TTCCTGGCGT GCGCGCACTC 
GACGGCATCT CGTTCGACGT GCACGCGGGC GAGGTGCATG GCCTGATGGG CGAGAACGGC
GCGGGCAAGT CGACGCTGCT GAAGATTCTC GGCGGCGAAT ACCAGCCCGA TGCGGGCAGC
GTGCTGGTCG ATGGCCAGCC CGTGCAGTTC GCGAGTGCGG CCGCGTCGAT CGCGGCCGGC
ATCGCGGTGA TTCACCAGGA GCTGCAGTAC GTGCCGGACC TGACGGTCGC GGAAAACCTG
CTGCTCGGCC GCCTGCCGAA TGCGTTCGGC TGGGTGCGCA AGCGCGAGGC GAAGCGTTAC
GTGCGCGAGC GGCTCGCCGC GATGGGCGTC GATCTCGATC CCGATGCGAA GCTCGGGCGG
CTGTCGATCG CGCAGCGGCA GATGGTCGAG ATCTGCAAGG CGCTGATGCG CAATGCGCGC
GTGATCGCGC TCGACGAACC GACCAGCTCG CTGTCGCATC GCGAGACGGA AGTGCTGTTC
AAGCTCGTCG ACGACCTGCG TGCGCAGGGC CGCGCGCTGA TCTACATCTC GCACCGGATG
GACGAGATCT ACCGGCTCTG CAATGCGTGC ACGATCTTCC GCGACGGGCG CAAGATCGCG
TCGCACGAGT CGCTCGCCGA CGTGCCGCGC GAACGGCTCG TCGCCGAGAT GGTCGGGCGC
GAGATTTCGG ACATCTACCA TTACGCGCCG CGTGCGCTCG GCGATGTGCG CTTCGCCGCC
GAAGGCATCG ACGGCCCGGC ATTGCGCGAA CCCGCGAGCT TCTCGGTGCG CGCGGGCGAG
ATTGTCGGCT TTTTCGGGCT CGTCGGCGCG GGCCGCAGCG AACTGATGCG GCTCGTGTAC
GGCGCGGATC ACCGGCGCGC GGGCGTGCTG ACGCTTGACG GCGCGCGCAT CGACGTGAAG
CGCACCGGCG ACGCGATCCG CCACGGCATC GTGCTGTGCC CCGAGGACCG CAAGGAAGAG
GGGATCATCG CGATCGCGTC GGTCGCGGAG AACATCAACA TCAGCTGCCG CCGCCATTCG
CTGCGCGCGG GGCTCTTCAT CAACCGCAAG GCCGAAAGCG AAACGGCCGA CCGTTTCATC
CAGCGGCTCA AGATCAAGAC GCCGAACCGG CGGCAGAAGA TCCGCTTCCT GTCGGGCGGC
AACCAGCAGA AGGCGATCCT GTCGCGCTGG CTCGCGGAGC CCGACCTGAA GGTCGTGATC
CTCGACGAAC CGACGCGCGG GATCGACGTC GGCGCGAAGC ACGAGATCTA CGACGTGATC
TACCGGCTCG CGGAACGCGG CTGCGCGATC GTGATGGTGT CGTCGGAGCT GCCGGAAGTG
CTCGGCGTGT CCGACCGCAT CGTCGTGATG CGCGAAGGCC GGATCGCGGG CGAACTGCCG
CGCGAACAGG CGAACGAGCA CGCGGTGCTG AGCCTGGCGC TGCCGCAGAC GAGCGCCGTC
GAGGCGGCCT GA
 
Protein sequence
MSAALRFDNI GKVFPGVRAL DGISFDVHAG EVHGLMGENG AGKSTLLKIL GGEYQPDAGS 
VLVDGQPVQF ASAAASIAAG IAVIHQELQY VPDLTVAENL LLGRLPNAFG WVRKREAKRY
VRERLAAMGV DLDPDAKLGR LSIAQRQMVE ICKALMRNAR VIALDEPTSS LSHRETEVLF
KLVDDLRAQG RALIYISHRM DEIYRLCNAC TIFRDGRKIA SHESLADVPR ERLVAEMVGR
EISDIYHYAP RALGDVRFAA EGIDGPALRE PASFSVRAGE IVGFFGLVGA GRSELMRLVY
GADHRRAGVL TLDGARIDVK RTGDAIRHGI VLCPEDRKEE GIIAIASVAE NINISCRRHS
LRAGLFINRK AESETADRFI QRLKIKTPNR RQKIRFLSGG NQQKAILSRW LAEPDLKVVI
LDEPTRGIDV GAKHEIYDVI YRLAERGCAI VMVSSELPEV LGVSDRIVVM REGRIAGELP
REQANEHAVL SLALPQTSAV EAA