Gene BURPS1106A_A0760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0760 
Symbol 
ID4904089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp754131 
End bp755141 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID640143866 
Productquaternary amine ABC transporter periplasmic substrate-binding protein 
Protein accessionYP_001074796 
Protein GI126455704 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGGCCG CGCGGTTGAG TCGAAATCGC CTGCGCAATC CGTCGAACAG GGAGGCAACC 
ATGAAGTCCA CCACGACATT CGTATTCGGT GCGGCGCTCG CCGCGGCATG CGCGCTGCCG
TCGCAATCCT TCGCGCAGGA CTCGGCCGCC TGCCGCAACG TGCGCTTCGC GGACATCGGC
TGGACCGACA TCACGTCGAC GACGGCGCTC GCGTCGCTGC TGTTCGACGG TCTAGGCTAC
AAGCCGACGA CGACGATCGC GTCCGTGCCG ATTTCGTTCG CAGGACTCAA GAACAGGCAG
CTCGACGTAT CGCTCGGCTA CTGGTGGCCG GTGCAGCAGC ATCAGTTGCA GCCGTTCCTC
GATTCGAAAT CGATCTCGGT GGTCGAGCCG CCGAACCTGT CGGGCGCGAA GGCGACGCTC
GCGGTGCCGA GCTACGTGTA CCAGGCCGGG CTGAAATCGT TCGACGACAT CGCGAAGCAT
CGCGCCGAGC TCGACGGCAA GATCTACGGG ATCGAGCCCG GCAGCAGCGC GAACGCGACG
ATCCAGAAGA TGATCGATAC GAACCAGTAC GGGCTCGGCG GTTTCAAGCT CGTCGAATCG
AGCGAGGCGG GGATGCTCGT CACGGTCGAG CGCGCGATCC GCGACAAGAA GTGGGTCGTG
TTCCTCGGCT GGGAGCCGCA TCCGATGAAC ATCCAGATCG GCATGAACTA CCTGTCGGGC
GGCGACGCGG CGTTCGGCCC GAACTACGGC GAAGCGCGCG TGTACACGCT GACGTCGCCC
GATTACATGG CGCGCTGCCC GAACGCGGGC AAGCTCGTCG GCAATCTGCG CTTCACCACG
CAAATGGAAA ACCAGCTGAT GCAGGCGGTG ATGAACAAGG TGAAGCCCGC GGAAGCGGCG
AAGGCGTACA TCCGAAAGAA TCCGCAAGTG CTCGATGCGT GGCTTGCCGG CGTGAAGACC
TACGACGGCA AGGACGGGCT GGCTGCGGTG AAGGCTTATC TGGGGCTCTG A
 
Protein sequence
MQAARLSRNR LRNPSNREAT MKSTTTFVFG AALAAACALP SQSFAQDSAA CRNVRFADIG 
WTDITSTTAL ASLLFDGLGY KPTTTIASVP ISFAGLKNRQ LDVSLGYWWP VQQHQLQPFL
DSKSISVVEP PNLSGAKATL AVPSYVYQAG LKSFDDIAKH RAELDGKIYG IEPGSSANAT
IQKMIDTNQY GLGGFKLVES SEAGMLVTVE RAIRDKKWVV FLGWEPHPMN IQIGMNYLSG
GDAAFGPNYG EARVYTLTSP DYMARCPNAG KLVGNLRFTT QMENQLMQAV MNKVKPAEAA
KAYIRKNPQV LDAWLAGVKT YDGKDGLAAV KAYLGL