Gene BURPS668_A0850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0850 
Symbol 
ID4888808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp828559 
End bp829509 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content64% 
IMG OID640130790 
Productquaternary amine ABC transporter periplasmic substrate-binding protein 
Protein accessionYP_001061849 
Protein GI126444696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCCA CCACGACATT CGTATTCGGC GCGGCGCTCG CCGCGGCATG CGCGCTGCCG 
TCGCAATCCT TCGCGCAGGA CTCGGCCGCC TGCCGCAACG TGCGCTTCGC GGACATCGGC
TGGACCGACA TCACGTCGAC GACGGCGCTC GCGTCGCTGC TGTTCGACGG TCTAGGCTAC
AAGCCGACGA CGACGATCGC GTCCGTGCCG ATTTCGTTCG CAGGACTCAA GAACAGGCAG
CTCGACGTAT CGCTCGGCTA CTGGTGGCCG GTGCAGCAGC ATCAGTTGCA GCCGTTCCTC
GATTCGAAAT CGATCTCGGT GGTCGAGCCG CCGAACCTGT CGGGCGCGAA GGCGACGCTC
GCGGTGCCGA GCTACGTGTA CCAGGCCGGG CTGAAATCGT TCGACGACGT CGCGAAGCAT
CGCGCCGAGC TCGATGGCAA GATCTATGGG ATCGAGCCCG GCAGCAGCGC GAACGCGATG
ATCCAGAAGA TGATCGACAC GAACCAGTAC GGGCTCGGCG GCTTCAAGCT CGTCGAATCG
AGCGAGGCGG GGATGCTCGT CACGGTCGAG CGCGCGATCC GCGACAAGAA GTGGGTCGTG
TTCCTCGGCT GGGAGCCGCA TCCGATGAAC ATCCAGATCG GCATGAACTA CCTGTCGGGC
GGCGACGCGG CGTTCGGCCC GAACTACGGC GAAGCGCGCG TGTACACGCT GACGTCGCCC
GATTACATGG CGCGCTGCCC GAACGCGGGC AAGCTCGTCG GCAATCTGCG CTTCACCACG
CAGATGGAAA ACCAGCTGAT GCAGGCGGTG ATGAACAAGG TGAAGCCCGC GGAAGCGGCG
AAGGCGTACA TCCGAAAGAA TCCGCAAGTG CTCGATGCGT GGCTTGCCGG CGTGAAGACC
TACGACGGCA AGGACGGGCT GGCTGCGGTG AAGGCTTATC TGGGGCTCTG A
 
Protein sequence
MKSTTTFVFG AALAAACALP SQSFAQDSAA CRNVRFADIG WTDITSTTAL ASLLFDGLGY 
KPTTTIASVP ISFAGLKNRQ LDVSLGYWWP VQQHQLQPFL DSKSISVVEP PNLSGAKATL
AVPSYVYQAG LKSFDDVAKH RAELDGKIYG IEPGSSANAM IQKMIDTNQY GLGGFKLVES
SEAGMLVTVE RAIRDKKWVV FLGWEPHPMN IQIGMNYLSG GDAAFGPNYG EARVYTLTSP
DYMARCPNAG KLVGNLRFTT QMENQLMQAV MNKVKPAEAA KAYIRKNPQV LDAWLAGVKT
YDGKDGLAAV KAYLGL