Gene BURPS668_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1866 
Symbol 
ID4884367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1829599 
End bp1830645 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content69% 
IMG OID640127794 
Productputative ABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_001058901 
Protein GI126441454 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.346438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGCG CATCGTCTCT CGGCCGACGC TCGCTCTCAC TTACCGCCGC GCTGGCCGTG 
CTGGTCGCGG CGCTCGCCGC CGCATCGTTG GTCGCCGCGC CGGCGGCGCG CGCCGAAGGG
CGCATTCGGG TCGCCGAGCA GTTCGGCATC GTCTACCTGC TGCTGAACGT CGCGCGCGAT
CGGCATCTGA TCGAGCAGGC GGGACGCGCC GAGGGCATCG CGATCGATGT CGACTGGGTC
AAGCTCTCGG GCGGCGCGGC GATCAACGAT GCGCTCCTGT CCGGCTCGAT CGACATCGCG
GGCGCGGGCG TCGGGCCGCT CCTGACGATC TGGGACCGCA CGCGCGGCCG GCAGAACGTG
AAGGGTGTCG CGTCGCTCGG CAATTTGCCG TATTACCTCG TCAGCAACGA TCCGCGCGTG
AAGACGATCG CCGATTTCAC CGCGCGCGAG CGCATCGCGG TGCCGGCGGT GACGGTATCG
GTGCAATCGC GCCTGCTGCA GTTCGCGGCC GCCCAGCGTT GGGGCGATCG TGCGTACGAC
CGGCTCGACA AGCTGACGCA GGCCGTCGCG CACCCGGACG CGGCGGCCGC GATCATCGCG
GGCCGCACCG AGCTCACCGC GCACTTCGGC AATCCGCCGT TCCAGGAGCA GGAACTCGCG
GCCAATCCGA ACGCGCACAT CGTGCTGAGC TCGTACGACG TGCTCGGCGG GCCGAGCTCG
GCGACGGTGC TGTACGCGAC CGAGCGATTC CGCCGCGACA ATCCGAAGAC CTACCGCGCG
TTCGTCGCCG CGCTCGGGCA GGCGGCGCGC TACGTGCAGA CGAACCCGGA GGGCGCGGTC
GACGCGTATC TGCGTGTGAA CGGCTCGAAG GCCGATCGCG CGCTGCTGCT GAAAATCGTC
AGGAATCCAC AGGTGCAGTT CAGGATCGCG CCGCAGAACA CGTTCGCGCT CGCGGCGTTC
ATGCACCGCG TCGGCGCGAT CCGCCACGAG CCGAAGACGT GGCGCGACTA TTTCTTCGAC
GATCCGGCGA CCGCACAGGG CAGTTGA
 
Protein sequence
MTRASSLGRR SLSLTAALAV LVAALAAASL VAAPAARAEG RIRVAEQFGI VYLLLNVARD 
RHLIEQAGRA EGIAIDVDWV KLSGGAAIND ALLSGSIDIA GAGVGPLLTI WDRTRGRQNV
KGVASLGNLP YYLVSNDPRV KTIADFTARE RIAVPAVTVS VQSRLLQFAA AQRWGDRAYD
RLDKLTQAVA HPDAAAAIIA GRTELTAHFG NPPFQEQELA ANPNAHIVLS SYDVLGGPSS
ATVLYATERF RRDNPKTYRA FVAALGQAAR YVQTNPEGAV DAYLRVNGSK ADRALLLKIV
RNPQVQFRIA PQNTFALAAF MHRVGAIRHE PKTWRDYFFD DPATAQGS