Gene BURPS668_A1806 details


Gene Information

Locus tag: BURPS668_A1806
Symbol:
ID: 4886907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1764254
End bp: 1765324
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 73%
IMG OID: 640131744
Product: hypothetical protein
Protein accession: YP_001062801
Protein GI: 126443047
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
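The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1764254, 1765324)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length and Protein Length fields.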


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCC GCACGAACGC GGCTTTCACC GCCCTCCTCG CCGCCGCGCT GTTCGGCGCC 
ACCACGCCGC TCGCGAAGAC GCTGCTCGGC TCGCTCACGC CGTTCATGGT CGCGGGCCTG
TTCTATCTCG GCAGCGGCGT CGGCCTCGGG GCGTTCATGC TGATGCGCCG GCTCGCGCGC
GGCGCCGGCG CCGGCGCATC GCCCGCCGGC CACGCGCGGC TGCCGCTTGC CGAGCTCCCG
TGGCTCGCGG GCGCGGTCGC GGCGGGCGGC ATCGCGGGCC CGGCGCTGCT GATGCTCGGC
CTCGCGACGA CGCCCGCCGC GACGAGCGCG CTGCTGCTCA ATCTCGAAGG CGTGTTCACC
GCGCTGATCG CGTGGGCCGT ATTCCGCGAG AACGTGGATG CGCAGATTTT CGCCGGCATG
GCCGCGATCG TCGCGGGCGG CGTGCTGCTG TCGTGGCATC CGGGCGCGGC GGGCGTGCCG
CTCGGCGCGC TGCTCGTCGC GGCCGCCTGC GCGTGCTGGG CGATCGACAA CAACCTGACG
CGCAAGGTCT CGACTCACGA CGCCGCGGCG ATCGCGTGCG TCAAGGGCCT CGTCGCCGGC
ACGGTCAACC TCGGCATCGC GCTCGCGCTC GGCGCGCGGC TGCCCGCCGC CGCCGACAGC
GCGGCCGCGA TGCTCACGGG CTTCGCCGGC TATGGCGTGA GCCTCGTGCT GTTCGTCGTC
GCGCTGCGCA ATCTCGGCAC CGCGCGGACC GGCGCGTATT TCTCGGTCGC GCCGCTGTTC
GGCGTCGGGC TGTCGCTCGC GCTGTGGCCC GAATGGCCGC CGCTGTCGTT CTGGGCCGCC
GCGGCGCTGA TGGCGCTCGG CATCTGGCTG CACCTGCGCG AGCGCCACGA GCATCCGCAT
ACGCACGAGG CGCTCGAGCA CAGCCATCGG CACCGGCACG ACACGCATCA TCAGCACGCG
CACGACTTCG ACTGGGACGG CACGGAGCCG CACACGCACG CGCACCGGCA CACGCCGATC
ACGCACACGC ATGCGCATTT CCCGGACATT CATCACCGGC ACTCGCACTG A
 
Protein sequence
MSARTNAAFT ALLAAALFGA TTPLAKTLLG SLTPFMVAGL FYLGSGVGLG AFMLMRRLAR 
GAGAGASPAG HARLPLAELP WLAGAVAAGG IAGPALLMLG LATTPAATSA LLLNLEGVFT
ALIAWAVFRE NVDAQIFAGM AAIVAGGVLL SWHPGAAGVP LGALLVAAAC ACWAIDNNLT
RKVSTHDAAA IACVKGLVAG TVNLGIALAL GARLPAAADS AAAMLTGFAG YGVSLVLFVV
ALRNLGTART GAYFSVAPLF GVGLSLALWP EWPPLSFWAA AALMALGIWL HLRERHEHPH
THEALEHSHR HRHDTHHQHA HDFDWDGTEP HTHAHRHTPI THTHAHFPDI HHRHSH
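The record gives translation table 11 and a GC content of 73%. The following is a minimal sketch of how the protein sequence and the GC figure could be checked against the gene sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the sequence is assumed to be pasted in as shown, with spaces between 10-base blocks.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and drop trailing stop codons ('*')."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First 30 bases of the gene sequence above
print(translate("ATGTCCGCCC GCACGAACGC GGCTTTCACC"))  # MSARTNAAFT
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once its spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.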