Gene BURPS1106A_A1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1721 
Symbol 
ID4905512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1697610 
End bp1698680 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content73% 
IMG OID640144827 
Productputative transporter 
Protein accessionYP_001075755 
Protein GI126456411 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0542336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCC GCACGAACGC GGCTTTCACC GCCCTCCTCG CCGCCGCGCT GTTCGGCGCC 
ACCACGCCGC TCGCGAAGAC GCTGCTCGGC TCGCTCACGC CGTTCATGGT CGCGGGCCTG
TTCTATCTCG GCAGCGGCGT CGGCCTCGGG GCGTTCATGC TGATGCGCCG GCTCGCGCGC
GGCGCCGGCG CCGGCGCATC GCCCGCCGGC CACGCGCGGC TGCCGCTTGC CGAGCTTCCG
TGGCTCGCGG GCGCGGTCGC GGCGGGCGGC GTCGCGGGCC CGGCGCTGCT GATGCTCGGC
CTCGCGACGA CGCCCGCCGC GACGAGCGCG CTGCTGCTCA ATCTCGAAGG CGTGTTCACC
GCGCTGATCG CGTGGGCCGT GTTCCGCGAG AACGTGGATG CGCAGATTTT CGCCGGCATG
GCCGCGATCG TCGCGGGCGG CGTGCTGCTG TCGTGGCATC CGGGCGCGGC GGGCGTGCCG
CTCGGCGCGC TGCTCGTCGC GGCCGCCTGC GCGTGCTGGG CGATCGACAA CAACCTGACG
CGCAAGGTCT CGACTCACGA CGCCGCGGCG ATCGCGTGCG TCAAGGGCCT CGTCGCCGGC
ACGGTCAACC TCGGCATCGC GCTCGCGCTC GGCGCGCGGC TGCCCGCCGC CGCCGACAGC
GCGGCCGCGA TGCTCACGGG CTTCGCCGGC TATGGCGTGA GCCTCGTGCT GTTCGTCGTC
GCGCTGCGCA ATCTCGGCAC CGCGCGGACC GGCGCGTATT TCTCGGTCGC GCCGCTGTTC
GGCGTCGGGC TGTCGCTCGC GCTGTGGCCC GAATGGCCGC CGCTGTCGTT CTGGGCCGCC
GCGGCACTGA TGGCGCTCGG CATCTGGCTG CACCTGCGCG AGCGCCACGA GCATCCGCAT
ACGCACGAGG CGCTCGAGCA CAGCCATCGG CACCGGCACG ACACGCATCA TCAGCACGCG
CACGACTTCG ACTGGGACGG CACGGAGCCG CACACGCACG CGCACCGGCA CACGCCGATC
ACGCACACGC ATGCGCATTT CCCGGACATT CATCACCGGC ACTCGCACTG A
 
Protein sequence
MSARTNAAFT ALLAAALFGA TTPLAKTLLG SLTPFMVAGL FYLGSGVGLG AFMLMRRLAR 
GAGAGASPAG HARLPLAELP WLAGAVAAGG VAGPALLMLG LATTPAATSA LLLNLEGVFT
ALIAWAVFRE NVDAQIFAGM AAIVAGGVLL SWHPGAAGVP LGALLVAAAC ACWAIDNNLT
RKVSTHDAAA IACVKGLVAG TVNLGIALAL GARLPAAADS AAAMLTGFAG YGVSLVLFVV
ALRNLGTART GAYFSVAPLF GVGLSLALWP EWPPLSFWAA AALMALGIWL HLRERHEHPH
THEALEHSHR HRHDTHHQHA HDFDWDGTEP HTHAHRHTPI THTHAHFPDI HHRHSH