Gene BURPS1106A_0522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0522 
Symbol 
ID4900682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp484673 
End bp485644 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content67% 
IMG OID640133752 
Productputative ABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_001064805 
Protein GI126452786 
COG category[R] General function prediction only 
COG ID[COG2984] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAT TCAAGATCGT GGCCGCTCAT TCGATCGCGG CGGGCGTCGC GGCGTTCGCG 
ATGCTGGGCG CCGGCGCCGC GCACGCGCAG ACCGTCAAGG TGCTGTCGAT CGTCGATCAT
CCGGCGCTCG ACGCGATCCG CGACGGCGTG CGCGCGCAGC TGAAGGCCGA AGGCTACGGC
GACGACAAGC TCAAGTGGGA ATACCAGAGC GCGCAGGGCA ACACCGGCAC CGCCGCGCAG
ATCGCGCGCA AGTTCATCGG CGACCGTCCG GACGTGATCG TCGCGATCGC GACGCCCGCC
GCGCAAGCCG TCGTCGCATC GACGAAGACC GTGCCTGTCG TCTATTCGGG CGTGACCGAT
CCCGTTGCCG CGCAGCTCGT CAAGGGCTGG GGGCCGACGG GTACCAACGT GACGGGCGTG
TCCGACCAGC TGCCGCTCGA CCGGCAGGTC GCGCTCATCA AGCGCGTGGT GCCGAAGGTG
AAGACGGTCG GGATGGTCTA CAACCCGGGC GAGGCAAACT CGGTCGTCGT CGTGAAGGCG
CTCAAGGAGA TCCTCGCGAA GCAGGGGATG ACGCTCAAGG AGGCGGCCGC GCCGCGCACC
GTCGACATCG CGCCCGCCGC GAAGAGCCTG ATCGGCAAGG TCGACGTGAT CTATACGAAC
ACCGACAACA ACGTCGTGTC CGCATACGAA TCGCTCGTGA AGGTCGCGAA CGAGGCGAAG
ATCCCGCTCG TCGCGGGCGA CACCGACAGC GTGAAGCGCG GCGGCATCGC GGCGCTCGGC
ATCAACTACG GCGACCTCGG CCGGCAGACG GGCAAGGTCG TCGCGCGGAT CCTGAAGGGC
GAGAAGCCGG GCGCGATCGC ATCGGAGACG AGCAGCAATC TCGAGCTGTT CGTGAACACC
GACGCGGCCG CCAAGCAGGG CGTGACGCTT GCGCCCGATC TCGTCAAGGA AGCGAAGACG
GTCATCAAGT AA
 
Protein sequence
MKRFKIVAAH SIAAGVAAFA MLGAGAAHAQ TVKVLSIVDH PALDAIRDGV RAQLKAEGYG 
DDKLKWEYQS AQGNTGTAAQ IARKFIGDRP DVIVAIATPA AQAVVASTKT VPVVYSGVTD
PVAAQLVKGW GPTGTNVTGV SDQLPLDRQV ALIKRVVPKV KTVGMVYNPG EANSVVVVKA
LKEILAKQGM TLKEAAAPRT VDIAPAAKSL IGKVDVIYTN TDNNVVSAYE SLVKVANEAK
IPLVAGDTDS VKRGGIAALG INYGDLGRQT GKVVARILKG EKPGAIASET SSNLELFVNT
DAAAKQGVTL APDLVKEAKT VIK