Gene BURPS1106A_0056 details


Gene Information

Locus tag: BURPS1106A_0056
Symbol:
ID: 4900076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 54090
End bp: 55349
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 68%
IMG OID: 640133286
Product: putative ABC transporter, substrate-binding protein
Protein accession: YP_001064341
Protein GI: 126452961
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
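
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 54090
END_BP = 55349


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of amino acids encoded by a CDS whose last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1260 419"
```

Both values agree with the Gene Length and Protein Length fields of the record.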


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCGC TTTCCCTCAA CATGAAGCGC GCGCTTGCCG GCGCGTGCAT CGGCGCCGCG 
GGATTTCTCG CGCAACAGGC GGCGCTCGCG CAGAGCTGCG GGCTCGCGAA CGGCAAGCCC
GCGACGGGCG CGCCGATTCC GATCGGCGCG ATCGTCGGCA AGACCGGCCC CGACGACTTC
AGCTCGTCGG CACGCGCGGC CGCCGCGTAT TTCAAGTGCG TGAACGCGAA CGGCGGCATC
AACGGCCGGC CTGTCCAGTA TCTCGTCGAA GACGATCAAT GGAATCCGGA AACGGCGTCG
CAGGTCGCCT CGAAACTCGT ACGCGACCGC AAGGTGCTCG CGCTCGCGGG CAACGCGAGC
TTCGTCGAAT GCGGCGCGAA CGCGAAGTTC TACGAACAGG AGAACGTGAT CGCGATCGCC
GGCGTCGGCG TGCCGCGCGA ATGCTATTTC GCGCGCAACT ACGCGCCGCT CAACATGGGG
CCGCGCCTGT CGATGACGGA GGCGGCGCTG TACGCGAAGC AGCAGTACAA GGCGACGCGG
ATGGTGTGCA TCGCGCCGAA CATCCCGAGC CTCGGCGCTT GGTCGTGCGA AGGGCCGGCG
CTCTGGGGCA AGCGCAACGG CGTGAGTGTC GACACGATCG TGATGGACCC CGACTCCGCC
GATCCGACGT CGGTCGTGCT GCAGGCGGCG TCGAAGAATC CGCAGGCGAT CCTGCTCGGG
CTGCCGAAAG GGCTGATGGT GCCGATCCTG TCGGCCGCCG AGCAGCAGAA CCTCGGCCGG
CGGATTCACT TCGTCTCGGC GGCGTCCGGC TACGACCTCG GCGTGCCGAA GGCGATCGGC
CCGTACTGGA AGGGCAATTT CGACGTGAAC CTCGAGTTCC AGCCGCTCGA CGCGCAAACG
CCGGACAACC GGAACTGGCT CGCGGTGATG GACAAGTACG GCGACAGGAA GGACCCGCGC
GACACGTTCT CGCAGGCGGG CTATCTGGCC GCGCGGCTCG TGACCGACAC GCTGCTGAAA
CTGCCCGCGA ACCAGCTCGA TCGCGCGCAC GTGACGGCCG CGCTGCGCGA GGTGAAGGAC
TTCCGCAGCG ACATCCTGTG CGGCCCGTTC TACGTCGGCG CGGGCGAGCG GCACAACGCG
AACAACGCGG GCCGGATGGC GCAATCGACG GGCACGGGCT GGAAGACGGT GTCGACGTGC
CAGGCGGTCG ACGATCCGCA ACTCGCCGAC ATCCGCGCGG CCGAAAAGAA GATGCACTGA
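
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation such as the GC content reported in the record. The following is a minimal sketch under that assumption. `clean_sequence` and `gc_content` are hypothetical helper names, and the example uses only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as displayed above.
first_blocks = "ATGAAGCCGC TTTCCCTCAA"
print(gc_content(clean_sequence(first_blocks)))  # prints "50.0"
```

Passing the complete 1260 bp sequence through the same two functions gives the gene-wide value to compare with the GC content field.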
 
Protein sequence
MKPLSLNMKR ALAGACIGAA GFLAQQAALA QSCGLANGKP ATGAPIPIGA IVGKTGPDDF 
SSSARAAAAY FKCVNANGGI NGRPVQYLVE DDQWNPETAS QVASKLVRDR KVLALAGNAS
FVECGANAKF YEQENVIAIA GVGVPRECYF ARNYAPLNMG PRLSMTEAAL YAKQQYKATR
MVCIAPNIPS LGAWSCEGPA LWGKRNGVSV DTIVMDPDSA DPTSVVLQAA SKNPQAILLG
LPKGLMVPIL SAAEQQNLGR RIHFVSAASG YDLGVPKAIG PYWKGNFDVN LEFQPLDAQT
PDNRNWLAVM DKYGDRKDPR DTFSQAGYLA ARLVTDTLLK LPANQLDRAH VTAALREVKD
FRSDILCGPF YVGAGERHNA NNAGRMAQST GTGWKTVSTC QAVDDPQLAD IRAAEKKMH
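
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino acid string used for the standard genetic code. It assumes that translation table 11 assigns the same amino acids to codons as the standard code during elongation and differs only in the permitted start codons, which does not matter here because this gene starts with ATG. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Standard codon assignments, with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGAAGCCGC TTTCCCTCAA CATGAAGCGC"))  # prints "MKPLSLNMKR"
print(translate("ATGCACTGA"))                         # prints "MH"
```

The two outputs match the first block and the final two residues of the protein sequence listed above.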