Gene BURPS1106A_0256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0256 
SymboldppA 
ID4899380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp238285 
End bp239913 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content63% 
IMG OID640133486 
Productdipeptide ABC transporter, periplasmic dipeptide-binding protein DppA 
Protein accessionYP_001064539 
Protein GI126454101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATA ACCGTCTGTT GCGCGCACTG CGTGCTACCG CCATCGCGGG CGTTGCAGCG 
GCATCGTTCG GCATCGCGGG TTCTGCATTC GCACAGATCC CGAACAAAAC GCTCGTCTAC
TGCTCAGAAG GCAGCCCGGC GGGCTTCGAT TCCGCGCAAT TCACGACGGG CGTCGATTTC
ACCGCGTCGA CGTTCCCGAT CTACAACCGC CTCGTCGAAT TCGAGCGCGG CGGCACGAAG
GTCGAGCCGG GCCTCGCCGA GAAGTGGGAC ATCTCGGCCG ACGGCAAGGT CTACACGTTC
CATCTGCGCC ACGGCGTCAA GTTCCATACG ACCGATTTCT TCAAGCCCAC GCGCGAATTC
AACGCGGACG ACGTCGCGTT CACGTTCGAG CGGATGCTCG ATCCGAATCA GGCGTTTCGC
AAGGCGTACC CGGTGTCGTT CCCGTACTTC ACCGACATGG GCCTCGACAA GCTGATCGTG
AAGATCGAGA AGCTCGATCC GTACACGGTC CGCTTCACGC TGAAGGAGCC GAACGCGCCG
TTCATCCAGA ACCTCGCGAT GGAATTCGCG TCGATCCTCT CGGCCGAATA CGCGGACCAA
CTGATGAAGG CGGGCAAGGC GGCCGACATC AACCAGAAGC CGATCGGCAC GGGCCCGTTC
ATCTTCCGCA GCTACACGAA GGACGCGACG ATCCGCTTCG ACGGCAATCC TGATTATTGG
AAGAAGGGCG CGGTGAAGAT CTCGAAGCTG ATCTTCTCGA TCACGCCCGA CCCGGGCGTG
CGCGTGCAGA AGATCAAGCG CAACGAGTGC CAGGTGATGA GCTATCCGCG GCCCGCGGAC
ATCGCGACGC TGAAGGCCGA TTCGAACGTC GACATGCCGT CGCTGCCGGG CTTCAACCTC
GGCTACCTCG CGTACAACGT GCAGCACAAG CCCGTCGACA AGCTCGAAGT GCGCCAGGCG
CTCGACATGG CGATCAACAA GAAGGCGATT CTCGAATCCG TCTATCAGGG CGCGGGCCAG
GCGGCGAGCG CGCCGATGCC GCCGACCCAA TGGTCGTACG ACAAGAACCT GAAGGCCGCC
GCCTACGATC CGGCGAAGGC GAAGGCGCTG CTCGCGAAGG CGGGCTACCC GAACGGCTTC
CCGATCACGC TGTGGGCGAT GCCCGTGCAG CGCCCGTACA ACCCGAACGC GAAGCTGATG
GCCGAGATGA TCCAGGCCGA CTGGGCGAAG ATCGGCGTGC AGGCGAAGAT CGTCACGTAC
GAGTGGGGCG AGTACATCAA GCGCGCGCAT GCGGGCGAGC ACGATACGAT GCTGATCGGC
TGGAACGGCG ACAACGGCGA TCCCGACAAC TGGCTCGGCA CGCTGCTCGG CTGCGAGGCG
GTCAAGGGCA ACAACTTCTC CGAGTGGTGC TACAAGCCGT TCGACGAGCT GATCCAGAAG
GGCCGCGTGA CGACGTCGCA GGACGGCCGC ACGAAGATCT ACATGCAGGC GCAGCAGATC
TTCGCGCAGC AACTGCCGTT CTCGCCGATC GCGAACTCGA CCGTCTATCA GCCGGTGCGC
AAGAACATCG TCGACATGCG GATCGAGCCG CTCGGCTATG CGCGCTTCGA CGGCGTCAGC
GTGAAATAA
 
Protein sequence
MEHNRLLRAL RATAIAGVAA ASFGIAGSAF AQIPNKTLVY CSEGSPAGFD SAQFTTGVDF 
TASTFPIYNR LVEFERGGTK VEPGLAEKWD ISADGKVYTF HLRHGVKFHT TDFFKPTREF
NADDVAFTFE RMLDPNQAFR KAYPVSFPYF TDMGLDKLIV KIEKLDPYTV RFTLKEPNAP
FIQNLAMEFA SILSAEYADQ LMKAGKAADI NQKPIGTGPF IFRSYTKDAT IRFDGNPDYW
KKGAVKISKL IFSITPDPGV RVQKIKRNEC QVMSYPRPAD IATLKADSNV DMPSLPGFNL
GYLAYNVQHK PVDKLEVRQA LDMAINKKAI LESVYQGAGQ AASAPMPPTQ WSYDKNLKAA
AYDPAKAKAL LAKAGYPNGF PITLWAMPVQ RPYNPNAKLM AEMIQADWAK IGVQAKIVTY
EWGEYIKRAH AGEHDTMLIG WNGDNGDPDN WLGTLLGCEA VKGNNFSEWC YKPFDELIQK
GRVTTSQDGR TKIYMQAQQI FAQQLPFSPI ANSTVYQPVR KNIVDMRIEP LGYARFDGVS
VK