Gene BURPS1106A_2547 details


Gene Information

Locus tag: BURPS1106A_2547
Symbol:
ID: 4900276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2501035
End bp: 2502987
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 71%
IMG OID: 640135774
Product: ABC transporter, periplasmic substrate-binding protein
Protein accession: YP_001066801
Protein GI: 126453009
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
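
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page) and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2501035
end_bp = 2502987

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# The last codon is the stop codon and does not appear in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1953 650
```

Both results agree with the listed gene length (1953 bp) and protein length (650 aa).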


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.524722
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATCG GTTCGCCCCG GGCCCGCCGG CCGCGATCGC CGCAACGCGC GGCGCCATCA 
GAACAGGCCG CGCGCGCCGC CGCGCCGCGA CGGGCGGCCC GCGCGCGCGC CGCGCTCGCG
CGCTTCGCGC GCCGGGCGGC GGCGGGCGTC GCGCTCGCCT TCGTCGCGGC GCCCGCGGCG
CACGCCGTCT ACGCGATCGC GCAGTACGGC GAGCCGAAGT ATCCGGCGGG CTTCGCGCAT
TTCGACTACG TGAACCCCGA CGCGCCGAAG GGCGGCACGC TCGTGCTCGC GAACCCGAAC
CGGCTCACGA CGTTCGACAA GTTCAATCCG TTCACGATGC GCGGCAACCC GGCGCCCGGA
ATCGACCTGC TGTTCGAGAG CCTGACGACG GGCAGCGCCG ACGAGCCCGC CTCCGCGTAC
GGCCTGCTCG CGGACGACAT CGCCGTCGCG CCGGACGGCC TGTCGGTCAC GTTCCATCTG
AATCCGCGCG CGCGCTTCTC GAACGGAGAA CCCGTCACCG CGGCGGACGT CAAGTATTCG
TTCGACACGC TGAAGAGCCC GAAGGCGGCG CCGCAATACC CGGCGTACTA CGCGGACATC
GCGCGCGCGG TGATCGTCGA CGCGGCGACC GTGCGCTTCG AGTTTCGCCG CAAGAACCGC
GAGCTGCCGC TGATCGCGGG CGGCATCCCG GTGTTCTCGC GCAAATGGGG CGTGCGCGCG
GACGGCTCGC GCATCGCGTT CGACCAGATC GCGTTCGAGC AGCCGATCGG CAGCGGCCCG
TACCTGATCG AGCGCTACGA CAGCGGGCGC ACGATCACGT ACCGGCGCAA TCCCGCCTAC
TGGGGCGCGG CGCTGCCCGT GCGGATCGGC ACGAACAACT TCGAGCGCAT CGTCTACAAG
CTGTACGGCG ACGGCGTCGC GCGGCTCGAG GCGTTCAAGG CCGGCGAATA CGACGTGCTC
GTCGAGTACA TCGCGCGCAA CTGGGCGCGG CGCGACGTCG GCAAGCGCTT CGACAGCGGC
GAGCTCGTCA AGCGCGAGTT CCGCCAGCAC AACGGCGCGG GAATGCAGGG CTTCTTCATG
AACCTGCGCC GGCCGCTGTT CCAGGACGTG CGCGTGCGCC ACGCGCTCGA TCTCGCGTTC
GATTTCGAAT GGCTGAACCG GCAGCTTTTC TATGGCGCGT ACACGCGCCT GAACAGCTAT
TTCGCCGATA CCGACCTGCA GGCGACGGGC ACGCCGAGCG CGGGCGAGCT CGCGCTGCTC
GCCCCGTTGC GCGCGCAGCT CGACCCGGCC GTGTTCGGGC CGATGACCGT GCAGCCGAGC
ACCGATTTGC CCGCGTCGCT GCGCGCGAAC CTGCTGAAGG CGCGCGCGCT GCTCGCCGAG
GCCGGCTGGA CCTACCGCGA CGGCGCGCTG CGCAACGCGA AGGGCGAGCC GTTCGTGTTC
GAGATTCTCG ACGATTCGGG CTCGGCGTTC GAGCCGGTGG TCGCCGCGTA CATCCGCAAT
CTCGCGAAGC TCGGGATCGT CGCGAAGTAC CGGACGGCCG ATTTCGCGCT GCTGCAAAAG
CGCCTCGACG CGTTCGACTA CGACATGACG ACGGTCCGCT ACCCGGGCGT CCAGGTGCCG
GGCGCCGAGC AGGTCGCACG CTTCGCGAGC CGCTATGCGG ACGAGCCGGG CTCGGACAAC
CTGACGGGGC TCAAGTCGCC CGCGGTCGAC GCGATCCTGA AGGCGCTCAC GCAGGCCGAG
ACGCGCGACG AACTGCTCGA CGCGACGCAC GCGCTCGACC GCGTGCTGAT GCACGGCTAC
TATGCGGTGC CGCAGTGGTA CAGCGCCGTG CACCGGATCG CGTTCAAGCG CACGCTCGCC
TACCCGTCGG TGCTGCCGCT GTACTATTCG GCGGAAGGCT GGGTCGCCTC GACGTGGTGG
GCGAGGCCCG AGCATGGCGC GTCCGCGCGT TAG
 
Protein sequence
MTIGSPRARR PRSPQRAAPS EQAARAAAPR RAARARAALA RFARRAAAGV ALAFVAAPAA 
HAVYAIAQYG EPKYPAGFAH FDYVNPDAPK GGTLVLANPN RLTTFDKFNP FTMRGNPAPG
IDLLFESLTT GSADEPASAY GLLADDIAVA PDGLSVTFHL NPRARFSNGE PVTAADVKYS
FDTLKSPKAA PQYPAYYADI ARAVIVDAAT VRFEFRRKNR ELPLIAGGIP VFSRKWGVRA
DGSRIAFDQI AFEQPIGSGP YLIERYDSGR TITYRRNPAY WGAALPVRIG TNNFERIVYK
LYGDGVARLE AFKAGEYDVL VEYIARNWAR RDVGKRFDSG ELVKREFRQH NGAGMQGFFM
NLRRPLFQDV RVRHALDLAF DFEWLNRQLF YGAYTRLNSY FADTDLQATG TPSAGELALL
APLRAQLDPA VFGPMTVQPS TDLPASLRAN LLKARALLAE AGWTYRDGAL RNAKGEPFVF
EILDDSGSAF EPVVAAYIRN LAKLGIVAKY RTADFALLQK RLDAFDYDMT TVRYPGVQVP
GAEQVARFAS RYADEPGSDN LTGLKSPAVD AILKALTQAE TRDELLDATH ALDRVLMHGY
YAVPQWYSAV HRIAFKRTLA YPSVLPLYYS AEGWVASTWW ARPEHGASAR
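
Both sequences are shown in blocks of 10 characters separated by spaces, so the blocks have to be joined before the sequences can be used. The sketch below joins the blocks and translates the gene sequence so the result can be compared with the listed protein sequence. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons of table 11 (this gene starts with ATG, so that case does not arise here). The function names `join_blocks` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11 (identical to the
# standard code), listed in TCAG codon order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = join_blocks(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACGATCG GTTCGCCCCG GGCCCGCCGG"
print(translate(first_blocks))  # MTIGSPRARR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` should give a string that can be compared against `join_blocks` applied to the protein sequence.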