Gene BURPS1106A_1977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1977 
Symbol 
ID4902143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1943286 
End bp1944584 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content69% 
IMG OID640135207 
Productputative transporter, periplasmic substrate-binding protein 
Protein accessionYP_001066242 
Protein GI126452565 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.212694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGACG CTCGCCGCAC CGCGCACGGC GCATCACGAG AACCCGAACG CGCGGCGAGT 
TTCCCGACGG GCGCGCCCAC TGTGTCGATC CATGCCGAAA GCGATTGGGC ATCCGAACCG
ATCGATTCGT TTGAGGCCGC CCGGCAATCG AAAATCGTGA CGTTTTCTCG AGCGAGGGTC
GACATGAACG TTCGCATGGG GTGTTTATCG CTGCTGGCCG CCGTCGCATT CGACGCGCAC
GCGCAATCAT CGGTCGTCCG GATCGGCGTC GCGATGCCGC TGACGGGGCC CGTCGCCCAT
CTGGGCAAGG ACGTGCAGAA CGGCGCGCAG CTCGCCGTCG ACGAGCTGAA CCGCGCGCCG
CCGACGATCG ACGGCAAGCC GGTGAAGTTC GTGCTCGTCG TCGAAGACGA TCAGGGCGAT
CCACGGCAGG CGGTGCAGGT CGCGCAGCGC CTCGTCGACG CGCGGGTCGC GGGCGTCGTC
GGCGATCTGA ACTCGGGGCC GACGATCGTC GCGGCGAAGG TCTACGCGGC GGCCGGCATC
GCGCAGATCG CGCCCGCCGC GACCCATTCC GCGTACACGC AGCTAGGCTA CAAGACCGCG
TTCCGGCTGA TGGCGACCGA CAACCAGCAA GGCGCGTCGC TCGCCGGGCT CGCCGCGAAG
CTCGCGAAAG GCCGGCCGAT CGCGCTGATC GACGATCGCG GCGCCTACGG CCAGGGCCTC
ATCGATCAGA CGGAGAAGAC ACTGCGCGCG AGCGGCGTCA CGCGAATCAT CCGCGACTAC
ACGACGGACA CCGCGGTCAA TTTCGCGTCG ATTCTCACGC GCGTGAAGGG CGCGCATGCG
GCCGTGATCG TCTACGGCGG CGCCGATGCA CAGGCGGGGC CGATGGTGCG GCAGATGAAG
GCGCTCGGAA TCGACGCGGC GTTCGTCGGC AGCGACGGCG TGTGCACCGG GCAGTGGACG
GCGCTGTCGT CGGGCGCGAA CGAAGGCCAG TTCTGCACGC AAGCAGGCGA CCCGCGCGCG
AGGATGGCGG GTTACGCGGC GTTCGAGCGG CGTTTCGAGG CGCGCTACGG CAAGGTGATC
GTGTTCGCGC CGTATGGCTA CGACGCGGTG ATGCTGCTCG CCGACGCGAT GCGGCGCGCG
AATTCGACCG AGCCGGCCGC TTTGCTCGGC GCGCTCGCGA CGACCCGTTA CGACGGCGTG
ATCGGCCGCA TCCGCTTCAG TCCGCAGGGC GACAACCTGA ACGGCGCGGT GACGGTCTAC
CGCGTGCAGC GCGGCGCGCT CGTGCCGGTG TCCGACTGA
 
Protein sequence
MRDARRTAHG ASREPERAAS FPTGAPTVSI HAESDWASEP IDSFEAARQS KIVTFSRARV 
DMNVRMGCLS LLAAVAFDAH AQSSVVRIGV AMPLTGPVAH LGKDVQNGAQ LAVDELNRAP
PTIDGKPVKF VLVVEDDQGD PRQAVQVAQR LVDARVAGVV GDLNSGPTIV AAKVYAAAGI
AQIAPAATHS AYTQLGYKTA FRLMATDNQQ GASLAGLAAK LAKGRPIALI DDRGAYGQGL
IDQTEKTLRA SGVTRIIRDY TTDTAVNFAS ILTRVKGAHA AVIVYGGADA QAGPMVRQMK
ALGIDAAFVG SDGVCTGQWT ALSSGANEGQ FCTQAGDPRA RMAGYAAFER RFEARYGKVI
VFAPYGYDAV MLLADAMRRA NSTEPAALLG ALATTRYDGV IGRIRFSPQG DNLNGAVTVY
RVQRGALVPV SD