Gene BURPS1106A_A0754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0754 
Symbol 
ID4904360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp746998 
End bp748035 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content68% 
IMG OID640143860 
Productquaternary amine ABC transporter periplasmic substrate-binding protein 
Protein accessionYP_001074790 
Protein GI126457569 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCCG GCGTCACGCA TTTTCAAATG GAGCAAGCGA TGAAACGATA CGAATCCATT 
GCGCGGCGGC TCGCGCGCCG CGCGGCAGCC GCATCGCCGG CGTTCGCGGC GTTGGCATGG
TGCGCCGCGG CGGCCGCCGC CACGACCACG GCGGCCGCGG CGGAGCCGGC CGCCTGTCGC
GACGTGCGGA TGGCCGGCCC CGGCTGGACC GATATCGAAG CGACGAACGC GCTCGCGGGC
GTCGTGCTGA AGGCGCTCGG TTACCGGCAG AGCGTGTCGA ACCTGTCGGT GCCGATCACG
TATCAAGGTC TGAAGAAAGG GCAGCTCGAC GTGTTCCTCG GCAACTGGAT GCCGGCGCAG
GCGCCGCTCG TCAAGCCGTT CGTCGACGCG CGCGCGATCG ACGTGCTCCA CGCGAACCTG
AGCCATGCGA AATTCACGCT CGCGGTGCCG GACTACGTGG CGGCGGCGGG CGTGCATTCG
TTCGCCGACC TCGCGAAGTA CGCGCAGCGC TTCGGCGCGA AGATCTACGG CATCGAGCCG
GGCGCGCCGG CCAATCAGAA CATCTCGCGC ATGCTCGCCG ACAAGGCGCT CGGGCCGGCG
AACTGGCAGC TCGTCGAATC GAGCGAGACA GGGATGCTGA CGCAGGTCGA GCGCGCGGTG
CGCGAGCGCC AGTGGATCGT GTTTCTCGGC TGGGAGCCGC ACCTGATGAA CACGAAATTC
CATCTCGTTT ATCTGTCGGG CGGCGACGCG TATTTCGGGC CGGACTACGG CGGCGCGACC
GTCAACACCG TCGCGCGCGC GGATTTCGCG AGCCAGTGCG CGAATCTCGC GCGGCTGTTC
CGACAAATGA CGTTCACCGT CGATCTGGAG AACGGAATGA TCGCCGCGAT GCTGCAGGGC
AAGCGCTCCG CCGTGGATGC CGCGCAACAC GCGCTGCGTG CGAACCCGTC GCTCGTCGAA
GCATGGCTCG ACGGCGTGCG CACCGCGAGC GGCGCGCCAG GCTTGCCTGC GGTGCGCGCG
GCGCTCGATG CGCAATGA
 
Protein sequence
MTPGVTHFQM EQAMKRYESI ARRLARRAAA ASPAFAALAW CAAAAAATTT AAAAEPAACR 
DVRMAGPGWT DIEATNALAG VVLKALGYRQ SVSNLSVPIT YQGLKKGQLD VFLGNWMPAQ
APLVKPFVDA RAIDVLHANL SHAKFTLAVP DYVAAAGVHS FADLAKYAQR FGAKIYGIEP
GAPANQNISR MLADKALGPA NWQLVESSET GMLTQVERAV RERQWIVFLG WEPHLMNTKF
HLVYLSGGDA YFGPDYGGAT VNTVARADFA SQCANLARLF RQMTFTVDLE NGMIAAMLQG
KRSAVDAAQH ALRANPSLVE AWLDGVRTAS GAPGLPAVRA ALDAQ