Gene BURPS1710b_A2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2122 
Symbol 
ID3694452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2585594 
End bp2586733 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content68% 
IMG OID637732376 
Productglycine betaine ABC transporter substrate-binding protein 
Protein accessionYP_337273 
Protein GI76817784 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGACTGG CAGCCGTTCC AGGATGCGAG CCAGCGTTAC ATGCGCAATC ACCTCGAACT 
CGACGCGCTC GAGGCAGCCG CGCGTTTTCC TCGTCCGCAC GCATGACGCC CGGCGTCACG
CATTTTCAAA TGGAGCAAGC GATGAAACGA TACGAATCCA TTGCGCGGCG GCTCGCGCGC
CGCGCGGCAG CCGCATCGCC GGCGTTCGCG GCGTTGGCAT GGTGCGCCGC GGCGGCCGCC
GCCACGACCA CGGCGGCCGC GGCGGAGCCG GCCGCCTGTC GCGACGTGCG GATGGCCGGC
CCCGGCTGGA CCGATATCGA AGCGACGAAC GCGCTCGCGG GCGTCGTGCT GAAGGCGCTC
GGTTACCGGC AGAGCGTGTC GAACCTGTCG GTGCCGATCA CGTATCAAGG TCTGAAGAAA
GGGCAGCTCG ACGTGTTCCT CGGCAACTGG ATGCCGGCGC AGGCGCCGCT CGTCAAGCCG
TTCGTCGACG CGCGCGCGAT CGACGTGCTC CACGCGAACC TGAGCCATGC GAAATTCACG
CTCGCGGTGC CGGACTACGT GGCGGCGGCG GGCGTGCATT CGTTCGCCGA CCTCGCGAAG
TACGCGCAGC GCTTCGGCGC GAAGATCTAC GGCATCGAGC CGGGCGCGCC GGCCAATCAG
AACATCTCGC GCATGCTCGC CGACAAGGCG CTCGGGCCGG CGAACTGGCA GCTCGTCGAA
TCGAGCGAGA CAGGGATGCT GACGCAGGTC GAGCGCGCGG TGCGCGAGCG CCAGTGGATC
GTGTTTCTCG GCTGGGAGCC GCACCTGATG AACACGAAAT TCCATCTCGT TTATCTGTCG
GGCGGCGACG CGTATTTCGG GCCGGACTAC GGCGGCGCGA CCGTCAACAC CGTCGCGCGC
GCGGATTTCG CGAGCCAGTG CGCGAATCTC GCGCGGCTGT TCCGACAAAT GACGTTCACC
GTCGATCTGG AGAACGGAAT GATCGCCGCG ATGCTGCAGG GCAAGCGCTC CGCCGTGGAT
GCCGCGCAAC ACGCGCTGCG TGCGAACCCG TCGCTCGTCG AAGCATGGCT CGACGGCGTG
CGCACCGCGA GCGGCGCGCC GGGCTTGCCT GCGGTGCGCG CGGCGCTCGA TGCGCAATGA
 
Protein sequence
MGLAAVPGCE PALHAQSPRT RRARGSRAFS SSARMTPGVT HFQMEQAMKR YESIARRLAR 
RAAAASPAFA ALAWCAAAAA ATTTAAAAEP AACRDVRMAG PGWTDIEATN ALAGVVLKAL
GYRQSVSNLS VPITYQGLKK GQLDVFLGNW MPAQAPLVKP FVDARAIDVL HANLSHAKFT
LAVPDYVAAA GVHSFADLAK YAQRFGAKIY GIEPGAPANQ NISRMLADKA LGPANWQLVE
SSETGMLTQV ERAVRERQWI VFLGWEPHLM NTKFHLVYLS GGDAYFGPDY GGATVNTVAR
ADFASQCANL ARLFRQMTFT VDLENGMIAA MLQGKRSAVD AAQHALRANP SLVEAWLDGV
RTASGAPGLP AVRAALDAQ