Gene BURPS1106A_A1930 details

Gene Information

Locus tag: BURPS1106A_A1930
Symbol: proX
ID: 4905057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1890003
End bp: 1890989
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 64%
IMG OID: 640145036
Product: glycine betaine/L-proline ABC transporter, periplasmic glycine betaine/L-proline-binding protein
Protein accession: YP_001075964
Protein GI: 126455720
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
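
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are inclusive and that the gene length counts the terminal stop codon; the variable names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1890003
end_bp = 1890989
gene_length_bp = 987
protein_length_aa = 328

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, so it has one more
# codon than the protein has residues.
expected_cds_bp = 3 * (protein_length_aa + 1)

print(span_bp == gene_length_bp)          # coordinates agree with the gene length
print(expected_cds_bp == gene_length_bp)  # protein length agrees with the gene length
```

Under those two assumptions both checks hold for this record: 1890989 - 1890003 + 1 = 987, and 3 x (328 + 1) = 987.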


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.111406
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGACGA CATCTTTGAT GATGAGGAGA AGCACGATGA AGCGAAATCT GATCGCGGCG 
GCCTGCGGGC TCGCCATCGC GGCCGCGCCG TTCGCGAGCG CCCGGGCGGG CGATGCGCCG
ACCTGCAAGG CGGTGCGCTT TGCGGATGTC GGCTGGACCG ACATCGCCGC GACGACGGGG
CTCGCGTCGA CGATGCTCGC CGGGCTCGGC TATGCGCCGA CGAAGACGAT CGCTTCGGTG
CCGATCACGT TCGCGGGGAT CAAGAGCAAG CAGATCGACG TGTTTCTCGG CTACTGGTCG
CCGACGATGG ACCCGATGAT CGCGCCGTTC ACGAAGGCGG GCACGATCAA GGTGCTCGCC
GCGCCGAATC TGACGGGCGC GAAGTACACG CTCGCCGTGC CCGATTACGT GTATCAGGGC
GGCCTGAAAT CGTTCGCCGA CATCCAGAAA TACGCGGACA AGCTCAACGG CAGGATCTAC
GGGATCGAGC CCGGCAACGA CGGCAACGCG CTCATCAAGA AGATGATCGA CGGCAACCAG
TTCGGCCTCG GCAAGTTCAA GCTCGTCGAA TCGAGCGAGG CGGGGATGCT CGTCGAGGTG
AACCGCGCGA TCCGCGACAA GCAGTGGATC GTGTTCCTCG GCTGGGAGCC GCATCCGATG
AACGTGCAGA TGAAGATCGA TTACCTGAGC GGCGGCGACG ACGTGTTCGG CCCGAACTAC
GGCGAGGCGA AGGTGCTGAC CGCCACGCCG CCCGATTACG CGGCGCGTTG CCCGAACGTC
GCGAAGTTCG TGTCGAACCT GCAGTTCACG ACATCGATCG AGAACCATGT GATGCTGCCG
ATCATGAACA AGGAAGACCC GAACAAGGCG GCGGCCGAAT GGCTGAAGGC GAATCCGCAG
TCGCTCGACA AGTGGCTCGC CGGCGTGACG ACGTTCGACG GCAAGCCGGG GCTGCCGGCC
GTCAAGCACT ACCTCGGCAT TCAGTAA
 
Protein sequence
MLTTSLMMRR STMKRNLIAA ACGLAIAAAP FASARAGDAP TCKAVRFADV GWTDIAATTG 
LASTMLAGLG YAPTKTIASV PITFAGIKSK QIDVFLGYWS PTMDPMIAPF TKAGTIKVLA
APNLTGAKYT LAVPDYVYQG GLKSFADIQK YADKLNGRIY GIEPGNDGNA LIKKMIDGNQ
FGLGKFKLVE SSEAGMLVEV NRAIRDKQWI VFLGWEPHPM NVQMKIDYLS GGDDVFGPNY
GEAKVLTATP PDYAARCPNV AKFVSNLQFT TSIENHVMLP IMNKEDPNKA AAEWLKANPQ
SLDKWLAGVT TFDGKPGLPA VKHYLGIQ
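
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It is an illustration only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons). Only the first and last rows of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI's compact TCAG ordering.
# Translation table 11 uses the same assignments for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(sequence.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned, in-frame DNA string; stop codons are shown as '*'."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGTTGACGA CATCTTTGAT GATGAGGAGA AGCACGATGA AGCGAAATCT GATCGCGGCG"
last_row = "GTCAAGCACT ACCTCGGCAT TCAGTAA"

print(translate(clean(first_row)))  # MLTTSLMMRRSTMKRNLIAA
print(translate(clean(last_row)))   # VKHYLGIQ*
print(gc_fraction(clean(first_row)))
```

The two translated rows match the start and end of the protein sequence listed above, with the trailing `*` marking the TAA stop codon. Applying `gc_fraction` to the full 987 bp sequence would give the value to compare against the GC content field.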