Gene BURPS1106A_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1994 
Symbol 
ID4900183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1957113 
End bp1958333 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content76% 
IMG OID640135224 
Productamine ABC transporter, permease protein 
Protein accessionYP_001066259 
Protein GI126455455 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.455758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCGC GCACGGCGGG CGGCGCGGCC GCGCCCGCTC CCGCGCCGCG CCCGCGCGCG 
ATGCCCGCGT GGGCCGCGCG CGTCGACAAG GTCGGCGTGC TGATCGCCGC GCTCGTCGCG
TACGCGGCGT TCGTGCTGCC GTTCGTCACG CTGCGCGCGA ACCGGATCGC GGCGGGCGCG
GAGCTCGCGC CCGCCGCGGT GTTTCCGGCG CTCCACGCGT ACGCGCTCGA CGCGCTGTGG
GCGGCGGGCG CGCTGTTCGC GCTCGTGCAC AGCCGCGCGG CATGGCGCGC GGCCGTCGGC
GTCGGGCTCG TGTTCGCGCT GGGCGTGGCG ATCGGCGCGG CGCCCGCGCA TCTCGTCACG
CCGGATACGC CGCTCGCGCG CGTGTCGCCC GCGGCGGGCG CGTGGCTGCT GCTGTTCGCG
TTCGCGGTGC TGATCGCCGA CGCGCTCGCC CGGATCGCGC TCGCGCCCGC GATGCGCCTC
GTCGCGCTCG CCGCGGCGAG CGCCGCGCTC GCGGCATTCA TTCACGGCGG CTTCTGGGAC
GGGCTGTCGG TGATGCAGGA ATACGCGGTG CGCGCCGATA CGTTCCGCAA CGAGGCGATC
CGGCATCTCG CGCTCGTCGC CGGCTCGGTG GCGGCGGCCG TCGCGCTCGG CGTGCCGCTC
GGCATCGGCT GCACGCGCTC GGCCGCGCTG CGCGGCGCGT TGCTGCCGCT GCTGAACGTC
GTGCAGACGA TCCCGAGCAT CGCGCTGTAC GGCCTGCTGA TGGCGCCGCT CGCGATCCTC
GCCGCGCGCG TGCCGCTCGC CGCCCGCCTC GGCGTGAGCG GCATCGGCGT CGCGCCCGCG
CTGATCGCCC TGTTCCTGTA TGCGCTGCTG CCGATCGTGT CGAGCGTCGT CGTCGGATTC
GCGCAGGTGC CCGCCGCCGT CGTCGAGGCC GCGCTCGCGA TGGGGATGAC GGGCCGCGAG
CGGCTCGTCG CGATCGAGCT GCCGCTCGCG CTGCCCGTCG TGCTTTCCGG CGTGCGCATC
GTGCTCGTGC AGAACATCGG CCTCACGGCC GTCGCCGCGC TGATCGGCGG CGGCGGCTTC
GGCACGTTCA TCTTCCAGGG GATCGGCCAG TCGGCGACCG ATCTCGTGCT GCTCGGCGCG
CTGCCGACGA TCGCGCTCGC GCTCGTCACC GCCGTGCTGT TCGAGGCCGC GACCGACCTA
GCGAAAGGAG CGCGCCGATG A
 
Protein sequence
MTPRTAGGAA APAPAPRPRA MPAWAARVDK VGVLIAALVA YAAFVLPFVT LRANRIAAGA 
ELAPAAVFPA LHAYALDALW AAGALFALVH SRAAWRAAVG VGLVFALGVA IGAAPAHLVT
PDTPLARVSP AAGAWLLLFA FAVLIADALA RIALAPAMRL VALAAASAAL AAFIHGGFWD
GLSVMQEYAV RADTFRNEAI RHLALVAGSV AAAVALGVPL GIGCTRSAAL RGALLPLLNV
VQTIPSIALY GLLMAPLAIL AARVPLAARL GVSGIGVAPA LIALFLYALL PIVSSVVVGF
AQVPAAVVEA ALAMGMTGRE RLVAIELPLA LPVVLSGVRI VLVQNIGLTA VAALIGGGGF
GTFIFQGIGQ SATDLVLLGA LPTIALALVT AVLFEAATDL AKGARR