Gene BURPS668_0696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0696 
Symbol 
ID4882417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp673402 
End bp674532 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content71% 
IMG OID640126624 
Producthypothetical protein 
Protein accessionYP_001057748 
Protein GI126441267 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGCG ACGCCGCCGC CGAGCGCGCG CAACGCATCG GCCTCGTCCT TTCCGACATT 
TCCGATAGCC GAATCCGTCG GCTTGCCAAC CGTGCGAACC TTCTTCCTGC TCTTCGTCGC
GCCGGGCTTG TCTGCCGGAC AACCGGCCTC CTGCACGCCC GCGGTGCGTA CCGCCTTCGG
CGGGAGCGGC CGATGAGCGC CGTCGCCTGC GCGCCGGCGC GCCCGTCGCT CGATCTGCGC
GCGATCGGCG TGATGATCCT GCTGTGCGCG ATCTGGGGCT TTCAACAAGT CGCGATCAAG
AGCGCGACGC ATGCGATTCC GCCGATGCTC CAGGCCGGGC TGCGCTCGGC GATCGCGGCG
GTCGGCGTGT GGGCGTGGGC GCGCGCGCGC GGCACGCCGA TCTTCCGCAC GGACGGCACG
TTAGGCGCCG GCATCGTCGC CGGCACGCTG TTCGCGGGCG AGTTCGTCTG TCTGTTCTTC
GGCCTCACGC TGACGAGCGC CGCGCGCATG GCGATCTTCC TGTACACCGC GCCGTGCTTC
ACCGCGCTCG GCCTTCACCT GTTCGCGCCG GGCGAGACAA TGCGCCGCCA GCAATGGGCG
GGCGTCGCGA TCGCGTTCGC GGGCATCGCG GTCGCGTTCG CCGACGGTTT CGCGCGACCG
GCCGCCGGCG GCGCATCGGC GCTCGCCGGA CTCGCGGGCG ACGCGCTCGG CGTGCTCGGC
GGCGTGATGT GGGCGGCGAC GACGGTCGTC GTGCGTTCGA CGTCGCTCGC GCACGCGAGT
GCGAGCAAGA CGCTGTTCTA TCAGTTGACC GTGTCGTCGG CGGTATTGCT CGGCCTCGCG
GTCGTCACCC GCCAGACGAC GTTCGCGAAC GTGACCCCGC TCGCCGTCGC AAGCCTGGCC
TATCAGGGCG TGATCGTCGC GTTCGCGAGC TATCTCGCGT GGTATTGGCT GCTCACGCGC
TACAGCGCGG CGCGGCTCTC GGTGTTCACG TTTCTCGCGC CGCTCTTCGG CGTGAGCTTC
GGCGTGCTGC TGCTCGGCGA TGCGATCGGC CCGCGCTTCG TCGCGGCGGC CGCGCTCGTG
CTCGCGGGCA TCGCGCTCGT CAACGCGCCG CCGCGCGGCG CTCGCAATTA G
 
Protein sequence
MRRDAAAERA QRIGLVLSDI SDSRIRRLAN RANLLPALRR AGLVCRTTGL LHARGAYRLR 
RERPMSAVAC APARPSLDLR AIGVMILLCA IWGFQQVAIK SATHAIPPML QAGLRSAIAA
VGVWAWARAR GTPIFRTDGT LGAGIVAGTL FAGEFVCLFF GLTLTSAARM AIFLYTAPCF
TALGLHLFAP GETMRRQQWA GVAIAFAGIA VAFADGFARP AAGGASALAG LAGDALGVLG
GVMWAATTVV VRSTSLAHAS ASKTLFYQLT VSSAVLLGLA VVTRQTTFAN VTPLAVASLA
YQGVIVAFAS YLAWYWLLTR YSAARLSVFT FLAPLFGVSF GVLLLGDAIG PRFVAAAALV
LAGIALVNAP PRGARN