Gene BURPS668_2058 details


Gene Information

Locus tag: BURPS668_2058
Symbol:
ID: 4884534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2049091
End bp: 2050203
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 73%
IMG OID: 640127986
Product: ApbE family protein
Protein accession: YP_001059093
Protein GI: 126442057
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
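
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes that the start and end coordinates are inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2049091
end_bp = 2050203
gene_length_bp = 1113
protein_length_aa = 370

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted run of codons whose last codon
# is a stop codon, which does not contribute a residue to the protein.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 1113 371 370
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.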


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.673444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCCG GGTGCAGGAT CGCCTCCCGT CAATCGTTTT CGGAAATTCC ACCGTTGCCC 
GGTTTGAAGA AACTCGTCGG ATCGTCGCTC GCGCTCGCCG TCGTCGTCAC GCTGATGGCG
TGGCTCGCGC TGAGGTCCCC GCAGGTATAC GTGCAGGGCA CGTACGTGTT CGGCACGCGC
GTGCAGCTCG CGCTCTACGG CGTGCCGCTC GATCGCGCGC AGCAGGCGAC GAACGCGGTG
TTCGCGCATT TCCAGGCGAT GCAGCGCGGG CTGCATGCGT GGCAGCCGTC GGAAATCACG
CGCCTGAACC GGAGCATCGC CGCGGGCGAG CCGTTTCGCG CGTCGCCCGC GACGGCCGAG
ATCCTGCGCG CCGCGCGCCG GCTGTCGCTC GACACGCAGG GGCTCTTCGA GCCCGGCATC
GGGCGGCTCA TTCGCCTGTG GGGCTTCCAG TCCGATCAGT TCCGCGTTGC GCCGCCGTCG
CCCGACGCGG TGCGGCGCGA GCTCGCGCGC GGCGCGCGCA TCGCCGATCT CGCGATCTCG
CCCGACGGCG TCGTCACGAG CGCGAACCGC GCGGTCGCGA TCGATCTGGG CGGCTTCGCG
AAGGGCTGGG CGCTCGACGA CGCGGCCGCG ATCCTCAGGC GGCAGGGCAT CGCCAACGCG
CTGATCGACG TCGGCGGCAA TCTGCTCGCG CTCGGCAGCA AGGGCGGCGC GGCCTGGCGC
GTCGGCGTGC AGGACCCGCG CAAGCCCGGC ACGCTCGCGA CGCTCGAGCT GCGCGACGGC
GAGGCGATCG GCACGAGCGG CGATTACGAG CGCTTCTTCC AGGCGGAGGG CGTGCGCTAC
TGCCACCTCA TCGATCCGCG CAGCGGCTTT CCCGCCGTGC AAAGCGAGGC GGTGACCGTG
CTCGTCGCGC CCGGCCCGCA CGCGGGCGCG CTGTCCGACG GCGCGAGCAA GCCGCCGTTC
ATCGCGGGGC GCGCGGCGAT GCCGCTCGCG CGCCGGCTCG GCGTGCAGGC CGTGCTGATC
GTCGATGCGC AGGGGCGCGT GTGGGCGACC GACGCGATGG CCGCGCGCGC GCGCTTCGCC
GATCCGGCGC TGCGCGCCGC CCGGCTCGAC TAA
 
Protein sequence
MSSGCRIASR QSFSEIPPLP GLKKLVGSSL ALAVVVTLMA WLALRSPQVY VQGTYVFGTR 
VQLALYGVPL DRAQQATNAV FAHFQAMQRG LHAWQPSEIT RLNRSIAAGE PFRASPATAE
ILRAARRLSL DTQGLFEPGI GRLIRLWGFQ SDQFRVAPPS PDAVRRELAR GARIADLAIS
PDGVVTSANR AVAIDLGGFA KGWALDDAAA ILRRQGIANA LIDVGGNLLA LGSKGGAAWR
VGVQDPRKPG TLATLELRDG EAIGTSGDYE RFFQAEGVRY CHLIDPRSGF PAVQSEAVTV
LVAPGPHAGA LSDGASKPPF IAGRAAMPLA RRLGVQAVLI VDAQGRVWAT DAMAARARFA
DPALRAARLD
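
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the following Python joins the blocks of the first gene sequence line and translates them codon by codon. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Only the first 60 bases are included to keep the example short. The helper names (strip_blocks, translate, gc_fraction) are hypothetical and not taken from any database tool.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def strip_blocks(line: str) -> str:
    """Join the ten-character blocks of a sequence line into one string."""
    return "".join(line.split())


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a stop codon is shown as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGTCGTCCG GGTGCAGGAT CGCCTCCCGT CAATCGTTTT CGGAAATTCC ACCGTTGCCC"
)

dna = strip_blocks(first_line)
protein = translate(dna)
print(protein)  # prints: MSSGCRIASRQSFSEIPPLP
print(round(gc_fraction(dna), 2))  # GC fraction of this 60-base fragment only
```

The translated fragment matches the first two blocks of the protein sequence. The GC fraction printed here describes only this fragment, so it is not expected to equal the GC content reported for the whole gene.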