Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2058 |
Symbol | |
ID | 4884534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2049091 |
End bp | 2050203 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640127986 |
Product | ApbE family protein |
Protein accession | YP_001059093 |
Protein GI | 126442057 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.673444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCCG GGTGCAGGAT CGCCTCCCGT CAATCGTTTT CGGAAATTCC ACCGTTGCCC GGTTTGAAGA AACTCGTCGG ATCGTCGCTC GCGCTCGCCG TCGTCGTCAC GCTGATGGCG TGGCTCGCGC TGAGGTCCCC GCAGGTATAC GTGCAGGGCA CGTACGTGTT CGGCACGCGC GTGCAGCTCG CGCTCTACGG CGTGCCGCTC GATCGCGCGC AGCAGGCGAC GAACGCGGTG TTCGCGCATT TCCAGGCGAT GCAGCGCGGG CTGCATGCGT GGCAGCCGTC GGAAATCACG CGCCTGAACC GGAGCATCGC CGCGGGCGAG CCGTTTCGCG CGTCGCCCGC GACGGCCGAG ATCCTGCGCG CCGCGCGCCG GCTGTCGCTC GACACGCAGG GGCTCTTCGA GCCCGGCATC GGGCGGCTCA TTCGCCTGTG GGGCTTCCAG TCCGATCAGT TCCGCGTTGC GCCGCCGTCG CCCGACGCGG TGCGGCGCGA GCTCGCGCGC GGCGCGCGCA TCGCCGATCT CGCGATCTCG CCCGACGGCG TCGTCACGAG CGCGAACCGC GCGGTCGCGA TCGATCTGGG CGGCTTCGCG AAGGGCTGGG CGCTCGACGA CGCGGCCGCG ATCCTCAGGC GGCAGGGCAT CGCCAACGCG CTGATCGACG TCGGCGGCAA TCTGCTCGCG CTCGGCAGCA AGGGCGGCGC GGCCTGGCGC GTCGGCGTGC AGGACCCGCG CAAGCCCGGC ACGCTCGCGA CGCTCGAGCT GCGCGACGGC GAGGCGATCG GCACGAGCGG CGATTACGAG CGCTTCTTCC AGGCGGAGGG CGTGCGCTAC TGCCACCTCA TCGATCCGCG CAGCGGCTTT CCCGCCGTGC AAAGCGAGGC GGTGACCGTG CTCGTCGCGC CCGGCCCGCA CGCGGGCGCG CTGTCCGACG GCGCGAGCAA GCCGCCGTTC ATCGCGGGGC GCGCGGCGAT GCCGCTCGCG CGCCGGCTCG GCGTGCAGGC CGTGCTGATC GTCGATGCGC AGGGGCGCGT GTGGGCGACC GACGCGATGG CCGCGCGCGC GCGCTTCGCC GATCCGGCGC TGCGCGCCGC CCGGCTCGAC TAA
|
Protein sequence | MSSGCRIASR QSFSEIPPLP GLKKLVGSSL ALAVVVTLMA WLALRSPQVY VQGTYVFGTR VQLALYGVPL DRAQQATNAV FAHFQAMQRG LHAWQPSEIT RLNRSIAAGE PFRASPATAE ILRAARRLSL DTQGLFEPGI GRLIRLWGFQ SDQFRVAPPS PDAVRRELAR GARIADLAIS PDGVVTSANR AVAIDLGGFA KGWALDDAAA ILRRQGIANA LIDVGGNLLA LGSKGGAAWR VGVQDPRKPG TLATLELRDG EAIGTSGDYE RFFQAEGVRY CHLIDPRSGF PAVQSEAVTV LVAPGPHAGA LSDGASKPPF IAGRAAMPLA RRLGVQAVLI VDAQGRVWAT DAMAARARFA DPALRAARLD
|
| |