Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2113 |
Symbol | |
ID | 4899798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2105431 |
End bp | 2106489 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640135343 |
Product | ApbE family protein |
Protein accession | YP_001066378 |
Protein GI | 126452973 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCCGGTT TGAAGAAACT CGTCGGATCG TCGCTTGCGC TCGCCGTCGT CGTCACGCTG ATGGCGTGGC TCGCGCTGAG GTCCCCGCAG GTATACGTGC AGGGCACGTA CGTGTTCGGC ACGCGCGTGC AGCTCGCGCT CTACGGCGTG CCGCTCGATC GCGCGCAGCA GGCGACGAAC GCGGTGTTCG CGCATTTCCA GGCGATGCAG CGCGGGCTGC ATGCGTGGCA GCCGTCGGAA ATCACGCGCC TGAACCGGAG CATCGCCGCG GGCGAGCCGT TTCGCGCATC GCCCGCGACG GCCGAGATCC TGCGCGCCGC GCGCCGGCTG TCGCTCGACA CGCAGGGGCT CTTCGAGCCC GGCATCGGGC GGCTCATTCG CCTGTGGGGC TTCCAGTCCG ATCAGTTCCG CGTTGCGCCG CCGTCGCCCG ACGCGGTGCG GCGCGAGCTC GCGCGCGGCG CGCGCATCGC CGATCTCGCG ATCTCGCCCG ACGGCGTCGT CACGAGCGCG AACCGCGCGG TCGCGATCGA TCTGGGCGGC TTCGCGAAGG GCTGGGCGCT CGACGACGCG GCCGCGATCC TCAGGCGGCA GGGCATCGCC AACGCGCTGA TCGACGTCGG CGGCAATCTG CTCGCGCTCG GCAGCAAGGG CGGCGCGGCC TGGCGCGTCG GCGTGCAGGA CCCGCGCAAG CCCGGCACGC TCGCAACGCT CGAGCTGCGC GACGGCGAGG CGATCGGCAC GAGCGGCGAT TACGAGCGCT TCTTTCAGGC GGAGGGCGTG CGCTACTGCC ACCTCATCGA TCCGCGCAGC GGCTTTCCCG CCGTGCAAAG CGAGGCGGTG ACCGTGCTCG TCGCGCCCGG CCCGCACGCG GGCGCGCTGT CCGACGGCGC GAGCAAGCCG CCGTTCATCG CGGGGCGCGC GGCGATGCCG CTCGCGCGCC GGCTCGGCGT GCAGGCCGTG CTGATCGTCG ATGCGCAGGG GCGCGTGTGG GCGACCGACG CGATGGCCGC GCGCGCGCGC TTCGCCGATC CGGCGCTGCG CGCCGCCCGG CTCGACTAA
|
Protein sequence | MPGLKKLVGS SLALAVVVTL MAWLALRSPQ VYVQGTYVFG TRVQLALYGV PLDRAQQATN AVFAHFQAMQ RGLHAWQPSE ITRLNRSIAA GEPFRASPAT AEILRAARRL SLDTQGLFEP GIGRLIRLWG FQSDQFRVAP PSPDAVRREL ARGARIADLA ISPDGVVTSA NRAVAIDLGG FAKGWALDDA AAILRRQGIA NALIDVGGNL LALGSKGGAA WRVGVQDPRK PGTLATLELR DGEAIGTSGD YERFFQAEGV RYCHLIDPRS GFPAVQSEAV TVLVAPGPHA GALSDGASKP PFIAGRAAMP LARRLGVQAV LIVDAQGRVW ATDAMAARAR FADPALRAAR LD
|
| |