Gene Information |
Locus tag | BURPS1106A_A1982 |
Symbol | |
ID | 4903883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1951623 |
End bp | 1952807 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640145088 |
Product | ApbE family protein |
Protein accession | YP_001076016 |
Protein GI | 126457068 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
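The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 1951623
END_BP = 1952807
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 394


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 1185
print(protein_length(GENE_LENGTH_BP))  # 394
```

Under these assumptions both computed values agree with the listed Gene Length (1185 bp) and Protein Length (394 aa).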
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.63389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAAGA CGTCTATTGA ATGGTCGCCC GGCGCGCGGC TGAATCGCTG CCGCGCAAGC GGCGCGACGA TGGGCACGCG CTACGGCGCG CAGTTCTACG CGCCGCCGAC GGCCGACGCG AGGGCGATCG CGGCCGCCCT CGACGCGGCG GTGCGGGCGG TCGACGCGCA GATGTCGAAC TGGAAGGCCG ATTCGGATCT GTCGCGGCTC AATCGCGCGA CGCCCGGAAG CTGGACGCCG ATCTGCGCGA ACCTCGCCGC GGTGCTCGTG CGCGCGCGGG AAATCGGCCG CGAGACGGAC AACGCGTTCA ACATCGGCGT CGGCACGCTC GTCGATCGAT GGGGATTCGG GCCGGGCGCG GCCGCGAACC GACAAGCGGA CAACGAATGG GCGGCGAATC GACAGGCGGC CGGCCGACAC ACGGTTGATC GACGTACGGT TGATCGACAC ACGGCGGACC GACACACGGC GGACCGACAA ACGAAGGACG GGCGCACGCC GGACCGCCAG CCGGCGCGCC CGGCCGACCC CGCGAACGGG TTGTCGGGCG CGATCGACGC GCGCCGCCGC GCGTCGATCC TGCGCGGCCC CGTGCCGTCG CCGTGCCGCC CGATCGACGA ACTGCTCGAA GTCGATGTCG CGCGGGGCCG GGCGCGCCGG CTCGCGGACG TCGCCTTCGA CCTGTGCGGG ATCGCGAAGG GCTTCGGCGT GGACGAGCTT GCGCGCGTGC TCGATCGCCA CGGCATCGGC GCATGGCTCG TCGGCATCGA CGGCGAGCTG CGCGCGCGCG GATGCAAGCC GGACGGCTCG CCGTGGGCGA TCGCGCTCGA AGCGCCCGAC TACGGCCGGC GCGGCGCGAT GGGCGCGATC GATCTCGTCG ACGCGGCCGT CGCGACCTCC GGCGATTACC GGCATTGGGC CGACTTCGGC GGCGAACGCC TCTCGCATAC GATGGACCCG CGCGCCGGCG CGCCGCTGCG CGGCGACATC GCCTCGGTCA CGGTCGTCGC GCCGACCTGC ACCGACGCGG ACGCGTACGC CACCGCGTTG ATGGTGCTCG GCGCGCAGGC GGGATGCGCG CACGCCGAAC GCCACGGGCT CGACGCGCTG TTCGTCGTGC GCGACGGCGA CGCGCTGCGC ACGATCGGCT GCGGCGCTTT CGCGGACGCG GGGCCGGCGG GCTGA
|
Protein sequence | MSKTSIEWSP GARLNRCRAS GATMGTRYGA QFYAPPTADA RAIAAALDAA VRAVDAQMSN WKADSDLSRL NRATPGSWTP ICANLAAVLV RAREIGRETD NAFNIGVGTL VDRWGFGPGA AANRQADNEW AANRQAAGRH TVDRRTVDRH TADRHTADRQ TKDGRTPDRQ PARPADPANG LSGAIDARRR ASILRGPVPS PCRPIDELLE VDVARGRARR LADVAFDLCG IAKGFGVDEL ARVLDRHGIG AWLVGIDGEL RARGCKPDGS PWAIALEAPD YGRRGAMGAI DLVDAAVATS GDYRHWADFG GERLSHTMDP RAGAPLRGDI ASVTVVAPTC TDADAYATAL MVLGAQAGCA HAERHGLDAL FVVRDGDALR TIGCGAFADA GPAG
|
| |
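The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a sketch under stated assumptions: the spaces between 10-character blocks are treated as layout only, and the codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove layout whitespace (10-character blocks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence; stop codons appear as '*'."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))


# First three blocks and last five codons of the gene sequence listed above.
print(translate("ATGTCGAAGA CGTCTATTGA ATGGTCGCCC"))  # MSKTSIEWSP
print(translate("GGGCCGGCGG GCTGA"))                  # GPAG*
```

The two printed fragments match the start (MSKTSIEWSP) and end (GPAG) of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field.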