Gene Information |
Locus tag | BURPS668_A2756 |
Symbol | |
ID | 4885785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2628724 |
End bp | 2629602 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640132692 |
Product | decarboxylase family protein |
Protein accession | YP_001063748 |
Protein GI | 126442731 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1; [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
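The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the CDS ends in a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2628724
END_BP = 2629602
GENE_LENGTH_BP = 879
PROTEIN_LENGTH_AA = 292


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the span is 879 bp, and 879 / 3 - 1 gives 292 residues.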
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGCCCGA CGCCGCGCCG CGCGCGCACC GCGCCGAAGC GCCGCGCGCC ACTGTCGAGC ACTGTCGAGC GGCTCGTGTC GAGCCCGACG TATCGGCAAG CCGACGAGGA TCTCGCGTTC CTGCAGCGCC CCGAAATGTG CGGCGTGCGC TTGCAGCTCG ACTACTGGAA GACCGAGGAA ACGCTGCAAC GCTTCGGCAT CTGCGATACG GTGGTGGTCT ACGGCAGCAC GCGGATCGCG TCGCCCGCCG TCGCGCGCGC GAGGCTCGCC GACGCGCAGC ACCGGCTCGC CGAGCGTCCG AACGACCCCG AGCGCCGCCA TGCGGTCAGC GTCGCGGTGC GGCTGCTCGA GCGCAGCCAC TACTACGGCG TCGCGCGCGA TCTCGGCCGG CTCGTCGGCG AAACCGGCCG CTCGCCGCAT CCTCGGCGCC TCACGATCAT CACGGGCGGC GGCCCCGGCA TCATGGAGGC GGCCAATCGC GGCGCGCACG AGCGGGGCGC GCCGAGCATC GGGCTCAACA TCACGCTGCC GCGCGAGCAA TTCCCGAATC CCTACGTGAC GCCCGAGCTG TGTTTTCGCT TCCATTACTT CGCGATCCGC AAGCTGCACC TGCTCGAACG CGCGAAGGCC GCGGTATTCT TTCCCGGCGG CTACGGCACC TGCGACGAGC TGTTCGAAGT GCTGACGCTG TTGCAGACCC GCAAGATCGC GCCGCTGCCC GTCGTGCTCG TCGGCCGCGC GTTCTGGCGC TCGGCGGTCG ATTTCGGGTT TCTCGTCGAC GAAGGAATGA TCGACCCGTG CGACGCAGCG CTGTTCCGGT TCTGCGAAAC CGCCGACGAG ATCTGGGCCG CGATCGGCGG CCCGCACGGG CCGGCCTAG
Protein sequence | MRPTPRRART APKRRAPLSS TVERLVSSPT YRQADEDLAF LQRPEMCGVR LQLDYWKTEE TLQRFGICDT VVVYGSTRIA SPAVARARLA DAQHRLAERP NDPERRHAVS VAVRLLERSH YYGVARDLGR LVGETGRSPH PRRLTIITGG GPGIMEAANR GAHERGAPSI GLNITLPREQ FPNPYVTPEL CFRFHYFAIR KLHLLERAKA AVFFPGGYGT CDELFEVLTL LQTRKIAPLP VVLVGRAFWR SAVDFGFLVD EGMIDPCDAA LFRFCETADE IWAAIGGPHG PA
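The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so the standard assignments are used here. Because this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; these are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not converted to M; that is a simplification.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First six 10-base blocks of the gene sequence in this record.
    head = ("ATGCGCCCGA CGCCGCGCCG CGCGCGCACC "
            "GCGCCGAAGC GCCGCGCGCC ACTGTCGAGC")
    print(translate_cds(head))  # MRPTPRRARTAPKRRAPLSS
```

The output matches the first two 10-residue blocks of the protein sequence. Passing the full gene sequence instead of `head` allows the whole 292-residue product to be compared in the same way.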