Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3121 |
Symbol | |
ID | 4884948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3060622 |
End bp | 3062115 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640129049 |
Product | hypothetical protein |
Protein accession | YP_001060133 |
Protein GI | 126442175 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.320306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGCGC TCGCCGGCGG GGCGCAACAT GGCTCGATCC ATGCGAGCGC GCTCGGTTCG GCCGACGCAT CGATCAGCAA CGTCGCACTC GACAACTTGA CGGGTACGCT TGCGAACGCG GCGAGCACGC TGCAAAACGC GGCGGCGAAC AACCCGCTGT CGGGCGTCGC GAGCAATGCA GTCGGCACGC TGACGAATCT GGCGAACAAC AATCCGTTGC CCGGCGCACT GAGCAATGCG GTGGGCACGC TGCAAAACGC AGCGGCGAAC AACCCGCTGT CGGGTGTCGC GAGCAACGCG GTCGGCACGC TGACGAATCT GGCGAACAAC AATCCGTTGC CGGGTGCGTT GAGCAATGCG GTGGGCACGC TGCAAAACGC GGCGGCGAAT AGCCCGCTGT CGGGCGTCGC GAACAACGCG GTCGGCACGC TGACGAATCT GGCGAACAAC AATCCGCTGC CGGGTGCGCT GAGCAATGCG GTGCGCACGC TGCAAAACGC GGCGGCGAAC AACCCGCTGT CGGGCGTCGC GAGCAACGCG GTCGGCACGC TGACGAATCT GGCGAACAAC AATCCGCTGC CGGGTGCGCT GAGCAATGCG GTGCGCACGC TGCAAAACGC GGCGGCGAAC AACCCGCTGT CGGGCGTCGC GAGCAATGCA GTCGGTACGC TGACGAATCT GGCGAACAAC AATCCGTTGC CCGGCGCACT GAGCAACGTG GCTGGCGTAC TGGCCGGCGC CGCCGGCAAC GTGACGGGCG GGCTCGCCGG CGCGGGCAGC ACCGCGGCCG GCGCGATCTC GGGGGCGATG AGCAACAACC CGTTGCCGGG CGTCGTGAAC AACGTGGTCG GCACGCTCGC GAACGCGATC GGCAGCAATC CGATCACGCC CATCACGAGC GTCGCGAGCG GTCTCGCGAA CGCGCTTTCC GTTGCCAATC CCGCCGCGCT GACCGCGGCC GCAAACACCG TCGCGGGCAC GCTCGCGCGC GCGGCGAACG GCACCCCGGT CGCGGGCGCG ATCGGCGGCC TCGTGGCCGC GCTGCCCGTC GCTAATCCGG CCGGCGCGCT GACGAGCGCG GCGAACAACG CAGCAAGCAC GATCGCGACG GTGGCGGGCA CCAACCCGGC TGCCGCGATC GGCGGCGTCG CGGGCGCATT GACGGGGGCG GCCGGCACCG GCGTGGCGAC GGCCTCGCAA CTCGGAAGCG TCGGCTCGGC GCTGATGGGT TCGGGCGCGG CCTCGGCTGG CAAGGTTTTG ACGTCGGGCA GCGCCGCATT CGGCAGCGCG GCCGCATCGG CCGGCTTGCT GCTGACGACG GGAGCGGCCG CCGCGAGCTC GGTCGTCAAT TCGCTGGGCT CGTCGGTCGG CGCAGTGGTG GCGTCGCTGC CGAACCTGAG CGTGTCGTCG TCGAAGTCGA CGGCTGCGGC GTCGAATCCG CTGGCACCCG TCTCGTCGAT GGTCGCGACG CTCGTCGGCG CGCTGCCGAA GTAA
|
Protein sequence | MLALAGGAQH GSIHASALGS ADASISNVAL DNLTGTLANA ASTLQNAAAN NPLSGVASNA VGTLTNLANN NPLPGALSNA VGTLQNAAAN NPLSGVASNA VGTLTNLANN NPLPGALSNA VGTLQNAAAN SPLSGVANNA VGTLTNLANN NPLPGALSNA VRTLQNAAAN NPLSGVASNA VGTLTNLANN NPLPGALSNA VRTLQNAAAN NPLSGVASNA VGTLTNLANN NPLPGALSNV AGVLAGAAGN VTGGLAGAGS TAAGAISGAM SNNPLPGVVN NVVGTLANAI GSNPITPITS VASGLANALS VANPAALTAA ANTVAGTLAR AANGTPVAGA IGGLVAALPV ANPAGALTSA ANNAASTIAT VAGTNPAAAI GGVAGALTGA AGTGVATASQ LGSVGSALMG SGAASAGKVL TSGSAAFGSA AASAGLLLTT GAAAASSVVN SLGSSVGAVV ASLPNLSVSS SKSTAAASNP LAPVSSMVAT LVGALPK
|
| |