Gene Information |
Locus tag | BURPS1710b_1658 |
Symbol | |
ID | 3688195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 1766628 |
End bp | 1767638 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637728114 |
Product | gp38 |
Protein accession | YP_333061 |
Protein GI | 76809256 |
COG category | [S] Function unknown |
COG ID | [COG4422] Bacteriophage protein gp37 |
TIGRFAM ID | |
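The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1766628
END_BP = 1767638


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1011, matching "Gene Length"
print(protein_length(length_bp))  # 336, matching "Protein Length"
```

Under these assumptions both computed values agree with the table.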
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000388971 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGCCCTC GACGCCGCCC GCACCCAAGG GAGCAAATCG TGAGCGAGAA CAGCAAAATC GAATGGTGCG ACCACACGTT CAACCCGTGG GAAGGTTGCC AGAAGGTCGG TCCGGGATGC GACCACTGTT ATGCCGAGGC GCGCAATGCG CGCTTTTCCG GCGGCACGGC GGTCAATTGG GGGCCCGGCG CGCCGCAGCG GCGCACGTCG CCCGCGAACT GGCGCAAGCC CATGAAGTGG AACCGCGACG GCGCGTTCTA TGCAATCCAC GGCCACCGAC AGCGCGTGTT CTGCGCGTCG CTCGCCGACG TTTTCGACAA CACTGTCGAT CCGGCGTGGC GCGCGGACCT GTTCCGCCTG ATCGCCGACA CGCCAAACCT CGACTGGTTG TTGCTGACGA AGCGCATCGG CAACGTCGCG GCGATGCTAC GCGAGATCGG AATCGACCGA CTGCCGGATA ACGTCTGGAT CGGCGCGACG ATCGTCAACC AGGAAGAGGC CGACCGCGAC ATCCCGAAGC TGCTCGCAGT ACCCGCGCGC GTACGCTTCC TGTCGATGGA GCCGCTGCTT GGGTCGGTTG ACCTGCGCTT CCACATCTAC AGCGAGCCAA CCGGCAACTT CCGCACGCAC GGCGGCAAGC GCCAGTTCGA ACTACGCCGA CCAGCCGACG GCGGCCTACA TTGGGTGATC GCCGGCGGCG AAAGCGGCCA CGGCGCCCGC CCGATGCATC CCGACTGGGC TCGGTCGCTG CGCGACCAGT GCGCTGCCGC AGATGTGCCG TTCCTGTTCA AGCAATGGGG TGAACACTCT CTCGCCTACG ACCGCGATCG GGACGATCCG GACTATCGTC GGTGCGATCG CATGGCTCGC CTACCCGGCC GCTGGATCAA TCTGGCAGGC GGACACGGCT TCAATGGCGA ACGCGTCCAT TATGCAGAGC GCGTCGGCAA GAAAGCCGCC GGCCGGCTGC TCGACGGCCG CACGCACAAC GAATTCCCGG AAGATCGATG A
|
Protein sequence | MRPRRRPHPR EQIVSENSKI EWCDHTFNPW EGCQKVGPGC DHCYAEARNA RFSGGTAVNW GPGAPQRRTS PANWRKPMKW NRDGAFYAIH GHRQRVFCAS LADVFDNTVD PAWRADLFRL IADTPNLDWL LLTKRIGNVA AMLREIGIDR LPDNVWIGAT IVNQEEADRD IPKLLAVPAR VRFLSMEPLL GSVDLRFHIY SEPTGNFRTH GGKRQFELRR PADGGLHWVI AGGESGHGAR PMHPDWARSL RDQCAAADVP FLFKQWGEHS LAYDRDRDDP DYRRCDRMAR LPGRWINLAG GHGFNGERVH YAERVGKKAA GRLLDGRTHN EFPEDR
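The relationship between the two sequences can be sketched in a few lines of Python. The gene sequence as displayed already starts with ATG and ends with TGA, so the sketch translates it directly without reverse-complementing, even though the record lists the minus strand. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard code (table 11 differs mainly in the start codons it permits). The helper names `clean`, `translate`, and `gc_content` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGCGCCCTC GACGCCGCCC GCACCCAAGG"))  # MRPRRRPHPR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the protein sequence and the reported GC content to be rechecked in the same way.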
|