Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2042 |
Symbol | |
ID | 4888298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1974650 |
End bp | 1975747 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640131980 |
Product | putative cell surface protein |
Protein accession | YP_001063037 |
Protein GI | 284159993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCGTCG CGTCGGCGAG CGTCGGCGGC CTCACGTTCG GCGGCTTCGC GGGCAGCGCG CCCATCGGCG TGTTCAGCGT CGGCGCACCG GGCGCGGAAC GCCAGATCAC GAACGTCGCC GCCGGCCGCA TCTCCGCGGC CAGCACCGAC GCCGTCAACG GCAGCCAGCT CTATGCGACC AACAGCAATG TCGCGTCGCT GTCGACCGGT CTGAACGCGA CCAACAGCAA CCTCGCGTCG CTGTCCACGT CCACCTCGAC CGCCGTCGGC TCGCTGTCCA CCGGCCTGTC CACGACCAAC AGCACCGTCG CCTCGCTGTC CACGTCGACC TCGACCAGCA TCGGCTCGCT GTCCACCGGC CTCTCGACCG CGAACAGCAA CCTCGCGTCG CTGTCCACGT CCACCTCGAC CGGCATCGGC TCGCTGTCCA CCGGCCTCGC GACGACCAAC AGCAATGTCG CGTCGCTGTC GACGAGCGTG ACCAACATCA ACACGCAGCT CACGTCGCTG TCGACGTCGA TCACGAACAA CGTGATCCGG TCGCTGCCCG CGAGCACCGG CGTCGCCGCG GACATGAGCG CGCCGAAGGC GACCTCGCCG TCCGTCACGG CCGGCTCGAA CTCGGTCGCG CTCGGCGCGG GCTCGAACGA CGGCGGTCGC TCGAACGTCG TGTCGGTGGG CAGCGACACG CAGCAGCGCC AGATCACGAA CGTCGCGGCC GGCACCGAGG GCACCGACGC GGTCAACGTC AACCAGTTGA ATACGCTGTC GACGTCGATG TCGCAATCGC TGTCGAATCA GCAAACGCAG CTCAACAATC TCGGCTCGCA ACTGAACCAG ACGCAGCAGC AACTGCAGCA GACCGACACG ATGGCCCGCC AGGGGATCGC GGCGGTCGCG GCGATGGCGT CGATTCCGCA CATGGACCGC GACTCGAACT TCGCGATGGG CGTGGGCACC TCTTCGTTCC TCGGCCAGAA GGCGATCGCG GTCGGCATGC AGGCGCGCAT CACCGAGAAC CTGAAGGCGT CGCTGAACGG CGGCTTCGCC GGCAATCAGA AGGTCATCGG CGCGGGCATG CTCTATCAGT GGAAGTAA
|
Protein sequence | MPVASASVGG LTFGGFAGSA PIGVFSVGAP GAERQITNVA AGRISAASTD AVNGSQLYAT NSNVASLSTG LNATNSNLAS LSTSTSTAVG SLSTGLSTTN STVASLSTST STSIGSLSTG LSTANSNLAS LSTSTSTGIG SLSTGLATTN SNVASLSTSV TNINTQLTSL STSITNNVIR SLPASTGVAA DMSAPKATSP SVTAGSNSVA LGAGSNDGGR SNVVSVGSDT QQRQITNVAA GTEGTDAVNV NQLNTLSTSM SQSLSNQQTQ LNNLGSQLNQ TQQQLQQTDT MARQGIAAVA AMASIPHMDR DSNFAMGVGT SSFLGQKAIA VGMQARITEN LKASLNGGFA GNQKVIGAGM LYQWK
|
| |