Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2030 |
Symbol | |
ID | 4905038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1997865 |
End bp | 1998911 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640145135 |
Product | hypothetical protein |
Protein accession | YP_001076063 |
Protein GI | 126458585 |
COG category | [S] Function unknown |
COG ID | [COG3520] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03347] type VI secretion protein, VC_A0111 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.23868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGCCG CGAATGGGGG CGCAATGCCT GCTCTAGAAC CCGTTCTGCT CGGCGAGGCG AAGCACTTCG CGTATTTCCA GGCGATCCGG CTGCTGCGGC GCATCGCGCG CGAACGGCGC GGCGACGCCG CGGGGCGGCC CGACGCGCCG CTGCCGATCC ACACGCGGCC GAATCTCTCG CTGTCGTTTC CGGACACCGA CGTCGAGCGC ATCGACAAGG CCGACGACGG CGGCTACCGG GTCGTCGCGA ACTTCTTCGG TCTGTACGGC GTGTCCTCGC CGCTGCCGAC GTTCTACACC GAGGACCTGA TCGACGAAGC GTTCAGGGGC CGCCACGCGG CGCGCGGCTT TCTCGACGTG CTGCATCGCG CGCTCTATCC GCTGCTGTTC GACGCGTGGC TCAAGCATCG GCTGGCGCTG CGGATCGTCG AGGAACGCGA CGAGCACGCG CTGCGCCCGC TCTACGCACT CGCGGGCGTC GACGCGCGCA TCGCGCGCGA CGCGGGCCTG CACGAGCACG CGCTGCTGCG CTACGTCGGG CTGCTCAGCC AGCGGCCGCG CTCCGCGGCC GGTCTGCGCG CGCTGCTCGC CGACGCGTTC GCGCCCGCGA CGGTCGACAT CGAGCCGTGC GTGCCGCAAT GGCTGCCGAT TCCGGACGAC CAGCGCACCC GCGTCGGCAC GCGCGCGCGC CGGCTCGGCG CCGACGCGCG GGTCGGCGCG CGCATGCGCG ACGACGGCGC CCGGCTGCGG ATCGTGCTCG GCGACGTGCC CGGCCCGCTG TTCCGCGCGC TGATGCCGGG CGGCGATGCG TTTGCCCGGC TGCGCTTGCT CGTACGCCTG TATCTGACCC AACCGTTCGC CGTGGACGTC GTGGTGCGCG TGCGCGCGCG CGACGCGGCC CCCGCGCGCT GCGGCCGCCG CGCGTGGTCG CGCGTCGGGC TCGACGCGTG GCTGGGCGGC CCGAGCGCGG AACGCGCGGC GTCGCCCGAA TTCCGTCTTC CGACTTCCCT CTTTGATCAA GCGAGACCCC ATCATGCTGC TGGTTGA
|
Protein sequence | MAAANGGAMP ALEPVLLGEA KHFAYFQAIR LLRRIARERR GDAAGRPDAP LPIHTRPNLS LSFPDTDVER IDKADDGGYR VVANFFGLYG VSSPLPTFYT EDLIDEAFRG RHAARGFLDV LHRALYPLLF DAWLKHRLAL RIVEERDEHA LRPLYALAGV DARIARDAGL HEHALLRYVG LLSQRPRSAA GLRALLADAF APATVDIEPC VPQWLPIPDD QRTRVGTRAR RLGADARVGA RMRDDGARLR IVLGDVPGPL FRALMPGGDA FARLRLLVRL YLTQPFAVDV VVRVRARDAA PARCGRRAWS RVGLDAWLGG PSAERAASPE FRLPTSLFDQ ARPHHAAG
|
| |