Gene Information |
Locus tag | BURPS668_3909 |
Symbol | |
ID | 4884622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3808721 |
End bp | 3809647 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640129837 |
Product | toxin |
Protein accession | YP_001060903 |
Protein GI | 126442145 |
COG category | [R] General function prediction only |
COG ID | [COG4128] Zonula occludens toxin |
TIGRFAM ID | |
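The coordinate and length fields in this record are related by simple arithmetic. A minimal sketch of that check, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the function names are illustrative and not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(3808721, 3809647)
print(length, protein_length_aa(length))  # 927 308
```

Both values agree with the Gene Length and Protein Length fields reported for BURPS668_3909.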
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.113281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACGT TGATCACAGG GGTTCCGGGG AGCGGGAAGA CGCTGCATGC GGTCTGGTTG TTGACGAAGA TCTCGAAGGG GCGTCGCGTG CTGGTCGACG GCATTCGCGA TCTGGCGATC GAGCACGTCG AGATTGACGA GCCTTGGTTA CGCCAATGGC ACGAAAAGGC CGAAGCGCAC GATCTGATCG TGGTCGATGA GGCACAACGC ATCTATCCGC CGACGACGGT AAGCCAAAAG CCGACGCCGG ACGTGGAGCA ACTGCACGTG CATCGTCACA AGGGCGTCGA CTTCATCCTC ATCACGCAGC ATCCGCAGCG GATCAGCAAG ACGGTGCGCG ATCTGGTCGG GCGACATATC CACGTGCGCA ACCTGTTTGG GCTCAAGCGC GCGATGCTCT ACGAGTGGGA CCATTGCCAC AATCCAAGCA GCCTGAAAGA CGCGGTGAAA CGGCAATGGG CGTATCCGCG CGAGGTGTTC AAGCTCTACA CGAGCGCCGA AGTCCACACG AAGAAACAGG CGGTCGTGCC CAAGGCGTTG TTCGTCGTGC CGATCGCGTT GGGCGTGTTG CTGTACTGCT CGGTGAAGTT CTTCTACAGC GCGCGCGACG GCTTCGGGGT GACGCCCGGC ATGTCGGAAG CCGAGCACGA AAGAACCGAC GCCCCCACGC AGTCGCCGCA ACCCGCGAAG GCTGAGCGCG CGTCGGCCGC GCCCCTCTCG TCCGATTGGC GCATTGCCGG GCGCTACGTA ACGGAAGGCG TGGGCTATGT GGTGCTCGTT GCGGCGGATG GCCGGTTGCG CCCGATGTCA TTGGCGGGAT TCAGCGGCGA AGGAATGCTG CTGACGGGGG ATGTCGACGG AAAGACCGTT GGCGCTTGGA CCGGCGCACA CGCCGGTAAA ACAGAACAAG GCGGGGGCAT GCAATGA
Protein sequence | MITLITGVPG SGKTLHAVWL LTKISKGRRV LVDGIRDLAI EHVEIDEPWL RQWHEKAEAH DLIVVDEAQR IYPPTTVSQK PTPDVEQLHV HRHKGVDFIL ITQHPQRISK TVRDLVGRHI HVRNLFGLKR AMLYEWDHCH NPSSLKDAVK RQWAYPREVF KLYTSAEVHT KKQAVVPKAL FVVPIALGVL LYCSVKFFYS ARDGFGVTPG MSEAEHERTD APTQSPQPAK AERASAAPLS SDWRIAGRYV TEGVGYVVLV AADGRLRPMS LAGFSGEGML LTGDVDGKTV GAWTGAHAGK TEQGGGMQ
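Both sequences are printed in space-separated blocks of ten. The following is a minimal sketch of how one might strip that grouping, compute GC content, and translate the coding sequence. The helper names are hypothetical, and the codon table is written out by hand in the NCBI base order TCAG. It relies on translation table 11 assigning the same amino acids as the standard code; the two tables differ in which codons may act as start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence and drop the trailing stop codon.

    Assumption: an alternative start codon (e.g. GTG or TTG) is NOT
    converted to M here; this gene starts with ATG, so it does not matter.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any incomplete final codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[codon] for codon in codons).rstrip("*")


# First three blocks of the gene sequence above.
print(translate("ATGATCACGT TGATCACAGG GGTTCCGGGG"))  # MITLITGVPG
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_percent` can be compared against the reported GC content.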