Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1791 |
Symbol | |
ID | 4884609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1766235 |
End bp | 1767821 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640127719 |
Product | hypothetical protein |
Protein accession | YP_001058830 |
Protein GI | 126441249 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0910077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCGT TCAAATCATC GCTGCGTCGC GGGTTGCGCG TTTCGGGATG CGCGCTATGG TTGGCGCTGT CGAAGCTCGA CGCCGCACAT GCGGCTGCCG GACCCGACGC ATCGCCGGAC GCATGGCCGA GCGCCGCGCT GTCCGATCGC GCCGCCGTCG AGGTGGACGC ATCGCAATTG CTGCCGACGC CCCAGCTCGT GCGGTGGCAA GTCGACCTCG ACCGCCGCGG CCTGCGATCG ACCGGTTCGC CCGCGCACGA GCGCTACATC GACGTGCTGC GCCGCCGCCT CGCGCGCGCC GGCGTGCAGC AGCTCCATAC CGAGGCGACG ACGCTCGCGC GCTGGGCCGT GCGGCACTGG CGGCTCGACG TCGCCGACGG CGCGCCGCGC GAGCGCATCG ACGTCGCGGG CTACCTTCCG TATTCCGGCG ACACCGGCCG CGACGGCATC GTCGCGCCGA TCCGCTATCT CGCGGCCGGG CAGACGCCCG ATGCGGACGT GGCCGGCAAG ATCGCCGTCG TCGAATGGCC GGCATTGCCG TTGACGGGCG CGTTCTTCCG CGAGCGCGCG CTGCGCGTGT TCGATCCGGA CAACGCGTTC GCGCCGTCGG CGCCCTATGT GCGCACGAGC TTCATGCTCG GCACGCTCAC CGCGATGCTC GACAGGCTGC AAGCGGCGGG CGCGGCGGGC GTCGTGATGA TCGCCGATAC GTCGAGCGCC GAGGCGACCC GCCTGTACGC GCCGTACGAC GGCCGGCTGC GGCGCGTGCC CGGCCTGTTC GTCGATCGCG CGACGGGCGC GAAGCTCGCT TCGCTCGCGG AGCGCCGCGC GACGCTGCGG CTGTCGCTCG ATGCCGGCGT CGAGCGCGTG CACACCCGCA ACCTGATCGG CATCATCCCC GGCATGAGCG ATGAGCTGAC CGTCGTCAAC AGCCACACCG ACGGCACGAA CGGAATCGAG GACAACGGCC CGAACGCGAT CGTCGCGATC GCGCAATACC TGAGCCGCCT GCCCCGCGCG GCGCTGCCTC GCACGGTGAT GATCCTGCTG TCGAGCGGGC ATTTCGCGGG AGGCGTCGGC GCCGAGGATT TCATCGCGCG ACATGCGCGC GACGGCCTCG TCGCGCGCAT CGCGTGCGTC GTGACGATCG AGCACCTCGG CGCGCAGGAA TGGCTGCCGA ACGCGCAAGG CGCGCTCGCG CCGACGGGCC GCGCGGAGCC CACCGCGCTG TTCATGCCGG CCGTGCCGGC GCTCGTCGAC GCGGCCGACG CGCTCGTGCG CCGCGCGAAT GCCGCGCCCG CGTTCGTGAT GCCGCCGCTG AATCCGAACG GAGACGGCAG CGCGAACGAC GCGCTCTGGC CCGGCGAAGG ACAGTACTTC TGGGGCCGCG CGCGCGTGCC GACGATCAAC CTGATCACGG GGCCCACGTA TCTCCTCAAT TACGGCGTAT CGACGGCTAA GAAGATCGAC TATGCGCGCC TGCGCCGCGA GATCGCCGCG ACGACGCAGA TGCTGCTCGA TCTGTCGCGG GTGCCGTTCG ACGCGCTGCG CGCGGTTCCA CCGCAAATGC GCGCGGCGGC GCCGTGA
|
Protein sequence | MASFKSSLRR GLRVSGCALW LALSKLDAAH AAAGPDASPD AWPSAALSDR AAVEVDASQL LPTPQLVRWQ VDLDRRGLRS TGSPAHERYI DVLRRRLARA GVQQLHTEAT TLARWAVRHW RLDVADGAPR ERIDVAGYLP YSGDTGRDGI VAPIRYLAAG QTPDADVAGK IAVVEWPALP LTGAFFRERA LRVFDPDNAF APSAPYVRTS FMLGTLTAML DRLQAAGAAG VVMIADTSSA EATRLYAPYD GRLRRVPGLF VDRATGAKLA SLAERRATLR LSLDAGVERV HTRNLIGIIP GMSDELTVVN SHTDGTNGIE DNGPNAIVAI AQYLSRLPRA ALPRTVMILL SSGHFAGGVG AEDFIARHAR DGLVARIACV VTIEHLGAQE WLPNAQGALA PTGRAEPTAL FMPAVPALVD AADALVRRAN AAPAFVMPPL NPNGDGSAND ALWPGEGQYF WGRARVPTIN LITGPTYLLN YGVSTAKKID YARLRREIAA TTQMLLDLSR VPFDALRAVP PQMRAAAP
|
| |