Gene Information |
Locus tag | BURPS668_1066 |
Symbol | |
ID | 4885101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1043098 |
End bp | 1044108 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640126994 |
Product | putative lipoprotein |
Protein accession | YP_001058116 |
Protein GI | 126438447 |
COG category | [S] Function unknown |
COG ID | [COG5430] Uncharacterized secreted protein |
TIGRFAM ID | |
|
|
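The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are inclusive 1-based coordinates and that the final codon is a stop codon that contributes no residue; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1043098
end_bp = 1044108

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1011 336
```

Both results match the listed Gene Length (1011 bp) and Protein Length (336 aa).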
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTTGAGA TAAGTCAAAA AATGACGATA ACGAATATCA GGCAGCGCGA TTCGAAGCAA TGGCTGCTCG CGCTGCTCGT CGCGTGGCTG CTCGCGTGCC CGTGGGGGGC GGCGCACGCC GAGACGTGCT CGGTCACGAC GCCCGCGCCG AATTTCGGCT CGGTCGATCC GATCACGCTC GCCGCCGTGT CGACGACCGC GACGATGACG GTTACCTGCA CGTGGTCGGC CGTCACGCTC ACGCCGAACG TGCTCGTCTG CCTGAACCTC GGCGGCACCA GCCCGCGCTA TCTGACGAAC GGCTCGAACC AGATGCAGTA CGATCTGTAC CAGGATTCGG GGCACACGGT GAGCTGGGGC TCGTCGTACT ACGGCACGAC GCCGATTTCG CTCACGCTCG TGAAGCCCGC GCTCAGCACG AGCGCGAGTT CGACCGTCAC GATCTACGGC CAGATCGCCG CGAACCAGCC GACCGTGCCG ACGGTCGGCA ACGCGAGCAC CACCTATTCG CAGACGTTCG GCGGCAACAC GACATCGCTG AACTACAGCT TCTACGCGCT CGCGCCGCTG CCGTGCGCGT CGCAATCGTC GTTCGGCACG TTCGCGTTCA CCGCGAGCGC GACCGTCGTC AACGATTGCT TCATCAACGC CACCAACGTC GCGTTCGGCT CGACGGGCGT GATCCAAGGC GCGCTGACGG CGACGGGCAC GATCAGCGCG CAGTGCACGA ACGGCGACGC GTTCCGGATC GCGCTGAACG GCGGCGCGAG CGGCAACGTC GCCGCGCGCG CGATGCAGCG CACGGGCGGC GGCGGGGCCG TCAACTATCA GCTGTATCTC GACGCCGCGC ATTCGACGAT CTGGGGCGAC GGCACGGCCG GCACGTCGAC GGCGACGGGC ACGGGTAGCG GGCTGTCGCA GTCGCTCACC GTGTACGGCC AGGTGCCCGC GCAGACCACG CCCGCGCCCG GCACCTACAG CGACACGATC ACCGCGACGA TCACGTTCTG A
|
Protein sequence | MLEISQKMTI TNIRQRDSKQ WLLALLVAWL LACPWGAAHA ETCSVTTPAP NFGSVDPITL AAVSTTATMT VTCTWSAVTL TPNVLVCLNL GGTSPRYLTN GSNQMQYDLY QDSGHTVSWG SSYYGTTPIS LTLVKPALST SASSTVTIYG QIAANQPTVP TVGNASTTYS QTFGGNTTSL NYSFYALAPL PCASQSSFGT FAFTASATVV NDCFINATNV AFGSTGVIQG ALTATGTISA QCTNGDAFRI ALNGGASGNV AARAMQRTGG GGAVNYQLYL DAAHSTIWGD GTAGTSTATG TGSGLSQSLT VYGQVPAQTT PAPGTYSDTI TATITF
|
| |
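The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch of translation under translation table 11. It assumes that table 11 uses the standard amino acid assignments, and that an alternative start codon such as TTG is read as methionine when it is the first codon of the CDS. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool; only the first and last sequence blocks of the record are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for table 11; each is read as M at the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq, first_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three and last three blocks of the gene sequence in this record.
head = "TTGCTTGAGA TAAGTCAAAA AATGACGATA"
tail = "ACCGCGACGA TCACGTTCTG A"

print(translate(head))                        # MLEISQKMTI
print(translate(tail, first_is_start=False))  # TATITF
```

The two printed fragments match the beginning and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.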