Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0692 |
Symbol | |
ID | 4888785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 662701 |
End bp | 664362 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640130632 |
Product | hypothetical protein |
Protein accession | YP_001061691 |
Protein GI | 126442783 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.850291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGCGC GCCGCGCCGT TGCGCTCGGG GGCTGGGCGC TGATGTCGGT CCTCGCGGCG GCTCGCGCGC ACGCCGCGGA TGCCATGGAT GCGCCCGGCG CCGCCGCCGC GGCGGCGGCG CCGGGATCGA GGTTCGCACG TGTGCCGGCG ACATCCGAAT CCGCGCCCTC GTCCGCGTTC GAATTCGCGT CCCCGGCCGC GTCCGCGTCC GCGCTTGCGG TCGTTCCCAT GTCCGCTCTC ACGCTCGCGT CCGCGCCTGC GCCCGCCCCC GCGCCATCGG CTGCGCCGTC GCCTTCGCCC ACATCGCCGC CCGTGCAGCC GGGCGTCCGG CCGGGCGCGT CATGCGCGAA GACACGCCCC GCGACCCTGT TCAACCGCTG GCAGGAAGAC TGGTCGGCGC TCGCGGACCC GTGCGTGCCG CGCCGGCCGC TCGACGCGCT CAAGTACGTG CCGCTCTTCG GCCGCGTCGA TTCGTATCTG TCGCTCGGCG CGGGGCTGCG CGAGCGGCTC GAGGTCAACG ACGCGCCGCT CTTCGGCCTC GGCCGCGCGC GCGGCGATAC CTACGTGCTG CAGCGCGTGC AGGTGCATGC GGACCTGCGC ATCGCCGGGC ACGTGCAGGC GTTCGTGCAA CTCGAGGACG CGCGCCCGTT CGGCAAGGAC AACGTCGGCC CCGTCGATCG CAATCGCGTC GATCTGCGTC AGGCGTTCGT CACCTACGTC GACGCGATCG GCTCCGGCGC GTTCAAGGCG CGGGTCGGCC GCCAGGAGAT GGCGTTCGAC TTGCAGCGCT TCGTGTCGGT GCGCGACGGC CCGAACGTGC GCCAGGCGTT CGACGGCATC TGGGCCGATT GGGAGCAGGG GCCGTGGCGC TTGATCGGCT ATGCGACGCA GCCCGTCCAG TATCGCGACG ACGGCGCGTT CGACGACGTG TCGAACCGCA ATCTCACGTT CAGCGGCGTG CGTATCGAGC GGCAGCGCGT GGGGCCGGGC GACCTGTCCG CGTATTACTC GCGATACAAC CGCACGCAGG CGCAGTTTCC CGACGGCGCG GGCGGCGAAC ATCGCGACGT GTTCGACGTC CGCTACGCGG GCAAGCGGCG CAATGTCGAT TGGGACATCG AAGGGATGTA TCAGACGGGC CGCGTCGGCG CGCAACGCAT CGAGGCGTGG GCGGTGGGCT CGCTCGCCGG CTATACGTTC GCCGGCGTCG GCTGGATGCC GCGCATCGGC TTGCAGGTGG ATGCGGCGTC GGGCGACCGC CGTCCGCGCG ACGGCCGGAT CGAGACGTTC AATCCGCTGT TTCCGAACGG CTATTACTTC GCGCTTGCCG GCTACACCGG CTACACGAAC CTGATTCACG TGAAGCCGTC GCTCACGCTC AAGCCAAGCA GCGCGCTCAC GCTGCTCGCG GCGGTGGGCT TGCAATGGCG CGCGACGACG GCCGACGCGG TCTACGCGCA GGGCGCGACG CCCGTGCCGG GCACGGCGGG CCGGGGCGGC AACTGGACGG GCTTCTACAC GCAGTTGCGC GCGGACTGGG CAGTGACGGC GAATCTGGCG GCGGCGCTCG AAGTCGTGCA TTTCCAGATC GGCGACGCGC TTCGCGCGGC GGGCGGGCGC AATGCGGACT ACGTCGGCGC GGAGCTGAAG TTCGGCTGGT AG
|
Protein sequence | MIARRAVALG GWALMSVLAA ARAHAADAMD APGAAAAAAA PGSRFARVPA TSESAPSSAF EFASPAASAS ALAVVPMSAL TLASAPAPAP APSAAPSPSP TSPPVQPGVR PGASCAKTRP ATLFNRWQED WSALADPCVP RRPLDALKYV PLFGRVDSYL SLGAGLRERL EVNDAPLFGL GRARGDTYVL QRVQVHADLR IAGHVQAFVQ LEDARPFGKD NVGPVDRNRV DLRQAFVTYV DAIGSGAFKA RVGRQEMAFD LQRFVSVRDG PNVRQAFDGI WADWEQGPWR LIGYATQPVQ YRDDGAFDDV SNRNLTFSGV RIERQRVGPG DLSAYYSRYN RTQAQFPDGA GGEHRDVFDV RYAGKRRNVD WDIEGMYQTG RVGAQRIEAW AVGSLAGYTF AGVGWMPRIG LQVDAASGDR RPRDGRIETF NPLFPNGYYF ALAGYTGYTN LIHVKPSLTL KPSSALTLLA AVGLQWRATT ADAVYAQGAT PVPGTAGRGG NWTGFYTQLR ADWAVTANLA AALEVVHFQI GDALRAAGGR NADYVGAELK FGW
|
| |