Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2051 |
Symbol | |
ID | 4900974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2033402 |
End bp | 2034439 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640135281 |
Product | hypothetical protein |
Protein accession | YP_001066316 |
Protein GI | 126452099 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.350134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTACTTCG AAAGCGGGAA CTCTCCGACC TACAATTCGA ATATCCCTTA TATCGATGTT CAGATTTGCG TACCGGGAAC GAGCCAATGC GTCAGGGTCG ATCACGTGCA AGTCGATAGT GGATCGACCG GACTGCGCAT AATGAAGTCG GCCTTGAATG GATTGACTCT CCCATTGCAG GCCAATACGA GCGGTGCGCA GATCAGCGAG TGCATGGCGT ATGCCACCAG CGAAGTCTGG GGGCCGGTTG CCCGTAGCGA CGTTTACGTG GGCGGAGAAT TCGGCGGTAA CCAGACGCTA CAGGTTATTG ACGATAGCGC TTCGGCGAGC GTCCCCGCCA GTTGCAGCTC ACAGGGGGCA CTGACCGACA CGACGAGCAG CTTTGGCGGC AAGGGAATCA TCGGAATCAA TCAGCTGTTT AGCGATGGCG GCAGCTACTA CGCATGTACC GGCACCGGAT GCTCGCAGCT AGACAGCAGT CCCGTGACGG TCACCAATCC GGTGGCTGGC TTCGCCCAGG ACAACAATGG CGTGTCGATC CAAATTCAGC CGATTTCGGA CCTCGGTGCG ACGTCGGCAT CCGGACAACT GATCTTCGGT ATCAACACGG CAAGCAACAA TCGAATCGCT GGGGCAGCGA TACTGTACAC GGACAAGAAC GGCGATCTCT CGGTCAACAC GGGTACGCGT ACCATGCCGG GCTTTATCGA TAGCGGGACA GCGAGCTATT TCTTTCCGGA CGACGGCAGC ATCCCAATCT GCTCCGATAA CCGGAATTGG TATTGCCCGC CCTCGCCGAT TAGCGTGAAC GTCGACGTCA TCGCCGGCGA CGCGAGCGTG GATCGCCCAT CGTCGTATCG GCTATCAAGC TACGACAGTA TTTCCGCGGA CATGGAGGTC GCGCCGACTG GGGAAAGCGG CTCCGTGTTC AACGATGGCG CTTCTTGGTA TGTGCTCGGC CTGCCGTTCT ATATCAATCG CCGCGTCTAT GGGTCAATTC AGCAGAGCGA CAGCGACCCG ATGTACTACG CGTTTTAG
|
Protein sequence | MYFESGNSPT YNSNIPYIDV QICVPGTSQC VRVDHVQVDS GSTGLRIMKS ALNGLTLPLQ ANTSGAQISE CMAYATSEVW GPVARSDVYV GGEFGGNQTL QVIDDSASAS VPASCSSQGA LTDTTSSFGG KGIIGINQLF SDGGSYYACT GTGCSQLDSS PVTVTNPVAG FAQDNNGVSI QIQPISDLGA TSASGQLIFG INTASNNRIA GAAILYTDKN GDLSVNTGTR TMPGFIDSGT ASYFFPDDGS IPICSDNRNW YCPPSPISVN VDVIAGDASV DRPSSYRLSS YDSISADMEV APTGESGSVF NDGASWYVLG LPFYINRRVY GSIQQSDSDP MYYAF
|
| |