Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2106 |
Symbol | |
ID | 4901868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 2097954 |
End bp | 2099003 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640135336 |
Product | hypothetical protein |
Protein accession | YP_001066371 |
Protein GI | 126453392 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.588734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCG TTGACGAAGA CGACATCGGC ACGGCGAGCG GCCGCGACGA AGGCGACTGG GTGCCCAACC GGTTTTGCTT GCGCAACGCC TGGTTTCCCC TCGCGCATAC GTTCGAAATC GGCGAGCGCG CGTCGCGCTG GCAGATCTAC TCGCAGCCGT GCTATCTGTG GCGCGCACGC GGGCGCATCC ATGCATCGCG CCGGCATCCG GACCTGCCCG CCGCCCCCGC CACGCCCGCC ATGCCCGCCG CGCCGGACTC GCCGTTCGAG CCGCCCGAGC GCTATCCGGT GGTCGAGCGA TTCGGCTACG TATGGATCTG GTACGGCGAC CCGGAGCACG CGAGCGACGC GCTCGTGCCC GACGTGCCGT TCCTGCCGCG CGAAGGGGGG CTGCCCGAGC GCATGCAGGG CAACATCCGG CTCGACTGCT GCACGCCGCT GCTCGTCGAG AACCTGCTCG ACCTGACGCA CGCGGACTAT CTGCACGCGA ACCTGCTCGG CGACGAGCAA TCCGAAGAGG ATCGCGTCGA CGTGCGGTTC ACCTCCGAGA CGGTGACGAT GATCCGGCAG TGCACGAACA AATCGATCGC GCCGATCATG CGCTGGTTCG GCGGCGTGCG CGCGAAGTAT CAGGACGTTC ACGTCGTGAT CCACGTGCAT GTGCGCAGCT CCGTCGCGGT CGCGTACGGA CGCTACATGC CGGGCATCGA TCTGCCGATC TTCCACCCGT GCGTGCCGGA ATCGCGCGAC CGGTGCCGGC TCAGCTTCGC GTTGAACATG ACGCGAACGC CGTGGCTGCT GCGCGCGCTG ATGCCGCTCA CGCCTTACAT CGTGCTGCCG CAGGACAATC GCATGATCGG CCCGCAAAGC ACCCGCTACC GGGATGCCGG CGAGCGCCGC GATCTGTATT CGCGCTTCGA CCGCGCGGGG CTGCGGTATC GGCTCCTGCT GCAGCAGCTC GCCCGGCGGC AGCGCGACGG CGATTTCTCG TACGCCCCCG ATGCGCTGCC CGGCCAGGAC GCGCGCGGCA TTCTCGGCAT GCCGGACTAG
|
Protein sequence | MATVDEDDIG TASGRDEGDW VPNRFCLRNA WFPLAHTFEI GERASRWQIY SQPCYLWRAR GRIHASRRHP DLPAAPATPA MPAAPDSPFE PPERYPVVER FGYVWIWYGD PEHASDALVP DVPFLPREGG LPERMQGNIR LDCCTPLLVE NLLDLTHADY LHANLLGDEQ SEEDRVDVRF TSETVTMIRQ CTNKSIAPIM RWFGGVRAKY QDVHVVIHVH VRSSVAVAYG RYMPGIDLPI FHPCVPESRD RCRLSFALNM TRTPWLLRAL MPLTPYIVLP QDNRMIGPQS TRYRDAGERR DLYSRFDRAG LRYRLLLQQL ARRQRDGDFS YAPDALPGQD ARGILGMPD
|
| |