Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2761 |
Symbol | |
ID | 4887777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2631985 |
End bp | 2633175 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132697 |
Product | sulfotransferase domain-containing protein |
Protein accession | YP_001063753 |
Protein GI | 126443187 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCATG CAGCAACCGG CGCGGCCATC GGCACGCCCA TCGGCGCGCC CATCGGCGCC GTTCGTCCCC GCCCCGTGCT GATGATTCCG CTGCGGCGCT GCGGCAGCCA CGCGCTGCGG CTGCGGCTGA ATCTCAATTC GCAGTTCTAT TCGCCATATC CGCTGCACAT CGTCGACTTC ATGCCGCTGT TGCCGCTGTA CGGCGATCTC GCCGACGACC GCGCGTATTT CCGGCTCGTC GCCGACATCG TCGGCCTGCA GGCGGCGAGC ATGGTCAAGT GGCCCGGCGT CGCGTTCGAT CCGGTCGAGA TCTTCGACGC GCTTCGGCAC GCGCCGCGCA GCGCCCATCG CATCGTCTGG GAGCTGCTGC TGCGCGCGGG CGAGCGCGAA GGCGCGCGCG TCGTGATGGA CAAGTCGCTC GACAGCGTGC ACTACGCCGA CGAGCTGATG ACGCTGTATC CGGACATGCT GTTCCTGAAC GTCGTGCGCG ATCCGCGCGC GCAGGTCGCG TCGATGAACC GCGCGATCAT TCACGATTTC GATACGCTGC TCAACGCGCA GGCGTGGGTG GCCGCGCATC GCGCGGCCGA TGTCCTGATC GCGCGCCATC CGCAGCGCGT GCTGACGATT CGCTACGAGG ATTTCCTGTC GGATCAGGCG CACACGTTGC AGCGCGTATG CGCGTTCTTC GGCATCGATT TCCTGCCGCG GATGCTCGAC ATCGCGAATT CGCCGGAGGC GCGGCATATC TCGCGCATGT CCGAGCTGTG GGCGTCGAAC TGTTTCGCGC CGATCGCCGC GAATGCGGAC AAGTTCAAGC AGCAGCTATC GACTGCCGAG ATCGCGACGA TCGAGACGCT CGCGCACGAA TACATGCAAC GCTACGGCTA TCAGCAGATG ACCGACGCGA CCGCGATGCC CGACGCGTTC GCCGCCGCCG CCGCGCGCCG CCGCTCCGAC GCGCGGCGAC GGCACGCATG GCGCGAGCTC GAGCAGTCGA ATTTCCGTGA TTTCGTGCTG CGCCGGCATC GCGCCGACTA TCTCGAGACG GTGCGCGCCC GCTTGCAGCG GCATGCGAGC GCGCAGGCGG ATTCGCGTGC CGATTTGCGT GCCGATTCGC CGGCCGATTC GCCGGCCGGC GCGCCCGGGC GGCGCGATAC GCTGACCGCG GCCTTCGACG TAACCGACTG A
|
Protein sequence | MTHAATGAAI GTPIGAPIGA VRPRPVLMIP LRRCGSHALR LRLNLNSQFY SPYPLHIVDF MPLLPLYGDL ADDRAYFRLV ADIVGLQAAS MVKWPGVAFD PVEIFDALRH APRSAHRIVW ELLLRAGERE GARVVMDKSL DSVHYADELM TLYPDMLFLN VVRDPRAQVA SMNRAIIHDF DTLLNAQAWV AAHRAADVLI ARHPQRVLTI RYEDFLSDQA HTLQRVCAFF GIDFLPRMLD IANSPEARHI SRMSELWASN CFAPIAANAD KFKQQLSTAE IATIETLAHE YMQRYGYQQM TDATAMPDAF AAAAARRRSD ARRRHAWREL EQSNFRDFVL RRHRADYLET VRARLQRHAS AQADSRADLR ADSPADSPAG APGRRDTLTA AFDVTD
|
| |