Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3094 |
Symbol | |
ID | 4885642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3036010 |
End bp | 3037851 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640129022 |
Product | glycosyl transferase, group 2 family protein |
Protein accession | YP_001060106 |
Protein GI | 126438818 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTTTG ATATATCCGT CGTGGTGTAT CGAGAGACGG AGGAAACCCT CGACGGTCTT CTCGACAGCC TCGCCGCTCA AGCATCATGC CCGGATACGG TGGTTCGTGT CTGGTTGCGC AACAACGATC CCGCCGATGC CGATCGGTGG GATCGGTTCG TGCACGATCG GTCATGGTAT CCATTCGAGA TTTCGATCTC GCATTCTCCG CAGAACGTGG GCTTCGGGCG CGCGCACAAT GCGACGTTCG AAATGGCCGA CGCTCCGTTT TTCTTTGTCT TGAATCCGGA TACGCGACTG CATTCGACTG CCGTCTCGGC ATTGCGGAAG GCGATAGACA CGTCTGCCGG CGACGTGGGC GCCTGGGAGT TGCGGCAACT GCCGTACGAG CATCCGAAAC TGTACGATCC CGTTTCGCTG AGCACGGACT GGGTGACAGG TGCCGCCGTC GTGTTCCGGC GTGCCGCGTT TGCGCAGGTG CGCGGTTTCG AGCCGCGTAT CTTCATGTAT GGGGAAGATG TCGACCTATC GTGGCGAATG CGCGCCGCGG GCTGGGTATT GCATTATGTG CCGCATGCTG TCGTCGTACA CCCCACGTAT TCGAAACCGA TGGAGGCCAA ACCGCTCCAG ATCGCAGGCG GCGTGGTCGC ATCGTTGCAA TTGCGTACGA GGTTCGGGTC CTGGCTCGAT ATCGCTCGCG GTCTAGGTTG CTGGGTTGCC GAACTCGCGA GGCCGGCGCG CTTCCCGCAT GCGCGGCGCA CGCATCTGAT GGCACTGGCT CGATATCTTC GGAGCGCAGC CTACTTCAGG CGCACGGGGG CACGGTATCG CAAAGGCGGT TTTCGCCCCG GTTTTCGTTT TTGGGGATAT GGTGACCGAC GCGACGGCGC ATTTTTTGCC TTCGCGGTCG AGGAACTCGA CGCGCGCACG GTGCCGCTTG TCTCGATCAT CGTGCGAACG CATCGCCGAC CGGCATTGTT GCGGGAGGCG CTGATGTCGC TGTCGCATCA AACCTATCCG CGTGTCGAAG TCATTGTCGT CGAGGACGGC GAGCCGAATA GCCGCGCGAT GATCGAACGC GAATTTGCAG GGCGCCTTGA TATCCGCTAT GAGGCGACGG GCATGCCGGT AGGCAGGAGT GCCGCCGGGA ATCTGGGCCT TTCGCTCGCC GCCGGGGAAT GGCTGGGATT TCTTGACGAC GACGACCAGT TCTACGCAGA CCACGTCGAG GCAATGATGC AGGTCGCGCG AAGCGGTACG AACCGGGCAG TCTATGGTGC GTCGCACGAG ATTCCGACCG AATTTGCACA ATTGACGGAC GAGGCCGCGA CATATCGCGA AGAGCCCGCG TCGCTCAAGT ATCGGCCATA TTCCCGCCTG GCGATGTGGC AGGAGAACCT TGCGCCGATT CAAGCAGTCC TCTTCCATCG AAGCTTGTAT GACGAATTGG GCGGATTTGA CGAGGACCTC GATCAACTCG AAGACTGGGT GCTGTGGGTG CGCTATTCGT GTGCGACTGA CTTCTCTTCG TTCCTGCGGG TGACATCACG CTACCGGGTG CCCATGGCGG CCAAGGTTGC CGTTGAGCGT CAGGCCAAGC TGCATGAAGC CTATGCCGTC GCCCTGGAGC GACAGCGAGC GATGCGAGTG ACGCTTAGCC CGTTCGACGT TGTCGCCATG GCGGAAGAGC AGGCCCGTCG GCATGCTATC GTCCACGTTT CGAGGCAAAC CGCGCGAAAG CTGATCGTGC GAGTGCCGTT CATGCGAACG TTGTTATCGA GCCAGGCGGG ATGGCGACGG CGCATGAGAG CGCTATATCG TCGAATGTCG CCGCGCTCCT GA
|
Protein sequence | MRFDISVVVY RETEETLDGL LDSLAAQASC PDTVVRVWLR NNDPADADRW DRFVHDRSWY PFEISISHSP QNVGFGRAHN ATFEMADAPF FFVLNPDTRL HSTAVSALRK AIDTSAGDVG AWELRQLPYE HPKLYDPVSL STDWVTGAAV VFRRAAFAQV RGFEPRIFMY GEDVDLSWRM RAAGWVLHYV PHAVVVHPTY SKPMEAKPLQ IAGGVVASLQ LRTRFGSWLD IARGLGCWVA ELARPARFPH ARRTHLMALA RYLRSAAYFR RTGARYRKGG FRPGFRFWGY GDRRDGAFFA FAVEELDART VPLVSIIVRT HRRPALLREA LMSLSHQTYP RVEVIVVEDG EPNSRAMIER EFAGRLDIRY EATGMPVGRS AAGNLGLSLA AGEWLGFLDD DDQFYADHVE AMMQVARSGT NRAVYGASHE IPTEFAQLTD EAATYREEPA SLKYRPYSRL AMWQENLAPI QAVLFHRSLY DELGGFDEDL DQLEDWVLWV RYSCATDFSS FLRVTSRYRV PMAAKVAVER QAKLHEAYAV ALERQRAMRV TLSPFDVVAM AEEQARRHAI VHVSRQTARK LIVRVPFMRT LLSSQAGWRR RMRALYRRMS PRS
|
| |