Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0996 |
Symbol | |
ID | 4905443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 966223 |
End bp | 968196 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640144102 |
Product | glycosyl transferase, group 2 family protein |
Protein accession | YP_001075032 |
Protein GI | 126457212 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.231155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATGT TCGACTTCTT TCTCGCCAAC CGCTCGCTCC TCGACATCAA CGGCGGCGTG CTGCTCGTCG TGCTGCTGCT CACGCGCGCC GGCAGGCGCG AACGCGCGGC CGACCGGATC GTGTTCGGCG GCGTGACTGT CGTGCTGCTG CTCGTCTATC TGGCGTGGCG CGCGACGCAG ACGCTGCCCG AGCCGCGCAT GGATTTTCCG AGCGCATGGG CGCACGTGTT CTTCGGTTTC GAATGCATGG CGCTCGCGTA TACGCTGATC TCCGTCGCCG TGCTCACGCG CACGACCGAG CGCACGCCGC AAGCCGACGC CGGCGAGGCC GCGCTGCGGC GCGCGGCCGC GCCGCCGCGC GTCGACATCT TCATCGCGAC CTACAACGAG GGGCTCGACA TCCTGGAGAA GACGATCGTC GCCGCGCTCG CGATCGACTA TCCGGATTTT CGCGTGTGGG TGCTCGACGA CACGCGCCGC GACTGGCTCA AGGCGTTCTG CGACCAGGTG GGCGCGCGCT ACGCGACGCG CGCGGACAAC GCGCACGCGA AGGCCGGCAA TCTCAACAAC GGCCTGCGCC TGAGCGCGCG AAGCGAAGGC GGCGGCGCGC CGTACATCAT GGTGCTCGAC GCCGATTTCG CGCCGCACCG CAAGATCCTG CTGCGCACGG TCGGGCTCTT CGCCGATCGC ACGGTCGGCA TCGTGCAGAC CCCGCAGTTC TACTACAACG CCGATCCCGT GCAGTACAAC CTGCGCTCGG CCGAATGCTG GGTCGACGAG CAGCGCGCGT TCTTCGACGT GATGCAGCCC GCGAAGGACG CATGGAGCGC GGCGTTCTGC ATCGGCACGT CGTTCGTCGT GCGACGCGAT CTCGTCGAGC GCATCGGCGG CTTTCCGACC GGCACGGTCA CCGAGGACAT CCACCTCACG TATCGGCTGA TGCGGCACGG CTACGTGACG CGCTGGCTCA ACGAGCGGCT GTCGGTCGGC CTGTCGGCCG AGGGGCTGCC CGAATACATC AGCCAGCGCA GCCGCTGGGC GCTCGGCACG ATCCAGGTCG CGCTGACGCC GGACGGCCCG CTGCGCGGCC GCGGCTACAC GTTCGCGCAG CGGCTGCACT ACGTGCACGG CATGCTGCAC TGGCTGAGCC GCCCGTTCAC GCTGATGCTG CTGCTCGGCC CGCTCCTGTA CTGGTACTTC GACATCCCGA CGCTCTACGG CGAGCCGCTG CAGTTTCTCG CCTACGGCCT GCCCGCGCTG ATCGCCTACT GGGGCTACAG CATGTGGATC ACCGGCCGGC GCGCGCTGCC CGTGTTCACC GAAGTCACCC AGATCGTCTG CGCGCTCGCC GTCAGCCTGT CGCTCGCGAG CGCGCTCGTG CGGCCGTTCG GTCGCCCGTT CAAGGTGACG AACAAGGGCC TCGACCGCTC GAAGCTGATC GTCCACGGCA GGTTCGCCGC GTTCTACGCG GGCCTGCTCG TGCTGTCCGC GCTCGGGCTC GGTCGCGCGC TCGCGAGCGC GCCCGGCGCG CCGGGGCTCG CGTTCAACGC CGCGTGGACC GCGATTTCGC TCGCGCTCTA TCTCGCGTCG CTGCTCGTGT GCATCGAGCT GCCGCGGCCG CGCAGGGAGG AGCGCTTCCC GTATCGCGCA CGGGCGCGGC TGCGCGTCGG CGATCGCGAG TACGCGGTCG CGACGCGCGA TCTGTCGTGC AACGGCGCCG CCGTGACGAC GGCGCAGGCC GCGTCGCTGC CGCTGCATGC GGCGGGCGCG CTGTGGCTCG CGCCGATCGG CTGGATCCCG TGCCGCATCG TGCGTCGCGA CGGCGCGCTG CTCGGCGTCG CGCTCGGCGC GGACACCGCC GCGCGGCACG GCCTGATCCG GCTGTTGTTC GGCGATCCGC CTCACAACAT CGCGATTCGC GGCCGACCCC GGCTCGCGAT CTCGCGCCTG ATGCAGCGCG CGCTGTCCGG CTGA
|
Protein sequence | MTMFDFFLAN RSLLDINGGV LLVVLLLTRA GRRERAADRI VFGGVTVVLL LVYLAWRATQ TLPEPRMDFP SAWAHVFFGF ECMALAYTLI SVAVLTRTTE RTPQADAGEA ALRRAAAPPR VDIFIATYNE GLDILEKTIV AALAIDYPDF RVWVLDDTRR DWLKAFCDQV GARYATRADN AHAKAGNLNN GLRLSARSEG GGAPYIMVLD ADFAPHRKIL LRTVGLFADR TVGIVQTPQF YYNADPVQYN LRSAECWVDE QRAFFDVMQP AKDAWSAAFC IGTSFVVRRD LVERIGGFPT GTVTEDIHLT YRLMRHGYVT RWLNERLSVG LSAEGLPEYI SQRSRWALGT IQVALTPDGP LRGRGYTFAQ RLHYVHGMLH WLSRPFTLML LLGPLLYWYF DIPTLYGEPL QFLAYGLPAL IAYWGYSMWI TGRRALPVFT EVTQIVCALA VSLSLASALV RPFGRPFKVT NKGLDRSKLI VHGRFAAFYA GLLVLSALGL GRALASAPGA PGLAFNAAWT AISLALYLAS LLVCIELPRP RREERFPYRA RARLRVGDRE YAVATRDLSC NGAAVTTAQA ASLPLHAAGA LWLAPIGWIP CRIVRRDGAL LGVALGADTA ARHGLIRLLF GDPPHNIAIR GRPRLAISRL MQRALSG
|
| |