Gene Information |
Locus tag | BURPS668_A2226 |
Symbol | celA |
ID | 4885938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2153358 |
End bp | 2155898 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640132163 |
Product | cellulose synthase, catalytic subunit (UDP-forming) |
Protein accession | YP_001063220 |
Protein GI | 126443249 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03030] cellulose synthase catalytic subunit (UDP-forming) |
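The coordinate and length fields above are tied together by simple arithmetic. The sketch below checks them, assuming the Start bp and End bp values are 1-based and inclusive (the listed values are consistent with that) and that the gene length includes the stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2153358, 2155898)
print(length)                  # 2541
print(protein_length(length))  # 846
```

Both results match the Gene Length (2541 bp) and Protein Length (846 aa) rows of the record.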
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.467064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGC GCGCGGCATG GGGCCGGCGA ATCGGCGGCG CGGCGGCGCG CGTGCGCGGC TGGATCGCGC GCGGTCTCGG CGTGCCCGCC AAGGGTTCGT CGTTCGACTG GCTCGTGCGC GTGTTCTTTC ACGCGCCCGC GCCGGGCAGG CGCGATGTCG TGCGCGACGG GCTGCGCGCG GCGATTCTGT GGGCCGCCGC GCAATGGGGC GTGACGCAGC CGCGCCGCGC GCGCGATTGG CTCTGGCGCG CGTTCGTGCG CGCGCCGCAT GAAAGGCGAT CGCGCGCGCG CGATCCGTTC GCATGGATCG ACGCGACGCT CGTGCCGGCG TTCATGTTCG CGCGCCGCGC GCGGCGGCGC GTCGACGCCT GGCTCGCCCG TCTGCCGTGG CACCGCTGGG GCGCGCGCCT CGAGCACGGC GCGCAGCGGG CCGTCGCACG CCGCTGGCTG TTGCCGGCGA GCGCGGCGGC GGGCGTTGCG CTGTGGCTCG CGGCCGGCAC GTCGCCGCTC GCGCCCGCCG GCCAGTTCGC GTTCTTCGCG ACGCTCGCCG CGCTCGCGCT CGCGCTGCGC CGCGTGCCCG GCCGCTTGCC GACGCTCGCG CTCGCGACGT TCGCGCTGCT CGCGATGGCC CGCTACATCT GGTGGCGCAG CACCGAGACG CTCGATTTGC GCACACCGGT CGAGGCGTGC GTCGGCTATC TGCTGTACGC GGCCGAGGCG TACACGTGGC TCATCCTCGT GCTCGGCTTC GTGCAGACCG CGTGGCCGCT CGAGCGCCCG GTCGCGCGCC TGCCGGACGA TCCCGCCGGC TGGCCGAGCG TCGACGTCTA CATTCCGACG TACAACGAGC CGCTCGCGGT CGTGAAGCCG ACGATCTTCG CCGCGCAAAG CCTCGATTGG CCGGCGGACA AGCTCAACGT CTATCTGCTC GACGACGGCC GCCGCCCGGA GTTCGAGGCG TTCGCGCGCG ACGCGGGCAT CGGCTATCTG ACGCGCGACG ACAATCGCCA CGCGAAGGCC GGCAACATCA ATAGCGCGCT TGCGCGCACG CACGGCGAAT ACATCGCGAT CTTCGACTGC GATCACGTGC CGACGCGCTC GTTCCTGCAG ACGACGATGG GCGCGTTCCT TCGCGATCCG AACTGTGCGC TCGTGCAGAC GCCGCATCAT TTCTTCTCGC CGGACCCGTT CGAGCGCAAC CTCGGCACGT TCCGCCGCGT GCCGAACGAG GGCAGCCTGT TCTATGGCCT CGTGCAGGCG GGCAACGATC TGTGGAACGC CGCGTTCTTC TGCGGTTCGT GCGCGGTGCT CAAGCGCGGC CCGCTCGAAG CAATCGGCGG CGTCGCGATC GAGACCGTCA CCGAGGATGC GCACACCGCG CTCAAGCTGC ACCGCCGCGG CTACACGAGC GCCTATCTGC CGACCGTGCA GGCCGCCGGC CTCGCGACCG AAAGCCTCGC GGGCCACATC CGGCAGCGCG CGCGCTGGGC GCGCGGGATG GCGCAGATCT TCCGGATCGA CAACCCGTTC GTCGGCCGCG GCCTCGGCTT CTTCCAGCGC GTGTGCTACG GCAACGCGAT GCTGCATTTC TTCTACGGCA TCCCGCGGCT CGTGTTCCTG ACGATGCCGA TCGCGTATCT GTTCTTCCAT TTGTATTTCA TCAACGCGTC GGCGCTCGCG CTCGCGAGCT ACGTGCTGCC GTACATGGCG CTCGCGCACA TCGCGAACGC GCGGATGCAG GGGCGCTTTC GGCATTCGTT CTGGGCCGAG GTGTACGAAT CGGTGCTCGC GTGGTACATC 
GCGCTGCCGA CGACGCTCGC GTTCCTGAGC CCGAGGCACG GCCGCTTCAA CGTGACCGCG AAGGGCGGGC GCATCGACGA AGGCTACGTC GACTGGGCGA CGTCCAGGCC CTATCTCGCG CTGTTCGTGC TGAACGTCGC GGCGATCGCG GCGGGCGGCG TGCGTCTCGC CGCGAGCGGC GGCGACGAAG CGTCGACGAT CCTGATGAAC GTCGCGTGGG CGTTGTACAA CCTCGCGATG CTCGGCGCGG CGCTCGCGGT CGCGCGCGAG GCGAAACAGG TGCGCGTCAC GCACCGGGTC GCGATGCGGG TGCCGGCGAC GCTGTTGCTC GCCGACGGCA CGACGGCCGC GTGCCACACG AAGGATTATT CGGCGGGCGG CTTGGGGCTC GACGCGGTGC CGCTCGCGCG CATCGCGCTC GGCGATACGC TCGACGTATG CGTGAGCCGC GGCGATCGCC CGTTCCATTT TCCGGTGCGC GTGACCCGCG TCGACGCCGC GCATCTCGGC GTGCAGTTCG AGCCATTGAC GCTCGAGCAG GAGCGGCAGC TCGTGCAGTG CACGTTCGGC CGCGCGGATG CGTGGCTCGA CTGGCGCGAC GCGCGCGCCG AGCACGACGA TGCGCCGCTG CGCGGGCTGA AGGAGGTGCT GTCGATGGGG TTCGAAGGCT ACGCACGGAT GCTGCGGGCC GCGACGAGCG CGGTGCGGGC GCGGCGCGCG GCCGACCGGA CGCGTCATTG A
|
Protein sequence | MTARAAWGRR IGGAAARVRG WIARGLGVPA KGSSFDWLVR VFFHAPAPGR RDVVRDGLRA AILWAAAQWG VTQPRRARDW LWRAFVRAPH ERRSRARDPF AWIDATLVPA FMFARRARRR VDAWLARLPW HRWGARLEHG AQRAVARRWL LPASAAAGVA LWLAAGTSPL APAGQFAFFA TLAALALALR RVPGRLPTLA LATFALLAMA RYIWWRSTET LDLRTPVEAC VGYLLYAAEA YTWLILVLGF VQTAWPLERP VARLPDDPAG WPSVDVYIPT YNEPLAVVKP TIFAAQSLDW PADKLNVYLL DDGRRPEFEA FARDAGIGYL TRDDNRHAKA GNINSALART HGEYIAIFDC DHVPTRSFLQ TTMGAFLRDP NCALVQTPHH FFSPDPFERN LGTFRRVPNE GSLFYGLVQA GNDLWNAAFF CGSCAVLKRG PLEAIGGVAI ETVTEDAHTA LKLHRRGYTS AYLPTVQAAG LATESLAGHI RQRARWARGM AQIFRIDNPF VGRGLGFFQR VCYGNAMLHF FYGIPRLVFL TMPIAYLFFH LYFINASALA LASYVLPYMA LAHIANARMQ GRFRHSFWAE VYESVLAWYI ALPTTLAFLS PRHGRFNVTA KGGRIDEGYV DWATSRPYLA LFVLNVAAIA AGGVRLAASG GDEASTILMN VAWALYNLAM LGAALAVARE AKQVRVTHRV AMRVPATLLL ADGTTAACHT KDYSAGGLGL DAVPLARIAL GDTLDVCVSR GDRPFHFPVR VTRVDAAHLG VQFEPLTLEQ ERQLVQCTFG RADAWLDWRD ARAEHDDAPL RGLKEVLSMG FEGYARMLRA ATSAVRARRA ADRTRH
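The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation such as the one behind the GC content row. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence shown above (illustrative only).
fragment = clean_sequence("ATGACGGCGC GCGCGGCATG GGGCCGGCGA")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # 0.8
```

Applying the same two functions to the complete gene sequence would give the gene-wide value to compare against the reported 71%. Note that the displayed sequence begins with ATG and ends with TGA, so it is already shown in coding orientation even though the gene lies on the minus strand.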