Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_2009 |
Symbol | |
ID | 4789678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 2062325 |
End bp | 2067049 |
Gene Length | 4725 bp |
Protein Length | 1574 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | cellulose synthase operon protein C |
Protein accession | YP_001025805 |
Protein GI | 124382561 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.175736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGGCGG GGCCGCAGGC GAGGCGGCGC GCGCGGCGCG GCGCGGGCTG CGCCGGGATC GCGGCCGTCG CGGCGATCGC GTTCGCCGTG CCATGCTTCG CGTCCGCCGC GGCGGACGAC GTTGCTGCGC GGTGGGCGGC ATCGGAACGA TCGCTCGCCG ATGCGCACGC TTGCGCCGGT GTGCGCCCGG CCGCGCCCGG ATGGGCGGTT TCGATGGAGG CGGCGGCGCC GCCGCCGCGG TTCGTGCCGC CGGGCCCGAA TACGTCGCCG GGCGCGATGC AGGCGAATGT CGAAGCGATG TCGCCGGCCG CGCCGGCGCG GGCCGCCGCG ATGACGCGAG CGTCGTCGGC GCGGCTTGAG CCGGCTCCGA GCGCGTCTGT CGATGCGGCT TCGGCGAATG TCGAGGCGAT GCCGCCGGCC GCGCCGGCGC AGGCCGCCGC GATGACGCGA GCGTCGTCGG CGCGGCTTGA ATCGGCTCCG GGTGCGTCTG TCGATGCGGC TTCGGCGAAT GTCGAGGCGA TGCCGCCGGC CGCGCCGGCG CGGGCTGGGG CGATGGCGCG AGCGTCGTCG GCGCGGCTTG AGCCGGCTCC GAGCGCGTCT GTCGATGCGG CTTCGGCGAA TGTCGAAGCG ATGCCGCCGG CCGTGCCGGC GCAGGCCGCC GCGATGGCGC AGGCGTCGCC TGTTCGGCCT GCGTCGGTGC CGCGCGCGTC GTTCGATGCG CCGTCCGCGC AGGCAGGCTC GATGCCGCCG GACACGGCGC ATCGTCACGC GTCGGCGGCC TCGTCGCTCG CGCGCACGAC CGCGCCGGCG TTGCCGTTCG CGCCGGCCGG CTTCGCGTCG GCCGCGCTTC GCGCGTTCGC GCCGACGTAT CCGATGTCGC TCATGTCGCC CATGTCGCCG AAGCAAACGC CGCCGGTCCG CTTCGTGCCC ACGTCGCCGC GTGGCGCGCC GGCGACGCCG CAGCCCGTCG CGACGCCGCA CGCGCCCGAC GCTGATGCCG TGCGCGCGCT GTACGCGACG GCGCGCATGT GGGCGAACAA GCATCGCGAC GACCTCGCGC GCGACGCGCT GCGCAAGGCG CTGCTGATCG CGCCGCGCGA TCCGGCGCTG CTCGCCGAGC ACACGCGGAT CCTGCTGCGC GTCGGCGACG CGAAGGGCGC GCGCGCGTCG CTCGAACGCC TGAAGCAGGC GGCGCCCGGC GCGCTCGCGA CGCGCCAGGT CGACGACGAA TACCGCGTCG CGACGAGCGG CCGCGAGGAG ATGGCGCAGA TTCGGCTGCT CGCGCGCAGC GGGCGCGGTG ACGAAGCGGC CCGCCGCATC GTCGCGCTGT TCCCGCACGG CGCGCCGGCG GGCTCGCTCG GCGCCGAGTA TTACCAGATC GTCGCGAGCG CGCCCGATGG CCGCGCGCGC GCGATCGACG CGCTGCGCCG GGCCGTCGCC GCCGATCCGG CGGACGCCGA CGCGGCGACG GTGCTCGCGA AGCTGCTGAA CCAGCGCGAC GACACGCGCG CGCAAGCGAA CCGGCTCGCG TGGGCGCTCG TCGCGCGGCC GGATACGGAC CGGCGCGCCT CGCTCGCGCT GTGGCGCAGC GTGCTGCAGT CGGCCGGGCG CGACCTCGCG TATCTCGACG CATTGCGCGC CTATCTGACG TTCGCACCCG AGGACGACGA GTTTCGCGGC AACGCCGCCG CGCTCGAGCA GCAGCGCGAC GCGCGGCTTC GCCTCGCGCG CGATCCGGAC TACATCCCGC AGCAGCGCGG CCTGCAGGCG CTTGCGCGCG GCGATCCGGC GGCCGCCGAG CCGCTCCTCG CGCGTGCCGC GGCCGCGCGC GCGGGCGATC CGGACGCGCT CGGCGGCCTT GGGCTGTTGC GGCTGTGGCA GGGCCGCCAC GACGAAGCGC GCGCGCTGTT CGTGCGCGCG GCCGCGCTCG CGCCGGACAA TCGCGCGAAG TGGGTGGGGC TCGCGGGCAC CGCGCGCTTC TGGGGCACGC TTGCGCAGGG CCGCGCGGCC GCCGCGCAAG GGCGCCCGGA CGACGCGCAG CGCGCCGCGC GCGCGGCGCT CGCGCTCGAT CCGCAGAGCG CCGACGCGAA GCTGCTGCTC GCCGATTCGC TGCTCGCGCG GCGCGACTGG CGCGCCGCGC AGCCGCTGCT GCGCGCGCTG CTCGACGCGC GCGCGCCGAG CGTGTCTGCC GTGCGCAGCA TGCGCACGCT ATACGAGCAA ACCGGCCGCG CCGACGCGTT CGGGCCGCTC GTCGACGCGC TGCAAAGCCG CTTCGCCGCG CCCGACGATC GCGCGGCGCT TTCGCGGCTG CGCGCCGAGT GGCTCGCGCA GCAGGCCGAC GCGCTCGCGG CGGCGGGCCA GCGCGGGCCG GCCGCGCAGC GCTACGAGGC GTCGCTGCGC ATCGCGCCCG ATGCGCCGTG GGCGCGCTTC GCGCTCGCGC GGCTGTATCG CGACATGGGC CTGCCGCAGC TCGGCCGCGC GGTGATGGAC GAGGGGCTCG CCGGGTCGGA TGCGGCCGAC ATGCGCTACG CGGCCGCGCT CTATCGATAC GCGCTCGACG ACGTCGCGGG GGCGCGGGCT GCGTTCGCGG CGATCGCGGA CGCGAGCCGC ACGCAAGGCA TGCGCGCGTT CGCACGCAAG CTCGATGCCG AGGCGGCGCT TGCCGATGCG CGGGCCGCGC TCGCGCGCGA GGACCGCGCG GCGGCTCATG CCGCGCTCGA GCGCGCGCGC CGCGCCGCGC CCGACGATCC GGACATGCTC GCCGCGGTCG GGGCGCAGTG GATCGACATC GGCGAAATCG AGCGCGGTCT CGCGCCGCTG CGCGACTGGA TCGTCGCGCA TCCGCGCGAG GCCGACGCCG ACGTGCGGCT GCGCTACGGC GACCTGCTCG GCGGCGCGCG GCGCGACGAC GCACTGGCCG CCTGGCTCGA CACGCTGCGC CGCGGCGGCG CGCCGCTCGA CGCCGCGCAA AGCGCGCGCC TCGAGGATCA GGCGCTGCGG CTCGTGCTGC GCGAAACCGA TGACGCGCTC GATCGCGACG ATTACGAGGC GGCCGCGCGC GCGCTCGATC GCGCGAGCCC GGCAGGCAGG GCCGATCGCC GCTATGCGCT CGAGGCGGCC GAGCTCGCGC GCGCGCAGGG GCGCTACGAC ACCGCGCGCG CGGCGCTCGC GCCGCTGCTC GCGTGCGCGC CCGACGACGC CGACGCGCAG CTCGCGCTCG CGCGCATTCT CGACGACGAC GGCGCGCCCG CGGATGCGCT CGCGCTCGTG CGCGACGTGC TCGCGCGCAC GCCGCCCGAC GATGTCGACA CGCAGCTGTC CGCGCTGCGC CGGCTGACCG CGCTGCGCCG CGCGCGGGAT GCGGCCGCGC TCGCCGACAC GCTGCGCGCC GCCTATCCGG CGCGCGCGGA CGTGACGGTC GCGGCCGGGC GGGTCGCGCA GGCGCTCGGC CGCTATGACG ACGCGGCGTC GCTGTACTGG CTGTCGCTCG CGCAGGAACG CGCGACGGGC ATCGCGCCGC GCCGCGACGG CGCGACGCCC GCGCAGGCCG CATGGGCCGA CTTGCAGCAG CGCCGCGATC CGCAGATCGA AGCGGGGTGG CTGCCCGCGT ACAAGTCCGG CGACGAGGGC GTGTCCGCGT ATCGCGCGCA TCAGATGCCG GTTTATCTGC AGATGCCGTA TCGCTACGAC GGTCACGCGT TCGTTCATCT CGACGCGGTG CGGCTCGACG CGGGCACGCT CGATACGAGC GATCCGCGCG CGTATGCGTT CGACACGTTC GCGACGCATC CGGCGCTCGC CGACGCAGCC GCGCCGGGCG GCGCGCTGCG CCAGCGCGCG GCGGGCATCG GCGGCGGCAT CGGCTATCGC GACGACGCGT GGCGCGTCGA CGTCGGCACG ACGCCGCTCG GCTTTCCGGT GCATTACGTC GTCGGCGGCG TGCGCTACCG GTTCGGCGCG GGCCCGGCGA GCGTCACCGT CAGCGCGGCG CGGCGGCCGG AGACGGGCAG CATGCTGTCG TACGCGGGGC TGCGCGATCC GTGGACGGGC GCGACCTGGG GCGGCGTGCG GCGCGATAGC GTCGGCGTGC GCGCATCCGT CGACATCGGC CGCGTGAATC TGTTCGCCGA TCTCGCCGCC GCGCGGCTGA CCGGGCGCAA CGTCGCCGAG AACGCGGCCG TCACGCTGCG CACGGGCTTC ATGGCGCCCG TCTATCGGCG CGCGGACATG CGCGTGAGCG CGGGGCTCGT CGGCAACGCG TGGCACTACG CGCAGAACCT GCGCTACTAC ACGTACGGGC AGGGCGGCTA CTACAGCCCG CAGCGCTACC TGTCGATCGG CATGCCGCTC GAATGGGCGG GGCGGCGCGG CGCGTTCACG TGGGACGTGA CCGCGACGGT CGGCGTGTCG AATTCGTACG AGCGCGATTC GCCGTATTTT CCGAATGGGC TGCCGGGCTC GACGCTCGTC AAGTCCGCGC CGGCGCTCGG CAACCCGGTG TTCTCGCGCG GCTCGACGCG CGGCGTGTCG TTCTGGTACG GGTTCGCGGG CGTCGCCGAG TATCGCGTGA ACGGGCGGCT CGCCGTGGGC GCGCGATTCG ATATCGACCA CGCGCACGAC TACGCGCCGA GCGCCGGGCT GCTGTACGTG CGCTATGCGT TCGACGCGCG CAAGGACAGC GGCGGCTTCT CGCCGTCGCC GGTCCGACTC TATTCGAGCT ACTGA
|
Protein sequence | MTAGPQARRR ARRGAGCAGI AAVAAIAFAV PCFASAAADD VAARWAASER SLADAHACAG VRPAAPGWAV SMEAAAPPPR FVPPGPNTSP GAMQANVEAM SPAAPARAAA MTRASSARLE PAPSASVDAA SANVEAMPPA APAQAAAMTR ASSARLESAP GASVDAASAN VEAMPPAAPA RAGAMARASS ARLEPAPSAS VDAASANVEA MPPAVPAQAA AMAQASPVRP ASVPRASFDA PSAQAGSMPP DTAHRHASAA SSLARTTAPA LPFAPAGFAS AALRAFAPTY PMSLMSPMSP KQTPPVRFVP TSPRGAPATP QPVATPHAPD ADAVRALYAT ARMWANKHRD DLARDALRKA LLIAPRDPAL LAEHTRILLR VGDAKGARAS LERLKQAAPG ALATRQVDDE YRVATSGREE MAQIRLLARS GRGDEAARRI VALFPHGAPA GSLGAEYYQI VASAPDGRAR AIDALRRAVA ADPADADAAT VLAKLLNQRD DTRAQANRLA WALVARPDTD RRASLALWRS VLQSAGRDLA YLDALRAYLT FAPEDDEFRG NAAALEQQRD ARLRLARDPD YIPQQRGLQA LARGDPAAAE PLLARAAAAR AGDPDALGGL GLLRLWQGRH DEARALFVRA AALAPDNRAK WVGLAGTARF WGTLAQGRAA AAQGRPDDAQ RAARAALALD PQSADAKLLL ADSLLARRDW RAAQPLLRAL LDARAPSVSA VRSMRTLYEQ TGRADAFGPL VDALQSRFAA PDDRAALSRL RAEWLAQQAD ALAAAGQRGP AAQRYEASLR IAPDAPWARF ALARLYRDMG LPQLGRAVMD EGLAGSDAAD MRYAAALYRY ALDDVAGARA AFAAIADASR TQGMRAFARK LDAEAALADA RAALAREDRA AAHAALERAR RAAPDDPDML AAVGAQWIDI GEIERGLAPL RDWIVAHPRE ADADVRLRYG DLLGGARRDD ALAAWLDTLR RGGAPLDAAQ SARLEDQALR LVLRETDDAL DRDDYEAAAR ALDRASPAGR ADRRYALEAA ELARAQGRYD TARAALAPLL ACAPDDADAQ LALARILDDD GAPADALALV RDVLARTPPD DVDTQLSALR RLTALRRARD AAALADTLRA AYPARADVTV AAGRVAQALG RYDDAASLYW LSLAQERATG IAPRRDGATP AQAAWADLQQ RRDPQIEAGW LPAYKSGDEG VSAYRAHQMP VYLQMPYRYD GHAFVHLDAV RLDAGTLDTS DPRAYAFDTF ATHPALADAA APGGALRQRA AGIGGGIGYR DDAWRVDVGT TPLGFPVHYV VGGVRYRFGA GPASVTVSAA RRPETGSMLS YAGLRDPWTG ATWGGVRRDS VGVRASVDIG RVNLFADLAA ARLTGRNVAE NAAVTLRTGF MAPVYRRADM RVSAGLVGNA WHYAQNLRYY TYGQGGYYSP QRYLSIGMPL EWAGRRGAFT WDVTATVGVS NSYERDSPYF PNGLPGSTLV KSAPALGNPV FSRGSTRGVS FWYGFAGVAE YRVNGRLAVG ARFDIDHAHD YAPSAGLLYV RYAFDARKDS GGFSPSPVRL YSSY
|
| |