Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2144 |
Symbol | |
ID | 4905293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2097954 |
End bp | 2102678 |
Gene Length | 4725 bp |
Protein Length | 1574 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640145249 |
Product | putative cellulose synthase operon protein C |
Protein accession | YP_001076177 |
Protein GI | 126457527 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGACG CCCGCGCGTT GACGGCGGGG CCGCAGACGA GGCGGCGCGC GCGGCGCGGC GCGGGCTGCG CCGGGATCGC GGCCGTCGCG GCGATCGCGT TCGCCGTGCC ATGCTTCGCG TCCGCCGCGG CGGACGACGT TGCTGCGCGG TGGGCGGCAT CGGAACGATC GCTCGCCGAT GCGCACGCTT GCGCCGGTGT GCGCTCGGCC GCGCCCGGAT GGGCGGTTTC GATGGAGGCG GCGGCGCCGC CGCCGCGGTT CGTGCCGCCG GGCCCGAATA CGTCGCCGGG CGCGATGCAG GCGAATGTCG AAGCGATGCC GCCGGCCGCG CCGGCGCAGG CCGCCGCGAT GACGCGAGCG TCGTCGGCGC GGCTTGAGCC GGCTCCGAGC GCGTCTGTCG ATGCGTCTTC GGCGAATGTC GAGGCGATGC CGCCGGCCGC GCCGGCGCAG GCCGCCGCGA TGACGCGAGC GTCGTCGGCG CGGCTTGAAT CGGCTCCGAG CGCGTCTGTC GATGCGTCTT CGGCGAATGT CGAAGCGATG CCGCCGGCCG TGCCGGCGCG GGCCGCCGCG ATGACGCGAG CGTCGTCGGC GCGGCTTGAG CCGGCTCCGA GCGCGTCTGT CGATGCGTCT TCGGCGAATG TCGAAGCGAT GCCGCCGGCC GCGCCGGCGC GGGCCGCCGC GATGGCGCAG GCGTCGCCTG TTCGGCCTGC GTCGGTGCCG CGCGCGTCGT TCGATGCGCC GTCCGCGCAG GCAGGCTCGA TGCCGCCGGA CACGGCGCAT CGTCACGCGT CGGCGGCCTC CTCGCCCGCG CGCACGACCG CGCCGGCCGG CTTCGCGTCG GCCGCGCTTC GCGCGTTCGC GCCGACGTAT CCGATGTCGC TCATGTCGCC CACGTCGCCG AAGCAAACGC CGCCGGTCCG CTTCGTGCCC ACGTCGCCGC GTGGCGCGCC GGCGACGCCG CAGCCCGTCG CGACGCCGCA CGCGCCCGAC GCTGATGCCG TGCGCGCGCT GTACGCGACG GCGCGCATGT GGGCGAACAA GCATCGCGAC GACCTCGCGC GCGACGCGCT GCGCAAGGCG CTGCTGATCG CGCCGCGCGA TCCGGCGCTG CTCGCCGAGC ACACGCGGAT CCTGCTGCGC GTCGGCGACG CGAAGGGCGC GCGCGCGTCG CTCGAACGCC TGAAGCAGGC GGCGCCCGGC GCGCTCGCGA CGCGCCAGGT CGACGACGAA TACCGCGTCG CGACGAGCGG CCGCGAGGAG ATGGCGCAGA TTCGGCTGCT CGCGCGCAGC GGGCGCGGTG ACGAAGCGGC CCGCCGCATC GTCGCGCTGT TCCCGCACGG CGCGCCGGCG GGCTCGCTCG GCGCCGAGTA TTACCAGATC GTCGCGAGCG CGCCCGACGG CCGCGCGCGC GCGATCGACG CGCTGCGCCG GGCCGTCGCC GCCGATCCGG CGGACGCCGA CGCGGCGACG GTGCTCGCGA AGCTGCTGAA CCAGCGCGAC GACACGCGCG CGCAAGCGAA CCGGCTCGCG TGGGCGCTCG TCGCGCGGCC GGATACGGAC CGGCGCGCCT CGCTCGCGCT GTGGCGCAGC GTGCTGCAGT CGGCCGGGCG CGACCTCGCG TATCTCGACG CATTGCGCGC CTATCTGACG TTCGTACCCG AGGACGACGA GTTTCGCGGC AACGCCGCCG CGCTCGAGCA GCAGCGCGAC GCGCGGCTTC GCCTCGCGCG CGATCCGGAC TACATCGCGC AGCAGCGCGG CCTGCAGGCG CTTGCGCGCG GCGATCCGGC GGCCGCCGAG CCGCTCCTCG CGCGTGCCGC GGCCGCGCGC GCGGGCGATC CGGACGCGCT CGGCGGCCTT GGGCTGCTGC GGCTGCGGCA GGGCCGCCAC GACGAAGCGC GCGCGCTGTT CGTGCGCGCG GCCGCGCTCG CGCCGGACAA TCGCGCGAAG TGGGTGGGGC TCGCGGGCAC CGCGCGCTTC TGGGGCACGC TTGCGCAGGG CCGCGCGGCC GCCGCGCAAG GGCGCCCGGA CGACGCGCAG CGCGCCGCGC GCGCGGCGCT CGCGCTCGAT CCGCAGAGCG CCGACGCGAA GCTGCTGCTC GCCGATTCGC TGCTCGCACG GCGCGACTGG CGCGCCGCGC AGCCGCTGCT GCGCGCGCTG CTCGACGCGC GCGCGCCGAG CATGTCTGCC GTGCGCAGCA TGCGCACGCT ATACGAGCAA ACCGGCCGCG CCGATGCGTT CGGGCCGCTC GTCGACGCGC TGCAAAGCCG CTTCGCCGCG CCCGACGATC GCGCGGCGCT TTCGCGGCTG CGCGCCGAGT GGCTCGCGCA GCAGGCCGAC GCGCTCGCGG CGGCGGGCCA GCGCGGGCCG GCCGCGCAGC GCTACGAGGC GTCGCTGCGC ATCGCGCCCG ATGCGCCGTG GGCGCGCTTC GCGCTCGCGC GGCTGTATCG CGACATGGGC CTGCCGCAGC TCGGCCGCGC GGTGATGGAC GAGGGGCTCG CCGGGTCGGA TGCGGCCGAC ATGCGCTACG CGGCCGCGCT CTATCGATAC GCGCTCGACG ACGTCGCGGG GGCGCGGGCT GCGTTCGCGG CGATCGCGGA CGCGAGCCGC ACGCAAGGCA TGCGCGCGTT CGCACGCAAG CTCGATGCCG AGGCGGCGCT TGCCGATGCG CGGGCCGCGC TCGCGCGCGA GGACCGCGCG GCGGCTCATG CCGCGCTCGA GCGCGCGCGC CGCGCCGCGC CCGACGATCC GGACATGCTC GCCGCGGTCG GGGCGCAGTG GATCGATATC GGCGAAATCG AGCGCGGTCT CGCGCCGCTG CGCGACTGGA TCGTCGCGCA TCCGCGCGAG GCCGACGCCG ACGTGCGGCT GCGCTACGGC GACCTGCTCG GCGGCGCGCG GCGCGACGAC GCACTGGCCG CCTGGCTCGA CACGCTGCGC CGCGGCGGCG CGCCGCTCGA CGCCGCGCAA AGCGCGCGCC TCGAGGATCA GGCGCTGCGG CTCGTGCTGC GCGAAACCGA TGACGCGCTC GATCGCGACG ATTACGAGGC GGCCGCGCGC GCGCTCGATC GCGCGAGCCC GGCGGGCAAG GCCGATCGCC GCTATGCGCT CGAGGCGGCC GAGCTCGCGC GCGCGCAGGG GCGCTACGAC ACCGCGCGCG CGGCGCTCGC GCCGCTGCTC GCGCGCGCGC CCGACGACGC CGACGCGCAG CTCGCGCTCG CGCGCATTCT CGACGACGAC GGCGCGCCCG CGGATGCGCT CGCGCTCGTG CGCGACGTGC TCGCGCGCAC GCCGCCCGAC GATGTCGACA CGCAGCTGTC CGCGCTGCGC CGGCTGACCG CGCTGCGCCG CGCGCGGGAT GCGGCCGCGC TCGCCGACAC GCTGCGCGCC GCCTATCCGG CGCGCGCGGA CGTGACGGTC GCGGCCGGGC GGGTCGCGCA GGCGCTCGGC CGCTATGACG ACGCGGCGTC GCTGTACCGG CTGTCGCTCG CGCAGGAACG CGCGACGGGC ATCGCGCCGC GCCGCGACGG CGCGACGCCC GCGCAGGCCG CATGGGCCGA CTTGCAGCAG CGCCGCGATC CGCAGATCGA AGCGGGGTGG CTGCCCGCGT ACAAGTCCGG CGACGAGGGC GTGTCCGCGT ATCGCGCGCA TCAGATGCCG GTTTATCTGC AGATGCCGTA TCGCTACGAC GGTCACGCGT TCGTTCATCT CGACGCGGTG CGGCTCGACG CGGGCACGCT CGATACGAGC GATCCGCGCG CGTATGCGTT CGACACGTTC GCGACGCATC CGGCGCTCGC CGACGCAGCC GCGCCGGGCG GCGCGCTGCG CCAGCGCGCG GCGGGCATCG GCGGCGGCAT CGGCTATCGC GACGACGCGT GGCGCGTCGA CGTCGGCACG ACGCCGCTCG GCTTTCCGGT GCATTACGTC GTCGGCGGCG TGCGCTACCG GTTCGGCGCG GGCCCGGCGA GCGTCACCGT CAGCGCGGCG CGGCGGCCGG AGACGGGCAG CATGCTGTCG TACGCGGGGC TGCGCGATCC GTGGACGGGC GCGACCTGGG GCGGCGTGCG GCGCGATAGC GTCGGCGTGC GCGCATCCGT CGACATCGGC CGCGTGAATC TGTTCGCCGA TCTCGCCGCC GCGCGGCTGA CCGGGCGCAA CGTCGCCGAG AACGCGGCCG TCACGCTGCG CACGGGCTTC ATGGCGCCCG TCTATCGGCG CGCGGACATG CGCGTGAGCG CGGGGCTCGT CGGCAACGCG TGGCACTACG CGCAGAACCT GCGCTACTAC ACGTACGGGC AGGGCGGCTA CTACAGCCCG CAGCGCTACC TGTCGATCGG CATGCCGCTC GAATGGGCGG GGCGGCGCGG CGCGTTCACG TGGGACGTGA CCGCGACGGT CGGCGTGTCG AATTCGTACG AGCGCGATTC GCCGTATTTT CCGAATGGGC TGCCGGGCTC GACGCTCGTC AAGTCCGCGC CGGCGCTCGG CAACCCGGTG TTCTCGCGCG GCTCGACGCG CGGCGTGTCG TTCTGGTACG GGTTCGCGGG CGTCGCCGAG TATCGCGTGA ACGGGCGGCT CGCCGTGGGC GCGCGATTCG ATATCGACCA CGCGCACGAC TACGCGCCGA GCGCCGGGCT GCTGTACGTG CGCTATGCGT TCGACGCGCG CAAGGACAGC GGCGGCTTCT CGCCGTCGCC GGTCCGACTC TATTCGAGCT ACTGA
|
Protein sequence | MLDARALTAG PQTRRRARRG AGCAGIAAVA AIAFAVPCFA SAAADDVAAR WAASERSLAD AHACAGVRSA APGWAVSMEA AAPPPRFVPP GPNTSPGAMQ ANVEAMPPAA PAQAAAMTRA SSARLEPAPS ASVDASSANV EAMPPAAPAQ AAAMTRASSA RLESAPSASV DASSANVEAM PPAVPARAAA MTRASSARLE PAPSASVDAS SANVEAMPPA APARAAAMAQ ASPVRPASVP RASFDAPSAQ AGSMPPDTAH RHASAASSPA RTTAPAGFAS AALRAFAPTY PMSLMSPTSP KQTPPVRFVP TSPRGAPATP QPVATPHAPD ADAVRALYAT ARMWANKHRD DLARDALRKA LLIAPRDPAL LAEHTRILLR VGDAKGARAS LERLKQAAPG ALATRQVDDE YRVATSGREE MAQIRLLARS GRGDEAARRI VALFPHGAPA GSLGAEYYQI VASAPDGRAR AIDALRRAVA ADPADADAAT VLAKLLNQRD DTRAQANRLA WALVARPDTD RRASLALWRS VLQSAGRDLA YLDALRAYLT FVPEDDEFRG NAAALEQQRD ARLRLARDPD YIAQQRGLQA LARGDPAAAE PLLARAAAAR AGDPDALGGL GLLRLRQGRH DEARALFVRA AALAPDNRAK WVGLAGTARF WGTLAQGRAA AAQGRPDDAQ RAARAALALD PQSADAKLLL ADSLLARRDW RAAQPLLRAL LDARAPSMSA VRSMRTLYEQ TGRADAFGPL VDALQSRFAA PDDRAALSRL RAEWLAQQAD ALAAAGQRGP AAQRYEASLR IAPDAPWARF ALARLYRDMG LPQLGRAVMD EGLAGSDAAD MRYAAALYRY ALDDVAGARA AFAAIADASR TQGMRAFARK LDAEAALADA RAALAREDRA AAHAALERAR RAAPDDPDML AAVGAQWIDI GEIERGLAPL RDWIVAHPRE ADADVRLRYG DLLGGARRDD ALAAWLDTLR RGGAPLDAAQ SARLEDQALR LVLRETDDAL DRDDYEAAAR ALDRASPAGK ADRRYALEAA ELARAQGRYD TARAALAPLL ARAPDDADAQ LALARILDDD GAPADALALV RDVLARTPPD DVDTQLSALR RLTALRRARD AAALADTLRA AYPARADVTV AAGRVAQALG RYDDAASLYR LSLAQERATG IAPRRDGATP AQAAWADLQQ RRDPQIEAGW LPAYKSGDEG VSAYRAHQMP VYLQMPYRYD GHAFVHLDAV RLDAGTLDTS DPRAYAFDTF ATHPALADAA APGGALRQRA AGIGGGIGYR DDAWRVDVGT TPLGFPVHYV VGGVRYRFGA GPASVTVSAA RRPETGSMLS YAGLRDPWTG ATWGGVRRDS VGVRASVDIG RVNLFADLAA ARLTGRNVAE NAAVTLRTGF MAPVYRRADM RVSAGLVGNA WHYAQNLRYY TYGQGGYYSP QRYLSIGMPL EWAGRRGAFT WDVTATVGVS NSYERDSPYF PNGLPGSTLV KSAPALGNPV FSRGSTRGVS FWYGFAGVAE YRVNGRLAVG ARFDIDHAHD YAPSAGLLYV RYAFDARKDS GGFSPSPVRL YSSY
|
| |