Gene BURPS1106A_A2144 details


Gene Information

Locus tag: BURPS1106A_A2144
Symbol:
ID: 4905293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2097954
End bp: 2102678
Gene Length: 4725 bp
Protein Length: 1574 aa
Translation table: 11
GC content: 76%
IMG OID: 640145249
Product: putative cellulose synthase operon protein C
Protein accession: YP_001076177
Protein GI: 126457527
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3118] Thioredoxin domain-containing protein
TIGRFAM ID:
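
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2097954
end_bp = 2102678
gene_length_bp = 4725
protein_length_aa = 1574

# Assumption: coordinates are 1-based and inclusive on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, which encodes no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks hold: the span is 4725 bp, which is 1575 codons, and dropping the stop codon leaves 1574 residues.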


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTCGACG CCCGCGCGTT GACGGCGGGG CCGCAGACGA GGCGGCGCGC GCGGCGCGGC 
GCGGGCTGCG CCGGGATCGC GGCCGTCGCG GCGATCGCGT TCGCCGTGCC ATGCTTCGCG
TCCGCCGCGG CGGACGACGT TGCTGCGCGG TGGGCGGCAT CGGAACGATC GCTCGCCGAT
GCGCACGCTT GCGCCGGTGT GCGCTCGGCC GCGCCCGGAT GGGCGGTTTC GATGGAGGCG
GCGGCGCCGC CGCCGCGGTT CGTGCCGCCG GGCCCGAATA CGTCGCCGGG CGCGATGCAG
GCGAATGTCG AAGCGATGCC GCCGGCCGCG CCGGCGCAGG CCGCCGCGAT GACGCGAGCG
TCGTCGGCGC GGCTTGAGCC GGCTCCGAGC GCGTCTGTCG ATGCGTCTTC GGCGAATGTC
GAGGCGATGC CGCCGGCCGC GCCGGCGCAG GCCGCCGCGA TGACGCGAGC GTCGTCGGCG
CGGCTTGAAT CGGCTCCGAG CGCGTCTGTC GATGCGTCTT CGGCGAATGT CGAAGCGATG
CCGCCGGCCG TGCCGGCGCG GGCCGCCGCG ATGACGCGAG CGTCGTCGGC GCGGCTTGAG
CCGGCTCCGA GCGCGTCTGT CGATGCGTCT TCGGCGAATG TCGAAGCGAT GCCGCCGGCC
GCGCCGGCGC GGGCCGCCGC GATGGCGCAG GCGTCGCCTG TTCGGCCTGC GTCGGTGCCG
CGCGCGTCGT TCGATGCGCC GTCCGCGCAG GCAGGCTCGA TGCCGCCGGA CACGGCGCAT
CGTCACGCGT CGGCGGCCTC CTCGCCCGCG CGCACGACCG CGCCGGCCGG CTTCGCGTCG
GCCGCGCTTC GCGCGTTCGC GCCGACGTAT CCGATGTCGC TCATGTCGCC CACGTCGCCG
AAGCAAACGC CGCCGGTCCG CTTCGTGCCC ACGTCGCCGC GTGGCGCGCC GGCGACGCCG
CAGCCCGTCG CGACGCCGCA CGCGCCCGAC GCTGATGCCG TGCGCGCGCT GTACGCGACG
GCGCGCATGT GGGCGAACAA GCATCGCGAC GACCTCGCGC GCGACGCGCT GCGCAAGGCG
CTGCTGATCG CGCCGCGCGA TCCGGCGCTG CTCGCCGAGC ACACGCGGAT CCTGCTGCGC
GTCGGCGACG CGAAGGGCGC GCGCGCGTCG CTCGAACGCC TGAAGCAGGC GGCGCCCGGC
GCGCTCGCGA CGCGCCAGGT CGACGACGAA TACCGCGTCG CGACGAGCGG CCGCGAGGAG
ATGGCGCAGA TTCGGCTGCT CGCGCGCAGC GGGCGCGGTG ACGAAGCGGC CCGCCGCATC
GTCGCGCTGT TCCCGCACGG CGCGCCGGCG GGCTCGCTCG GCGCCGAGTA TTACCAGATC
GTCGCGAGCG CGCCCGACGG CCGCGCGCGC GCGATCGACG CGCTGCGCCG GGCCGTCGCC
GCCGATCCGG CGGACGCCGA CGCGGCGACG GTGCTCGCGA AGCTGCTGAA CCAGCGCGAC
GACACGCGCG CGCAAGCGAA CCGGCTCGCG TGGGCGCTCG TCGCGCGGCC GGATACGGAC
CGGCGCGCCT CGCTCGCGCT GTGGCGCAGC GTGCTGCAGT CGGCCGGGCG CGACCTCGCG
TATCTCGACG CATTGCGCGC CTATCTGACG TTCGTACCCG AGGACGACGA GTTTCGCGGC
AACGCCGCCG CGCTCGAGCA GCAGCGCGAC GCGCGGCTTC GCCTCGCGCG CGATCCGGAC
TACATCGCGC AGCAGCGCGG CCTGCAGGCG CTTGCGCGCG GCGATCCGGC GGCCGCCGAG
CCGCTCCTCG CGCGTGCCGC GGCCGCGCGC GCGGGCGATC CGGACGCGCT CGGCGGCCTT
GGGCTGCTGC GGCTGCGGCA GGGCCGCCAC GACGAAGCGC GCGCGCTGTT CGTGCGCGCG
GCCGCGCTCG CGCCGGACAA TCGCGCGAAG TGGGTGGGGC TCGCGGGCAC CGCGCGCTTC
TGGGGCACGC TTGCGCAGGG CCGCGCGGCC GCCGCGCAAG GGCGCCCGGA CGACGCGCAG
CGCGCCGCGC GCGCGGCGCT CGCGCTCGAT CCGCAGAGCG CCGACGCGAA GCTGCTGCTC
GCCGATTCGC TGCTCGCACG GCGCGACTGG CGCGCCGCGC AGCCGCTGCT GCGCGCGCTG
CTCGACGCGC GCGCGCCGAG CATGTCTGCC GTGCGCAGCA TGCGCACGCT ATACGAGCAA
ACCGGCCGCG CCGATGCGTT CGGGCCGCTC GTCGACGCGC TGCAAAGCCG CTTCGCCGCG
CCCGACGATC GCGCGGCGCT TTCGCGGCTG CGCGCCGAGT GGCTCGCGCA GCAGGCCGAC
GCGCTCGCGG CGGCGGGCCA GCGCGGGCCG GCCGCGCAGC GCTACGAGGC GTCGCTGCGC
ATCGCGCCCG ATGCGCCGTG GGCGCGCTTC GCGCTCGCGC GGCTGTATCG CGACATGGGC
CTGCCGCAGC TCGGCCGCGC GGTGATGGAC GAGGGGCTCG CCGGGTCGGA TGCGGCCGAC
ATGCGCTACG CGGCCGCGCT CTATCGATAC GCGCTCGACG ACGTCGCGGG GGCGCGGGCT
GCGTTCGCGG CGATCGCGGA CGCGAGCCGC ACGCAAGGCA TGCGCGCGTT CGCACGCAAG
CTCGATGCCG AGGCGGCGCT TGCCGATGCG CGGGCCGCGC TCGCGCGCGA GGACCGCGCG
GCGGCTCATG CCGCGCTCGA GCGCGCGCGC CGCGCCGCGC CCGACGATCC GGACATGCTC
GCCGCGGTCG GGGCGCAGTG GATCGATATC GGCGAAATCG AGCGCGGTCT CGCGCCGCTG
CGCGACTGGA TCGTCGCGCA TCCGCGCGAG GCCGACGCCG ACGTGCGGCT GCGCTACGGC
GACCTGCTCG GCGGCGCGCG GCGCGACGAC GCACTGGCCG CCTGGCTCGA CACGCTGCGC
CGCGGCGGCG CGCCGCTCGA CGCCGCGCAA AGCGCGCGCC TCGAGGATCA GGCGCTGCGG
CTCGTGCTGC GCGAAACCGA TGACGCGCTC GATCGCGACG ATTACGAGGC GGCCGCGCGC
GCGCTCGATC GCGCGAGCCC GGCGGGCAAG GCCGATCGCC GCTATGCGCT CGAGGCGGCC
GAGCTCGCGC GCGCGCAGGG GCGCTACGAC ACCGCGCGCG CGGCGCTCGC GCCGCTGCTC
GCGCGCGCGC CCGACGACGC CGACGCGCAG CTCGCGCTCG CGCGCATTCT CGACGACGAC
GGCGCGCCCG CGGATGCGCT CGCGCTCGTG CGCGACGTGC TCGCGCGCAC GCCGCCCGAC
GATGTCGACA CGCAGCTGTC CGCGCTGCGC CGGCTGACCG CGCTGCGCCG CGCGCGGGAT
GCGGCCGCGC TCGCCGACAC GCTGCGCGCC GCCTATCCGG CGCGCGCGGA CGTGACGGTC
GCGGCCGGGC GGGTCGCGCA GGCGCTCGGC CGCTATGACG ACGCGGCGTC GCTGTACCGG
CTGTCGCTCG CGCAGGAACG CGCGACGGGC ATCGCGCCGC GCCGCGACGG CGCGACGCCC
GCGCAGGCCG CATGGGCCGA CTTGCAGCAG CGCCGCGATC CGCAGATCGA AGCGGGGTGG
CTGCCCGCGT ACAAGTCCGG CGACGAGGGC GTGTCCGCGT ATCGCGCGCA TCAGATGCCG
GTTTATCTGC AGATGCCGTA TCGCTACGAC GGTCACGCGT TCGTTCATCT CGACGCGGTG
CGGCTCGACG CGGGCACGCT CGATACGAGC GATCCGCGCG CGTATGCGTT CGACACGTTC
GCGACGCATC CGGCGCTCGC CGACGCAGCC GCGCCGGGCG GCGCGCTGCG CCAGCGCGCG
GCGGGCATCG GCGGCGGCAT CGGCTATCGC GACGACGCGT GGCGCGTCGA CGTCGGCACG
ACGCCGCTCG GCTTTCCGGT GCATTACGTC GTCGGCGGCG TGCGCTACCG GTTCGGCGCG
GGCCCGGCGA GCGTCACCGT CAGCGCGGCG CGGCGGCCGG AGACGGGCAG CATGCTGTCG
TACGCGGGGC TGCGCGATCC GTGGACGGGC GCGACCTGGG GCGGCGTGCG GCGCGATAGC
GTCGGCGTGC GCGCATCCGT CGACATCGGC CGCGTGAATC TGTTCGCCGA TCTCGCCGCC
GCGCGGCTGA CCGGGCGCAA CGTCGCCGAG AACGCGGCCG TCACGCTGCG CACGGGCTTC
ATGGCGCCCG TCTATCGGCG CGCGGACATG CGCGTGAGCG CGGGGCTCGT CGGCAACGCG
TGGCACTACG CGCAGAACCT GCGCTACTAC ACGTACGGGC AGGGCGGCTA CTACAGCCCG
CAGCGCTACC TGTCGATCGG CATGCCGCTC GAATGGGCGG GGCGGCGCGG CGCGTTCACG
TGGGACGTGA CCGCGACGGT CGGCGTGTCG AATTCGTACG AGCGCGATTC GCCGTATTTT
CCGAATGGGC TGCCGGGCTC GACGCTCGTC AAGTCCGCGC CGGCGCTCGG CAACCCGGTG
TTCTCGCGCG GCTCGACGCG CGGCGTGTCG TTCTGGTACG GGTTCGCGGG CGTCGCCGAG
TATCGCGTGA ACGGGCGGCT CGCCGTGGGC GCGCGATTCG ATATCGACCA CGCGCACGAC
TACGCGCCGA GCGCCGGGCT GCTGTACGTG CGCTATGCGT TCGACGCGCG CAAGGACAGC
GGCGGCTTCT CGCCGTCGCC GGTCCGACTC TATTCGAGCT ACTGA
 
Protein sequence
MLDARALTAG PQTRRRARRG AGCAGIAAVA AIAFAVPCFA SAAADDVAAR WAASERSLAD 
AHACAGVRSA APGWAVSMEA AAPPPRFVPP GPNTSPGAMQ ANVEAMPPAA PAQAAAMTRA
SSARLEPAPS ASVDASSANV EAMPPAAPAQ AAAMTRASSA RLESAPSASV DASSANVEAM
PPAVPARAAA MTRASSARLE PAPSASVDAS SANVEAMPPA APARAAAMAQ ASPVRPASVP
RASFDAPSAQ AGSMPPDTAH RHASAASSPA RTTAPAGFAS AALRAFAPTY PMSLMSPTSP
KQTPPVRFVP TSPRGAPATP QPVATPHAPD ADAVRALYAT ARMWANKHRD DLARDALRKA
LLIAPRDPAL LAEHTRILLR VGDAKGARAS LERLKQAAPG ALATRQVDDE YRVATSGREE
MAQIRLLARS GRGDEAARRI VALFPHGAPA GSLGAEYYQI VASAPDGRAR AIDALRRAVA
ADPADADAAT VLAKLLNQRD DTRAQANRLA WALVARPDTD RRASLALWRS VLQSAGRDLA
YLDALRAYLT FVPEDDEFRG NAAALEQQRD ARLRLARDPD YIAQQRGLQA LARGDPAAAE
PLLARAAAAR AGDPDALGGL GLLRLRQGRH DEARALFVRA AALAPDNRAK WVGLAGTARF
WGTLAQGRAA AAQGRPDDAQ RAARAALALD PQSADAKLLL ADSLLARRDW RAAQPLLRAL
LDARAPSMSA VRSMRTLYEQ TGRADAFGPL VDALQSRFAA PDDRAALSRL RAEWLAQQAD
ALAAAGQRGP AAQRYEASLR IAPDAPWARF ALARLYRDMG LPQLGRAVMD EGLAGSDAAD
MRYAAALYRY ALDDVAGARA AFAAIADASR TQGMRAFARK LDAEAALADA RAALAREDRA
AAHAALERAR RAAPDDPDML AAVGAQWIDI GEIERGLAPL RDWIVAHPRE ADADVRLRYG
DLLGGARRDD ALAAWLDTLR RGGAPLDAAQ SARLEDQALR LVLRETDDAL DRDDYEAAAR
ALDRASPAGK ADRRYALEAA ELARAQGRYD TARAALAPLL ARAPDDADAQ LALARILDDD
GAPADALALV RDVLARTPPD DVDTQLSALR RLTALRRARD AAALADTLRA AYPARADVTV
AAGRVAQALG RYDDAASLYR LSLAQERATG IAPRRDGATP AQAAWADLQQ RRDPQIEAGW
LPAYKSGDEG VSAYRAHQMP VYLQMPYRYD GHAFVHLDAV RLDAGTLDTS DPRAYAFDTF
ATHPALADAA APGGALRQRA AGIGGGIGYR DDAWRVDVGT TPLGFPVHYV VGGVRYRFGA
GPASVTVSAA RRPETGSMLS YAGLRDPWTG ATWGGVRRDS VGVRASVDIG RVNLFADLAA
ARLTGRNVAE NAAVTLRTGF MAPVYRRADM RVSAGLVGNA WHYAQNLRYY TYGQGGYYSP
QRYLSIGMPL EWAGRRGAFT WDVTATVGVS NSYERDSPYF PNGLPGSTLV KSAPALGNPV
FSRGSTRGVS FWYGFAGVAE YRVNGRLAVG ARFDIDHAHD YAPSAGLLYV RYAFDARKDS
GGFSPSPVRL YSSY
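
Both sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the codon table is deliberately partial, covering only the codons needed for the first ten residues. It also illustrates that the gene begins with GTG, an alternative start codon in translation table 11 that is read as methionine when it initiates translation.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and first block of the protein sequence above.
first_dna_line = "GTGCTCGACG CCCGCGCGTT GACGGCGGGG CCGCAGACGA GGCGGCGCGC GCGGCGCGGC"
first_protein_block = "MLDARALTAG"

# Partial codon table (assumption: only the codons used below are listed).
partial_table = {
    "CTC": "L", "GAC": "D", "GCC": "A", "CGC": "R",
    "GCG": "A", "TTG": "L", "ACG": "T", "GGG": "G",
}

# Start codons of translation table 11; an initiating codon is read as M.
table11_starts = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

dna = clean_sequence(first_dna_line)
codons = [dna[i:i + 3] for i in range(0, 30, 3)]  # first ten codons

residues = []
for index, codon in enumerate(codons):
    if index == 0 and codon in table11_starts:
        residues.append("M")
    else:
        residues.append(partial_table[codon])
translated = "".join(residues)

print(translated == first_protein_block)
print(f"GC fraction of first line: {gc_fraction(dna):.2f}")
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field in the record; a full translation would need a complete codon table, for example from a sequence analysis library.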