Gene BURPS1106A_A2143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2143 
Symbol 
ID4904708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2095685 
End bp2097943 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content76% 
IMG OID640145248 
Producthypothetical protein 
Protein accessionYP_001076176 
Protein GI126458419 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACT CATGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG 
GCCGGTGCGC GCGCGCTCGC CCGCCTGCGC GCGTTGTGGC GCGTGTGCTC GCGCGCGGCG
CGGCCGCGCG AGCCCGCGCA TGCGGCGAAC CGGCTCGCGA TCGACGCGCT GCCCGACGAG
TGGGCCGAGC TCGCGCCGGG CGGCCTGTAT GCGGTGTACG CGGCGGCGTG CACGAGCGCG
TGCGACGCGC TGATCTGGGA CAGCGTGCGG GACGCGCGCA CGCGCGACGT CACGGTGGTG
CTCGCGCGCG AGCGCGCGGC GGTCGCGACG CGGCTGCGCG AGCTCGGCTT CGTCGACGGC
ATGCACGCGC GCGGCTGGCC GCGGCGGTTG AACGTGCTGG CGATGCCGCC GGGCGATATC
GCGGCGCGCG GCGCGGCGCG TGAGGGCGCG CCCGCGCCCG TGCCTGCGCC CGCGTTCTCA
CGCCTCGTCG GCGGCCTGCG CGCGCTGAGG CGCTACCGCT TCCGTTCGAA CGCGCTGTAT
TTCGTCGAAG GCGCGGAGCG CTGGTTCAGT TGGCACGATC CGGTCGCGCT GACGCACGAG
GGGTGGGCGC TGGCCGGCTG GTGCCGTTCG CATCGGATCG CGCTCGTGCT GCTGATCGAT
CCGCGGGCGT CGCAAGCGGC CGCGAGCCGC GCCGATGCGC GGCACACGGC CCCGCTGCCC
GACGCACCCG ATGCACCGGA CGCATCCGAG GCGGGTGACG GCGTGCTCGG CGGCGCGCGC
GCGGAGCACG GCGCCGACGA TCGCACCACG CTCTTCGTCG CCGATCGCAC GCGCGCCGCG
CGCGGCGGCT TTCACGGCGC GTGCGCGGGC GTCGCGCAAT TGCAGCGCAC GCACGGCGAG
CTGCGCTGGC GGGTCGATTT CTGGCGCTCG CGCGGCGCGG TCGCCACGGG CGAGGTGCGC
GCGCTGCGCT TCATCGGCGA CGGACGGCTC GCGGCCGTGC CGGCGGCCGG CGCGCACGCG
GCGGGCGGCG GCGCGCGGCT CGCGTTCGAC GAGGCGCGCG TCGTCGTCAG CCGCCGCGTG
GTCGAGCGCG AATCGTGGGT GCCGGGCGAT TGGGAAGTCG TCGACGACAA CGACGCGGTG
CTCGCCGCGT GCGCCGGCGC GCATGCGGCG AGCGCGGTGC TGGCGTTTAC CGGCCGCGCG
CAGCTCGAAG CGCTGTGCGC GACGATCCAT GCGCTGCGCC TGCGGTGCGG CGGCGCGCTG
AAGATCGTCG TCGTCGAGCG CGGCGAGGCG ATGCGCCATC AATTCGAGCT GCTCGCGCTG
AACCTCGGCG CGAACCAGGT CGTCGCGCGC AACCTGCCGT TCTCGCGCGT GCTCGCGGTG
CTGCGCTCGC TGCAGGGCCA GTTGCACGCG CGCCCAGTCG CGGCCGACTA TCGGGCTGCG
CTCGCCGCGT CGCTCGGCGA CACGGCGCTC GGCTATCTGC CTGTCGGCGC GTTCTGCTCG
CAGGCGCGCG CGGTGCTCGA GCGCAGCGCG GTGCTCGCGC TGTCGCATAC GCTCGTGAAG
CTGACGCTGC TGCCCGGCGT CGCGCACGCG CACGCGTTGC GCGCGTGCAC GCCGCGCCGC
GCGGGCGACG TGCTGACCGC CGACGCGCAG CACCTGTATC TGTTCCTGTT CGCCTGCGAG
CTCGCCGATG CGAACGACGT GCTCGGCCAC CTCTTCGACG TGCCCGTCGA GCGGATCTCG
GATCGCGTCG TGCATCTCGC GCAGGACAGC ATCGAGCATG AGCTGAATGC GCTCGACGCG
GCGAACCGGC GCGCGCCGAT CGCGGACTAC AGCGATCTCT TTTCGCCGGC GGCGGTGGCG
ACGCGCGCGG CCGGCGCGCG CGCCTCGGCC GGCGCTCCGG CGGCGGCGCG CGACGGCGAA
CTGTCCGCCG AGCCTATGTC GCCGCATGTG CCGCCGCATG CGCCGCATGT GTCGGGCGCG
CCGACGCCGC CGGGCACGCG CGCCGCGGCG CACGGACCAC CGTGGTGCCC GGCGTTCGCG
CTGTCGTCCG CATCGCAGAC CTCGCGGACA TCGCCCGCCT CGGCGCAGAT CGTCGTACCG
CCGCCGCAGG CGCCGTCGAA CGTATCGCAC GCGCCGCTGT CCGCGACGCC GCGCGCGCCG
CGACCGCGCC GACCGCACGA CGCCGGCGCG GTCGCCGGCG TGCGCACCCG CACCGCCACG
CGCGACGCGA TGCCGTTGCG CCCCAGGGAG GCTGAATGA
 
Protein sequence
MNDSCVKSPD PVPGRPASGG AGARALARLR ALWRVCSRAA RPREPAHAAN RLAIDALPDE 
WAELAPGGLY AVYAAACTSA CDALIWDSVR DARTRDVTVV LARERAAVAT RLRELGFVDG
MHARGWPRRL NVLAMPPGDI AARGAAREGA PAPVPAPAFS RLVGGLRALR RYRFRSNALY
FVEGAERWFS WHDPVALTHE GWALAGWCRS HRIALVLLID PRASQAAASR ADARHTAPLP
DAPDAPDASE AGDGVLGGAR AEHGADDRTT LFVADRTRAA RGGFHGACAG VAQLQRTHGE
LRWRVDFWRS RGAVATGEVR ALRFIGDGRL AAVPAAGAHA AGGGARLAFD EARVVVSRRV
VERESWVPGD WEVVDDNDAV LAACAGAHAA SAVLAFTGRA QLEALCATIH ALRLRCGGAL
KIVVVERGEA MRHQFELLAL NLGANQVVAR NLPFSRVLAV LRSLQGQLHA RPVAADYRAA
LAASLGDTAL GYLPVGAFCS QARAVLERSA VLALSHTLVK LTLLPGVAHA HALRACTPRR
AGDVLTADAQ HLYLFLFACE LADANDVLGH LFDVPVERIS DRVVHLAQDS IEHELNALDA
ANRRAPIADY SDLFSPAAVA TRAAGARASA GAPAAARDGE LSAEPMSPHV PPHAPHVSGA
PTPPGTRAAA HGPPWCPAFA LSSASQTSRT SPASAQIVVP PPQAPSNVSH APLSATPRAP
RPRRPHDAGA VAGVRTRTAT RDAMPLRPRE AE